Merge pull request #401 from dssg/push_to_base
Refactor functionality to ExperimentBase class [Resolves #400]
thcrock authored May 15, 2018
2 parents 6da7dc2 + ec80f3e commit df187f2
Showing 28 changed files with 673 additions and 645 deletions.
125 changes: 1 addition & 124 deletions README.rst
@@ -60,134 +60,11 @@ The first phase implemented in Triage is the ``Experiment``. An experiment repre
Documentation
---------------------------
- `Dirty Duck Tutorial <https://dssg.github.io/dirtyduck/>`_
- `Running an Experiment <https://dssg.github.io/triage/experiments/running>`_
- `Experiment Algorithm Deep Dive <https://dssg.github.io/triage/experiments/algorithm>`_
- `Experiment Config v5 Upgrade Guide <https://dssg.github.io/triage/experiments/upgrade-to-v5>`_


Instantiating an Experiment
---------------------------

An ``Experiment`` class, once instantiated, provides access to a variety of useful pieces of information about the experiment, as well as the ability to run it and get results.

First, we'll look at how to instantiate the Experiment::

    SingleThreadedExperiment(
        config=experiment_config,
        db_engine=sqlalchemy.create_engine(...),
        model_storage_class=FSModelStorageEngine,
        project_path='/path/to/directory/to/save/data'
    )

These lines are a bit dense: what is happening here?

- ``SingleThreadedExperiment``: There are different Experiment classes available in ``triage.experiments`` to use, and they each represent a different way of executing the experiment, which we'll talk about in more detail later. The simplest (but slowest) is the ``SingleThreadedExperiment``.
- ``config=experiment_config``: The bulk of the work in designing an experiment goes into creating this experiment configuration. An up-to-date example is at `example_experiment_config.yaml <example_experiment_config.yaml>`_; more detailed instructions on each section are located in the example file. It's generally easiest to keep the configuration in a file (or several files you assemble) like that YAML file, but it is passed to the Experiment constructor as a dict, so you can store it however you wish.
- ``db_engine=sqlalchemy.create_engine(...)``: A SQLAlchemy database engine. This will be used both for querying your source tables and writing results metadata.
- ``model_storage_class=FSModelStorageEngine``: A model storage engine class. The library that Triage uses for model training and evaluation, `catwalk <https://github.com/dssg/catwalk>`_, provides multiple classes that handle storing trained models in different mediums, such as on the local filesystem or Amazon S3. We recommend starting with ``catwalk.storage.FSModelStorageEngine`` to save models on the local filesystem.
- ``project_path='/path/to/directory/to/save/data'``: The path to where you would like to store design matrices and trained models. May be an s3 path (e.g. s3://bucket-name/project-directory), in which case s3 will be used to store matrices and models.

With that in mind, a more full version of the experiment instantiation might look like this::

    import sqlalchemy
    import yaml
    import logging

    from triage.component.catwalk.storage import FSModelStorageEngine
    from triage.experiments import SingleThreadedExperiment

    with open('my_experiment_config.yaml') as f:
        experiment_config = yaml.load(f)
    with open('my_database_creds') as f:
        db_connection_string = yaml.load(f)['db_connection_string']

    logging.basicConfig(level=logging.INFO)

    experiment = SingleThreadedExperiment(
        config=experiment_config,
        db_engine=sqlalchemy.create_engine(db_connection_string),
        model_storage_class=FSModelStorageEngine,
        project_path='/home/research/myproject'
    )

Validating an Experiment
------------------------

Configuring an experiment is complicated, and running one can take a long time as data scales up. If any values are misconfigured, it helps a lot to catch them before running the Experiment, so we recommend calling the Experiment's ``.validate()`` method first. If any problems are detectable in your Experiment, either in its configuration or in the database tables it references, this method will throw an exception. For instance, if a feature aggregation refers to a 'cat_complaints' table that doesn't exist, you'll see something like this::

    experiment.validate()

    (Pdb) experiment.validate()
    *** ValueError: from_obj query does not run.
    from_obj: "cat_complaints"
    Full error: (psycopg2.ProgrammingError) relation "cat_complaints" does not exist
    LINE 1: explain select * from cat_complaints
                                  ^
    [SQL: 'explain select * from cat_complaints']

If the validation runs without any errors, you should see a success message (either in your log or console). At this point, the Experiment should be ready to run.

We'd like to add more validations for common misconfiguration problems over time. If you got an unexpected error that turned out to be related to a confusing configuration value, help us out by adding to the `validation module <triage/experiments/validate.py>`_ and submitting a pull request!

Running an Experiment
---------------------

Once you're at this point, running the experiment is simple::

    experiment.run()

This will run the entire experiment. This could take a while, so we recommend checking logging messages (INFO level will catch a lot of useful information) and keeping an eye on its progress.

Evaluating results of an Experiment
-----------------------------------

After the experiment runs, a results schema will be created and populated in the configured database with the following tables:

* experiments - The experiment configuration and a hash
* models - A model describes a trained classifier; you'll have one row for each trained model file that gets saved.
* model_groups - A model group refers to all models that share parameters like classifier type, hyperparameters, etc., but *have different training windows*. Look at these to see how classifiers perform over different training windows.
* feature_importances - The sklearn feature importance results for each trained model
* predictions - Prediction probabilities for entities generated against trained models
* evaluations - Metric scores of trained models over given testing windows

Here's an example query, which returns the top 10 model groups by precision at the top 100 entities::

    select
        model_groups.model_group_id,
        model_groups.model_type,
        model_groups.model_parameters,
        max(evaluations.value) as max_precision
    from model_groups
        join models using (model_group_id)
        join evaluations using (model_id)
    where
        metric = 'precision@'
        and parameter = '100_abs'
    group by 1,2,3
    order by 4 desc
    limit 10

The resulting schema is also readable by `Tyra <https://github.com/tyra>`_, our model evaluation webapp.

Restarting an Experiment
------------------------

If an experiment fails for any reason, you can restart it. Each matrix and each model file is saved with a filename matching a hash of its unique attributes, so when the experiment is rerun, it will by default reuse the matrix or model instead of rebuilding it. If you would like to change this behavior and replace existing versions of matrices and models, set ``replace=True`` in the Experiment constructor.
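
For example, a rerun that rebuilds everything from scratch only adds the ``replace`` keyword to the instantiation shown earlier (a minimal sketch; the other arguments are unchanged from the example above)::

    experiment = SingleThreadedExperiment(
        config=experiment_config,
        db_engine=sqlalchemy.create_engine(db_connection_string),
        model_storage_class=FSModelStorageEngine,
        project_path='/home/research/myproject',
        replace=True,  # rebuild matrices and models even if cached copies exist
    )
    experiment.run()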

Inspecting an Experiment before running
---------------------------------------

Before you run an experiment, you can inspect properties of the Experiment object to ensure that it is configured in the way you want. Some examples:

- ``experiment.all_as_of_times`` for debugging temporal config. This will show all dates at which features and labels will be calculated.
- ``experiment.feature_dicts`` will output a list of feature dictionaries, representing the feature tables and columns configured in this experiment.
- ``experiment.matrix_build_tasks`` will output a list representing each matrix that will be built.
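
For example, a quick pre-flight check might print a few of these properties before kicking off a full run (a minimal sketch using the properties listed above)::

    # All as-of dates implied by the temporal config
    print(experiment.all_as_of_times)

    # Feature tables and columns that will be generated
    for feature_dict in experiment.feature_dicts:
        print(feature_dict)

    # How many matrices the experiment plans to build
    print(len(experiment.matrix_build_tasks))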

Experiment Classes
------------------

- *SingleThreadedExperiment*: An experiment that performs all tasks serially in a single thread. Good for simple use on small datasets, or for understanding the general flow of data through a pipeline.
- *MultiCoreExperiment*: An experiment that makes use of the multiprocessing library to parallelize various time-consuming steps. Takes an ``n_processes`` keyword argument to control how many workers to use.
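
For instance, switching from the single-threaded class to the parallel one only changes the import and adds one keyword argument (a sketch; ``n_processes=4`` is an arbitrary illustrative value)::

    from triage.experiments import MultiCoreExperiment

    experiment = MultiCoreExperiment(
        config=experiment_config,
        db_engine=sqlalchemy.create_engine(db_connection_string),
        model_storage_class=FSModelStorageEngine,
        project_path='/home/research/myproject',
        n_processes=4,  # number of worker processes for the parallelizable steps
    )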

Background
==========

4 changes: 0 additions & 4 deletions docs/mkdocs.yml
@@ -19,7 +19,3 @@ pages:
- Feature Definition Deep Dive: experiments/features.md
- Experiment Algorithm: experiments/algorithm.md
- Experiment Architecture: experiments/architecture.md
- API Docs:
- ExperimentBase: triage.experiments.base.md
- SingleThreadedExperiment: triage.experiments.singlethreaded.md
- MultiCoreExperiment: triage.experiments.multicore.md
5 changes: 4 additions & 1 deletion docs/sources/experiments/defining.md
@@ -1,3 +1,6 @@
# Defining an Experiment

This doc is coming soon. In the meantime, check out the [Example Experiment Definition](https://github.com/dssg/triage/blob/master/example_experiment_config.yaml) for an overview of the different sections of an experiment definition.
This doc is coming soon. In the meantime, check out:

- [Example Experiment Definition](https://github.com/dssg/triage/blob/master/example_experiment_config.yaml) for an overview of the different sections of an experiment definition.
- [Dirty Duck](https://dssg.github.io/dirtyduck) for a beginning-to-end walkthrough of Triage, including deep dives on creating experiment configuration.