
Add benchmark testing apis #66

Merged 43 commits into master on Dec 3, 2020
Conversation

ChengFR (Collaborator) commented Nov 30, 2020

The following tasks have been accomplished:

sarahmish and others added 30 commits October 15, 2020 22:36
CLAassistant commented Nov 30, 2020

CLA assistant check: all committers have signed the CLA.

sarahmish (Collaborator) left a comment:

Looks great @ChengFR! I have some minor comments; if you would kindly address them.

I would also like to leave the Python notebooks that are on the other branch (1, 2, 3, and 4) out of this PR and keep it dedicated to benchmarking (benchmarking and tasks can be kept for illustration and usage purposes).

cardea/benchmark/benchmark.py — five review threads (resolved)
```python
raise TypeError("Unsupported file type {}.".format(extension))
pipeline.set_hyperparameters(init_hyperparameters)

# Load Dataset.
```
sarahmish (Collaborator) commented:

Load/Construct Dataset. In the future, the "data_loader" and "problem_definition" will construct the feature matrix. I am thinking we can give feature_matrix a more generic name, like dataset, which can be used at all stages to generate the feature matrix.

Dataset can be:

  • a feature matrix (pandas.DataFrame).
  • an entityset (featuretools.EntitySet).
  • raw data (a list of pandas.DataFrame).

This is different from the consensus we reached in the design document, but it is more generic. Otherwise, always assume I am reading my data from a path_to_dataset in the task itself.

We can proceed without making any changes; I am just marking this here in case we refer back to this idea.
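A minimal sketch of the dispatch this proposal implies; resolve_feature_matrix and its branches are illustrative, not Cardea's actual API:

```python
import pandas as pd

def resolve_feature_matrix(dataset):
    """Turn any of the three accepted dataset forms into a feature matrix.

    Hypothetical helper illustrating the proposal above; the featurization
    branches are placeholders for data_loader / problem_definition logic.
    """
    if isinstance(dataset, pd.DataFrame):
        # Already a feature matrix: nothing to do.
        return dataset
    if isinstance(dataset, list):
        # Raw data (list of pandas.DataFrame): would be loaded and featurized.
        raise NotImplementedError("featurize raw tables here")
    # Otherwise assume an entityset (featuretools.EntitySet): would run DFS.
    raise NotImplementedError("run deep feature synthesis on the entityset here")
```

With a single isinstance dispatch, every stage can accept the same dataset argument regardless of how far along the featurization is.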

ChengFR (Collaborator, Author) replied:

The feature_matrix parameter of evaluate_task is reserved. When we run multiple pipelines on the same dataset built from an entityset or from raw data, we do not want to run the featurization multiple times. In that case, the upper-level function benchmark performs the featurization once and passes the resulting feature_matrix to evaluate_task.

So I don't think it is necessary to make it more general.
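The featurize-once pattern described here can be sketched as follows; apart from the names benchmark and evaluate_task, everything is an illustrative stand-in, not Cardea's actual implementation:

```python
CALLS = {"featurize": 0}

def featurize(task):
    """Stand-in for the expensive entityset/raw-data featurization."""
    CALLS["featurize"] += 1
    return [[1.0, 2.0], [3.0, 4.0]]  # toy feature matrix

def score(pipeline, feature_matrix):
    """Stand-in for fitting and scoring one pipeline."""
    return len(feature_matrix)

def evaluate_task(pipeline, task, feature_matrix=None):
    # feature_matrix is reserved: when the caller already featurized,
    # reuse its result instead of recomputing it.
    if feature_matrix is None:
        feature_matrix = featurize(task)
    return score(pipeline, feature_matrix)

def benchmark(pipelines, task):
    # Featurize exactly once, then share the matrix across all pipelines.
    feature_matrix = featurize(task)
    return {name: evaluate_task(p, task, feature_matrix=feature_matrix)
            for name, p in pipelines.items()}
```

Running several pipelines through benchmark triggers a single featurize call, while evaluate_task still works standalone when no matrix is passed.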

Comment on lines +13 to +14:

```python
# Add customized primitives from a local source.
add_primitives_path(MLBLOCKS_PRIMITIVES)
```
Collaborator

@sarahmish sarahmish Nov 30, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this necessary? I tried running the pipeline directly and it was fine.

Our definition of MLBLOCKS_PRIMITIVES is in the entry point (see setup.py), so it should work.
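For context, MLBlocks can discover custom primitives through a setup.py entry point in the mlblocks group; the comment presumably refers to a fragment along these lines (the cardea:MLBLOCKS_PRIMITIVES target is an assumption about this repository's layout):

```python
# setup.py (fragment) -- registers the primitives location with MLBlocks,
# so add_primitives_path() is not needed at runtime in newer MLBlocks.
setup(
    # ...
    entry_points={
        'mlblocks': [
            'primitives=cardea:MLBLOCKS_PRIMITIVES',
        ],
    },
)
```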

ChengFR (Collaborator, Author) replied:

It seems that in my testing environment the customized primitives cannot be found without this line. Can you double-check this problem?

sarahmish (Collaborator) replied:

After investigating this, the entry_point mentioned is only supported in newer versions of MLBlocks. We can keep this implementation for now, but mark it to be removed after the upgrade.

Comment on lines +13 to +14:

```python
PIPELINE_DIR = './cardea/pipelines'
VERIFIED_DIR = './benchmark/verified'
```

sarahmish (Collaborator) commented:

I would use another approach (for the same reason as earlier):

```python
import os

PIPELINE_DIR = os.path.join(os.path.dirname(__file__), 'pipelines')
VERIFIED_DIR = os.path.join(
    os.path.dirname(os.path.dirname(__file__)), 'benchmark', 'verified')
```

ChengFR (Collaborator, Author) replied:

Current paths in the task configurations are all relative paths from the project root. Since we will share the benchmarking results, this design ensures the tasks are runnable on other people's devices. Are you suggesting we use absolute paths instead (which are more readable but not runnable across devices)?

sarahmish (Collaborator) replied:

I see your intention, but the problem is that this is difficult to trace. My suggestion would be to use the absolute path; users should follow the same configuration of Cardea.

I imagine that PIPELINE_DIR will be defined in the same manner in all cases, but VERIFIED_DIR will change depending on the user. Is this the case you are referring to by "not runnable on different devices"?

If that is what you mean, then I think it is not an issue: users can always override it to specify their own path. Generally speaking, absolute paths are also more maintainable and traceable.
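The compromise sketched in this thread (a module-anchored absolute default that a user can override) might look like the following; get_verified_dir is a hypothetical helper, not part of Cardea:

```python
import os

def get_verified_dir(module_file, user_dir=None):
    """Return user_dir when given; otherwise derive an absolute default
    from the location of the module (e.g. pass __file__ from cardea)."""
    if user_dir is not None:
        return user_dir
    root = os.path.dirname(os.path.abspath(module_file))
    return os.path.join(root, 'benchmark', 'verified')
```

Deriving the default from __file__ keeps the path absolute and traceable while still resolving correctly wherever the package happens to be installed.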

sarahmish (Collaborator) left a comment:

Looks good @ChengFR!

@ChengFR ChengFR merged commit 38987e9 into master Dec 3, 2020
Development

Successfully merging this pull request may close these issues:

  • Modeler load pipelines instead of lists of primitives
  • Benchmark testing apis
  • Primitive setup