Run pipelines on datasets #10

jacob-rosenthal · 2020-10-08T18:07:11Z

Implement a method to run a preprocessing pipeline on an entire dataset.
This should basically be a convenience function for running the pipeline on each individual image.

Pseudocode:

mydataset = pathml.datasets.PESO.download()
mypipeline = pathml.preprocessing.default_pipeline()

mypipeline.run(mydataset)
# should be equivalent to:
for wsi in mydataset:
    mypipeline.run(wsi)
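
To make the intended equivalence concrete, here is a minimal runnable sketch. The `Pipeline` class, its `run_single` method, and the list-or-tuple check for "dataset" are hypothetical stand-ins for illustration only, not the actual PathML API:

```python
from typing import Any, Callable, List


class Pipeline:
    """Hypothetical stand-in for a preprocessing pipeline (not the PathML class)."""

    def __init__(self, steps: List[Callable[[Any], Any]]):
        self.steps = steps

    def run_single(self, image: Any) -> Any:
        # Apply each step, in order, to one image.
        for step in self.steps:
            image = step(image)
        return image

    def run(self, target: Any) -> Any:
        # Assumption: a list or tuple is a dataset; anything else is one image.
        if isinstance(target, (list, tuple)):
            return [self.run_single(image) for image in target]
        return self.run_single(target)
```

With this shape, `pipeline.run(dataset)` returns the same results as calling `pipeline.run(image)` on each element in a loop, which is the behavior the pseudocode asks for.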

jacob-rosenthal · 2020-10-08T19:01:48Z

This should ideally be parallelizable.

jacob-rosenthal · 2020-11-06T22:29:55Z

Should probably use something like this: https://joblib.readthedocs.io/en/latest/parallel.html#
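
A rough sketch of what that could look like with joblib's `Parallel` and `delayed`. The `preprocess` function and `run_on_dataset` helper are hypothetical placeholders, and the dataset is assumed to be a plain iterable of picklable items:

```python
from joblib import Parallel, delayed


def preprocess(image):
    # Hypothetical placeholder for the work a pipeline does on one image.
    return image * 2


def run_on_dataset(dataset, n_jobs=2):
    # Dispatch one task per image; joblib returns results in input order.
    return Parallel(n_jobs=n_jobs)(
        delayed(preprocess)(image) for image in dataset
    )
```

Setting `n_jobs=1` runs the tasks sequentially, which is handy for debugging before scaling up to more workers.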

jacob-rosenthal added the enhancement New feature or request label Oct 21, 2020

jacob-rosenthal mentioned this issue Nov 17, 2020

Multiprocessing #38

Merged

jacob-rosenthal closed this as completed Jan 5, 2021
