Investigate overriding default MemoryDataSet #8

marrrcin · 2022-03-18T15:05:29Z

Right now the plug-in requires all intermediate datasets to be stored in GCS, which forces the pipeline authors to remember that all intermediate datasets need to be explicitly defined in the data catalog.
Consider overriding the create_default_data_set in the ThreadRunner https://github.com/kedro-org/kedro/blob/74fbd752b6bf77c409f016ed73e60a4ea38d6a95/kedro/runner/thread_runner.py#L51 or find other extension point to make the use of intermediate datasets transparent to the users (we could store them in GCS, as we already have paths to bucket and unique run id in the plugin).

The text was updated successfully, but these errors were encountered:

em-pe · 2022-03-22T12:29:40Z

We need to clarify the details and make it environment based as this may not be an optimal setup for a local runs, just the pipelines one.

* #8 Auto-dataset via custom runner

em-pe added enhancement New feature or request nice to have It would be nice to have but it's not urgent. labels Mar 24, 2022

marrrcin added a commit that referenced this issue Aug 1, 2022

#8 Auto-dataset via custom runner - WIP

3f18d3c

marrrcin mentioned this issue Aug 1, 2022

#8 Auto-dataset via custom runner #63

Merged

2 tasks

marrrcin added a commit that referenced this issue Aug 2, 2022

#8 Auto-dataset via custom runner - tested and polished

05b7706

marrrcin added a commit that referenced this issue Aug 2, 2022

#8 Auto-dataset via custom runner - cleaned up e2e config

b3d9e28

marrrcin added a commit that referenced this issue Aug 2, 2022

#8 Auto-dataset via custom runner - mocks

2fc7697

marrrcin added a commit that referenced this issue Aug 3, 2022

#8 Auto-dataset via custom runner - parameter rename

9c95b93

marrrcin closed this as completed in #63 Aug 3, 2022

marrrcin added a commit that referenced this issue Aug 3, 2022

#8 Auto-dataset via custom runner (#63)

2b7bdd0

* #8 Auto-dataset via custom runner

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate overriding default MemoryDataSet #8

Investigate overriding default MemoryDataSet #8

marrrcin commented Mar 18, 2022 •

edited

em-pe commented Mar 22, 2022

Investigate overriding default MemoryDataSet #8

Investigate overriding default MemoryDataSet #8

Comments

marrrcin commented Mar 18, 2022 • edited

em-pe commented Mar 22, 2022

marrrcin commented Mar 18, 2022 •

edited