
Support for lazily created streams #671

Merged (5 commits) on Jun 18, 2021

Conversation

@lrzpellegrini (Collaborator) commented on Jun 17, 2021

This PR introduces support for lazy streams (discussed in #600).

It changes some of the internals of GenericCLScenario and adds proper helpers for creating streams from generator expressions or generator functions.

It also adds support for dropping references to previous experiences, allowing more fine-grained memory management (to be documented).

Beware that the helper used to create data-incremental benchmarks and the helper used to create a validation stream will still work only in non-lazy mode (they load all experiences at once). This will be fixed in the future.

The behavior of non-lazy benchmarks is not affected by these changes.
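The lazy-stream idea above can be sketched in plain Python. This is a conceptual sketch only: `LazyStream` and `make_experience` are hypothetical names for illustration, not part of the Avalanche API. The stream advances a generator only as far as the requested experience, and can drop references to earlier experiences, mirroring the memory-management feature this PR adds.

```python
class LazyStream:
    """A stream whose experiences are created lazily from a generator."""

    def __init__(self, generator, n_experiences):
        self._gen = generator          # yields one dataset per experience
        self._cache = []               # experiences materialized so far
        self.n_experiences = n_experiences

    def __getitem__(self, idx):
        # Advance the generator only as far as needed; earlier
        # experiences stay cached, later ones remain unloaded.
        while len(self._cache) <= idx:
            self._cache.append(next(self._gen))
        return self._cache[idx]

    def drop_previous(self, idx):
        # Release references to experiences before `idx` so they can
        # be garbage-collected (fine-grained memory management).
        for i in range(min(idx, len(self._cache))):
            self._cache[i] = None


def make_experience(i):
    # Stand-in for loading/building one experience's dataset.
    return [i * 10 + j for j in range(3)]


# The stream is built from a generator expression; nothing is
# loaded until an experience is actually accessed.
stream = LazyStream((make_experience(i) for i in range(5)), 5)
```

Accessing `stream[1]` materializes experiences 0 and 1 but leaves 2–4 unloaded; calling `stream.drop_previous(3)` then frees the first three.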

@coveralls

Pull Request Test Coverage Report for Build 947677077

  • 391 of 417 (93.76%) changed or added relevant lines in 7 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage increased (+0.6%) to 78.99%

| Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
| --- | --- | --- | --- |
| avalanche/benchmarks/scenarios/generic_cl_scenario.py | 53 | 59 | 89.83% |
| avalanche/benchmarks/scenarios/generic_benchmark_creation.py | 25 | 33 | 75.76% |
| avalanche/benchmarks/scenarios/lazy_dataset_sequence.py | 50 | 62 | 80.65% |

Totals Coverage Status
  • Change from base Build 942071966: 0.6%
  • Covered Lines: 9429
  • Relevant Lines: 11937

💛 - Coveralls

@vlomonaco (Member)

Hi @lrzpellegrini, this is awesome, thanks! We are looking forward to seeing this integrated into the benchmark generators (and manipulators). Do you think this could be done with a simple flag? Also, what's the disadvantage of making the whole system lazy?

@lrzpellegrini (Collaborator, Author)

For the moment there are no disadvantages to supporting lazy streams. However, for the vast majority of helpers, adding a lazy option won't reduce loading times. The helpers that can benefit the most from a lazy approach are the ones that load the whole dataset into memory, but we currently don't have any of those. In the future we may want to implement a path-based tensor dataset, in which case laziness may help a lot.
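The path-based dataset mentioned above could look roughly like this. This is a hypothetical sketch, not an existing Avalanche class: the dataset stores only file paths at construction time and reads each file on access, so building it (and a lazy stream of many such datasets) stays cheap.

```python
class PathBasedDataset:
    """Stores only file paths; contents are loaded on access."""

    def __init__(self, paths):
        # Constructing the dataset touches no files at all.
        self.paths = list(paths)

    def __len__(self):
        return len(self.paths)

    def __getitem__(self, idx):
        # Loading is deferred to this point; in a real tensor dataset
        # this would parse/decode the file into a tensor instead.
        with open(self.paths[idx], "rb") as f:
            return f.read()
```

With such a dataset, a lazy stream avoids paying the full I/O cost up front: each experience's files are read only when that experience is first used.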

@lrzpellegrini lrzpellegrini merged commit 176f31a into ContinualAI:master Jun 18, 2021
@lrzpellegrini lrzpellegrini deleted the issue_600 branch June 18, 2021 13:45