Add memory and storage awareness #289

Merged: jpsamaroo merged 13 commits into master from jps/storage on Jul 23, 2022

Conversation

@jpsamaroo (Member) commented on Oct 15, 2021

This PR adds awareness of memory and storage (disk, etc.) to Dagger, introducing a new "storage subsystem" similar to the existing "processor subsystem". The intention is that, by modeling storage resources explicitly (detecting their real-time capacities and free space, and providing methods to move data to and from storage), we can teach the scheduler to swap data to disk when memory is full, or to perform any other kind of capacity-protecting data movement or scheduling.

We will additionally begin tracking GC allocations at runtime, and use estimates of such allocations to limit scheduling when the scheduler knows that memory would otherwise become exhausted. This should make it easier to execute code over "big data", even when such data is too large for a single worker, or even all workers, to keep in memory at one time. This model should also be extensible to GPUs (which have their own memory space), so that GPU OOMs can be avoided.
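
To make the allocation-tracking idea concrete, here is a minimal sketch in plain Julia (not Dagger's actual implementation) of how per-signature allocation estimates could be gathered: measure each call's GC allocations with `@timed` and keep a running mean keyed by the function and argument types, skipping the first run so compilation allocations don't skew the estimate.

```julia
# Sketch only (not Dagger's implementation): per-signature allocation estimates.
# Maps (function type, argument types...) => (mean allocated bytes, sample count).
const ALLOC_ESTIMATES = Dict{Tuple,Tuple{Float64,Int}}()

function record_allocs!(f, args...)
    sig = (typeof(f), map(typeof, args)...)
    stats = @timed f(args...)   # stats.bytes = bytes allocated by this call
    if !haskey(ALLOC_ESTIMATES, sig)
        # The first run of a signature is dominated by compilation allocations,
        # so register the signature but ignore the measurement.
        ALLOC_ESTIMATES[sig] = (0.0, 0)
    else
        mean, n = ALLOC_ESTIMATES[sig]
        ALLOC_ESTIMATES[sig] = (mean + (stats.bytes - mean) / (n + 1), n + 1)
    end
    return stats.value
end
```

After a couple of runs, `ALLOC_ESTIMATES[(typeof(sum), Vector{Float64})]` would hold an estimate that a scheduler could compare against a worker's free memory before launching another task.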

Todo:

  • Estimate runtime memory allocation per-signature
  • Reintroduce capacity awareness to Sch (memory)
  • Estimate current storage resource utilization via returned thunk metadata
  • Implement a per-thunk option for specifying thunk memory utilization (a usage sketch follows this list)
  • Add dispatch-based mechanism for specifying thunk options
  • Track and ignore first run of signature in alloc calculation
  • Query the allocator before moving data or executing thunks to ensure that the ensuing allocations won't exceed the memory allocation limit (needs an API in MemPool, plus local tracking of thunks actively fetching/executing); thunks will be paused until there is space available for their data and estimated local allocations
  • Provide storage thunk option to indicate which MemPool StorageDevice to use (defaults to the global device)
  • DTable: Add tests for computations using more than physical RAM
  • Update docs for new automatic disk caching and memory awareness behavior
  • Add docs on Storage system design
  • (Optional) Estimate StorageDevice transfer times per-byte and compression amount (per-type?), and teach Sch to compute StorageDevice transfer costs in scheduling
  • (Optional) Add compressed RAM support via TranscodingStreams.jl (separate subpackage, DaggerZRAM.jl?)
  • (Optional) Implement a user-programmable interface for detecting thunk temporary memory allocations at runtime, and store this estimate per-signature
  • (Optional) Add memory wait costs to estimate_task_costs
  • (Optional) Re-enable capacity monitoring based on memory availability (need to implement a threshold for how many over-capacity thunks can be scheduled per-worker before pausing scheduling)
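
As a rough illustration of the per-thunk options above: the option names `alloc_util` and `storage` come from this PR's commits, but the surface syntax and value types shown here (e.g. whether utilization is keyed per processor type, mirroring the old `procutil` option) are assumptions, not the final API.

```julia
using Dagger, MemPool

# Hypothetical usage only: hint the scheduler about a thunk's expected
# allocations and choose which MemPool device should hold its result.
a = rand(10^6)
t = Dagger.@spawn alloc_util=Dict(Dagger.ThreadProc=>64*1024^2) storage=MemPool.GLOBAL_DEVICE[] sum(a)
fetch(t)
```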

@krynju (Member) commented on Oct 16, 2021

Hey, two general ideas:

  • A toggle for caching to disk, which I guess should be off by default
  • Explicitly caching chunks / suggesting a cache? In DTable, a lot of chunks can be marked as "free to cache", because it's known they are not needed in the next stages of processing

@jpsamaroo (Member, Author) commented on Jan 26, 2022

I ended up deciding to implement this logic in MemPool, since it's the most reasonable place to do this, and it has the greatest control over memory management: JuliaData/MemPool.jl#60. With that PR posted and basically ready to go, I'm slightly changing what we'll be implementing in Dagger:

  • (Optional) Provide allocator thunk option to indicate which MemPool StorageDevice to use (defaults to the global device)
  • Detect storage resource capacity for all allocator sub-devices upon first use
  • Implement a user-programmable interface for detecting thunk temporary memory allocations at runtime, and store this estimate per-signature
  • Query the allocator before moving data or executing thunks to ensure that the ensuing allocations won't exceed the memory allocation limit (needs an API in MemPool, plus local tracking of thunks actively fetching/executing); thunks will be paused until there is space available for their data and estimated local allocations
  • (Optional) Estimate storage device utilization via returned thunk metadata
  • (Optional) Add memory wait costs to estimate_task_costs
  • (Optional) Re-enable capacity monitoring based on memory availability (need to implement a threshold for how many over-capacity thunks can be scheduled per-worker before pausing scheduling)

The non-optional items in this list are the basics necessary to let Dagger handle "big data" problems; the MemPool PR also gives us swap-to-disk automatically, so we don't need to worry about that here. The optional items improve scheduling decisions, which is helpful but not strictly necessary (and will be partially obviated by future work-stealing).

Once I have a working alternative, I'll likely close this PR. PR updated!

@jpsamaroo (Member, Author) commented

Replying to @krynju:

> A toggle for caching to disk, which I guess should be off by default

By default we'll follow whatever MemPool.GLOBAL_DEVICE is set to, which defaults to memory-only.
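
For illustration, opting into disk-backed storage might look roughly like the sketch below once the MemPool PR lands. `GLOBAL_DEVICE` is the ref mentioned above; the specific allocator/device constructors and their arguments are assumptions based on JuliaData/MemPool.jl#60, not the confirmed API.

```julia
using MemPool

# Inspect the process-wide default storage device (memory-only by default).
MemPool.GLOBAL_DEVICE[]

# Illustrative only: swap in a device that spills least-recently-used chunks
# to disk once an in-memory limit is reached.
mem_limit  = 8  * 1024^3   # keep up to 8 GiB of chunk data in RAM
disk_limit = 32 * 1024^3   # allow up to 32 GiB to spill to disk
MemPool.GLOBAL_DEVICE[] = MemPool.SimpleRecencyAllocator(
    mem_limit,
    MemPool.SerializationFileDevice("/tmp/dagger-storage"),
    disk_limit,
    :LRU,   # evict least-recently-used chunks to disk first
)
```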

> Explicitly caching chunks / suggesting a cache? In DTable, a lot of chunks can be marked as "free to cache", because it's known they are not needed in the next stages of processing

This would be a decision for the MemPool allocator to make. I think I'd like to see how far we can get with basic allocation strategies (maybe MRU or similar), before we consider passing such information directly to the allocator. I'd prefer not to end up with an API like Linux's madvise, so I'll take some time to think on it.

Commits

  • Track worker storage resources and devices
  • Track thunk return value allocations
  • Expand procutil option to time_util and alloc_util
  • Add storage option for specifying MemPool storage device
  • Format bytes in debug logs
  • Add locking around CHUNK_CACHE
  • Move return value Chunks to MemPool device
  • Chunk: Update tochunk docstring
  • Walk data to determine serialization safety
  • Drop Julia 1.6 support
  • Split suites out into individual files
  • Provide usage info when run without BENCHMARK env var
  • Add option to save logs to output file
  • Add DTable CSV/Arrow reading suite

@jpsamaroo marked this pull request as ready for review on July 23, 2022, 18:23
@jpsamaroo merged commit 17e5b2e into master on Jul 23, 2022
@jpsamaroo deleted the jps/storage branch on July 23, 2022, 18:31