-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[air] pyarrow.fs
persistence (10/n): Unify Tune and Train sessions to support new persistence path in FunctionTrainable
#38284
Merged
ericl
merged 34 commits into
ray-project:master
from
justinvyu:air/persistence/unify_sessions
Aug 12, 2023
Merged
Changes from 26 commits
Commits
Show all changes
34 commits
Select commit
Hold shift + click to select a range
1d8f49b
Move _TrainingResult to session.py
justinvyu c88df86
Prototype unified session
justinvyu 7bca696
Merge branch 'master' of https://github.com/ray-project/ray into air/…
justinvyu 8871231
Fix incorrect merge
justinvyu d87100d
Implement reset with unified session (for actor reuse)
justinvyu 5455c2c
Eager mode in session (difference in tune/train behavior)
justinvyu e1bd135
Working for train again
justinvyu 60ea1ee
Remove unused ckpt index code
justinvyu 3d23988
Fix lint
justinvyu 4d209e7
Merge branch 'master' of https://github.com/ray-project/ray into air/…
justinvyu 67a0721
Add dict checkpoint utils for tests
justinvyu 2a35b3e
Add env var as a constant
justinvyu 7915999
Remove prints
justinvyu 74a2f19
Fix tune.run sync config = None issue
justinvyu dfff147
Merge branch 'master' of https://github.com/ray-project/ray into air/…
justinvyu 35aaf2c
Rename eager_mode -> synchronous_result_reporting
justinvyu d08e6e4
Improve some comments + some cleanups
justinvyu c2e33cc
Some more cleanups (remove unused code)
justinvyu 9b475ec
Remove reference to global session
justinvyu bf32992
Remove shared storage context (store in global session instead)
justinvyu b59e0b6
Update tuner e2e test to test restoration / checkpointing
justinvyu 86b5520
Fix lint
justinvyu 6c3b4e5
More cleanups + docstrings
justinvyu ce9920b
synch result reporting logic is flipped...
justinvyu a97025f
Handle trainable outputs correctly
justinvyu b14bce5
Propagate storage on trial.reset (for restarting upon restore)
justinvyu 0792283
Merge branch 'master' of https://github.com/ray-project/ray into air/…
justinvyu 90480aa
Guard the tune session assertion
justinvyu e9ff63c
thread join timeout = 0 on cleanup + report any errors left in the queue
justinvyu c3246dd
Add back saving_to for now
justinvyu 7f68633
Convert path to str for env var
justinvyu 1498f05
TIL local variables override imports even if they're set conditionally
justinvyu 5b40250
Merge branch 'master' of https://github.com/ray-project/ray into air/…
justinvyu 5d72ca8
Clarify comment a bit
justinvyu File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rllib's
LearnerGroup
usesBackendExecutor
, and they may not be inside a Tune session (??). But they never call this code and only use it to start and stop aWorkerGroup
. Maybe they should just use theWorkerGroup
abstraction directly 😅