[Ray 2.7 Examples][2/n] Revamp the GPT-J DeepSpeed Example #38600
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Remove all 'preprocessor' imports, now that they are no longer used?
@richardliaw Done!
I have been meaning to copy edit this notebook. I hope you can incorporate these edits in this PR without too much friction.
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>
@angelinalg Thank you so much for the polish! The whole example now feels much more natural and fluid!!
Deprecates passing in preprocessor to Trainers. Also deprecates Chain and BatchMapper preprocessors. Removes all usage of preprocessors on non-tabular data from the docs and examples. Closes #38290 Depends on #38634 and #38600 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: Amog Kamsetty <amogkam@users.noreply.github.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
…ct#38600) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>
Deprecates passing in preprocessor to Trainers. Also deprecates Chain and BatchMapper preprocessors. Removes all usage of preprocessors on non-tabular data from the docs and examples. Closes ray-project#38290 Depends on ray-project#38634 and ray-project#38600 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: Amog Kamsetty <amogkam@users.noreply.github.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>
…ct#38600) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Victor <vctr.y.m@example.com>
Deprecates passing in preprocessor to Trainers. Also deprecates Chain and BatchMapper preprocessors. Removes all usage of preprocessors on non-tabular data from the docs and examples. Closes ray-project#38290 Depends on ray-project#38634 and ray-project#38600 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: Amog Kamsetty <amogkam@users.noreply.github.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: Victor <vctr.y.m@example.com>
Why are these changes needed?
Revamp the GPT-J DeepSpeed fine-tuning example with the new APIs. The main changes involve:
- TransformersTrainer, replaced with TorchTrainer + integration utilities
- Preprocessors
- TransformersPredictor
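For readers unfamiliar with the new pattern, here is a minimal sketch of what "TorchTrainer + integration utilities" can look like. It is not the code from this PR. The config values, the `DEFAULT_CONFIG` and `build_trainer` names, and the assumption that the dataset is already tokenized are all illustrative; only the Ray Train and Hugging Face calls themselves are real APIs.

```python
# Illustrative defaults (assumptions, not values taken from this PR).
DEFAULT_CONFIG = {
    "model_name": "EleutherAI/gpt-j-6B",
    "batch_size": 4,
    "max_steps": 100,
    "deepspeed": None,  # a DeepSpeed config dict could go here
}


def train_func(config):
    """Training loop that Ray Train runs on every worker."""
    # Local imports keep this module importable without Ray or transformers.
    import ray.train
    from ray.train.huggingface.transformers import (
        RayTrainReportCallback,
        prepare_trainer,
    )
    from transformers import AutoModelForCausalLM, Trainer, TrainingArguments

    # Each worker reads its own shard of the dataset passed to TorchTrainer.
    # Assumes the dataset is already tokenized (input_ids, attention_mask, labels).
    shard = ray.train.get_dataset_shard("train")
    train_iterable = shard.iter_torch_batches(batch_size=config["batch_size"])

    model = AutoModelForCausalLM.from_pretrained(config["model_name"])
    args = TrainingArguments(
        output_dir="output",
        per_device_train_batch_size=config["batch_size"],
        max_steps=config["max_steps"],  # needed because the iterable has no length
        deepspeed=config.get("deepspeed"),
        report_to="none",
    )
    trainer = Trainer(model=model, args=args, train_dataset=train_iterable)

    # Integration utilities: report metrics/checkpoints to Ray Train and
    # adapt the Hugging Face Trainer to the Ray Data iterable.
    trainer.add_callback(RayTrainReportCallback())
    trainer = prepare_trainer(trainer)
    trainer.train()


def build_trainer(train_dataset, num_workers=2, use_gpu=True, config=None):
    """Hypothetical helper that wires train_func into a TorchTrainer."""
    from ray.train import ScalingConfig
    from ray.train.torch import TorchTrainer

    return TorchTrainer(
        train_func,
        train_loop_config=dict(DEFAULT_CONFIG, **(config or {})),
        scaling_config=ScalingConfig(num_workers=num_workers, use_gpu=use_gpu),
        datasets={"train": train_dataset},
    )
```

With Ray and transformers installed, something like `build_trainer(tokenized_ds).fit()` would launch the run, where `tokenized_ds` is a Ray Dataset prepared beforehand (for example with plain `map_batches` calls instead of a Preprocessor).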
Rendered doc: https://anyscale-ray--38600.com.readthedocs.build/en/38600/ray-air/examples/gptj_deepspeed_fine_tuning.html
Release test passed: https://buildkite.com/ray-project/release-tests-pr/builds/50013#018a1af2-c070-4482-a413-8632f20991ae
Related issue number
Checks
- I've signed off every commit (git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- If I added a method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.