[Unified TorchTrainer] Add HF Transformers TorchTrainer Utilities #38083

woshiyyya · 2023-08-03T18:57:38Z

Why are these changes needed?

Add HF Transformers related utilities for Unified TorchTrainer.

Rendered doc: https://anyscale-ray--38083.com.readthedocs.build/en/38083/

Add RayTrainReportCallback and prepare_trainer API for transformer integration
- Add Unit tests for these APIs
Add one basic example using transformers.trainer + Ray TorchTrainer
Update user guides for Transformers + TorchTrainer
- Getting start page
- API page
- Checkpoint loading and saving user guides
- Data Loading and Ingestion user guides

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

python/ray/train/huggingface/transformers/_transformers_utils.py

…/add_transformers_utilities

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

…/add_transformers_utilities

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

…/add_transformers_utilities

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

matthewdeng

This is awesome

python/ray/train/huggingface/transformers/_transformers_utils.py

doc/source/_toc.yml

python/ray/train/huggingface/__init__.py

doc/source/train/user-guides/checkpoints.rst

matthewdeng · 2023-08-11T19:00:07Z

doc/source/train/user-guides/checkpoints.rst

+        You should properly configure the `logging_strategy`, `save_strategy` 
+        and `evaluation_strategy`, so that at the checkpoint saving step, transformers 
+        trainer also reports the latest monitoring metrics (e.g. `eval_loss` in the above case).


I think it would be helpful to include these directly in the example itself, as it's currently not being shown.

In other words, the example should make it clear that our callback is complementary to Transformers logging/checkpointing.

Make sense. Like explicitly define the save_strategy and evaluation_strategy, and put these info as comments in the example?

doc/source/train/user-guides/checkpoints.rst

doc/source/train/user-guides/data-loading-preprocessing.rst

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

doc/source/train/user-guides/checkpoints.rst

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

…y-project#38083) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: NripeshN <nn2012@hw.ac.uk>

…y-project#38083) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: harborn <gangsheng.wu@intel.com>

…y-project#38083) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

…y-project#38083) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>

…y-project#38083) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com> Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: Victor <vctr.y.m@example.com>

woshiyyya and others added 3 commits August 3, 2023 11:56

init

e8ea452

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

Merge branch 'master' into train/unified-api/add_transformers_utilities

3e6eabd

update utilities

57e458b

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya marked this pull request as ready for review August 9, 2023 01:28

woshiyyya added 2 commits August 8, 2023 18:36

polish code

690bdf7

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

polish code

6ed560b

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

matthewdeng reviewed Aug 9, 2023

View reviewed changes

python/ray/train/huggingface/transformers/_transformers_utils.py Outdated Show resolved Hide resolved

python/ray/train/huggingface/transformers/_transformers_utils.py Outdated Show resolved Hide resolved

python/ray/train/huggingface/transformers/_transformers_utils.py Show resolved Hide resolved

woshiyyya added 2 commits August 10, 2023 10:02

Merge remote-tracking branch 'upstream/master' into train/unified-api…

9730923

…/add_transformers_utilities

finish

d562460

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya force-pushed the train/unified-api/add_transformers_utilities branch from 7957cf1 to d562460 Compare August 10, 2023 17:57

woshiyyya added 4 commits August 10, 2023 14:58

add UT

4d4ecf6

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

fix lint

fa41ab1

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

add unittests

8cc2c9c

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

fix tests

07b9953

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya assigned matthewdeng Aug 11, 2023

add basic example

7f32f04

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya requested review from richardliaw, krfricke, xwjiang2010, amogkam, Yard1, maxpumperla and a team as code owners August 11, 2023 02:08

woshiyyya and others added 7 commits August 10, 2023 19:10

Merge branch 'master' into train/unified-api/add_transformers_utilities

3655a58

fix doc

1fd6695

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

fix import path

4c6ccb5

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

Merge remote-tracking branch 'upstream/master' into train/unified-api…

29d44a8

…/add_transformers_utilities

wip

fcac881

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

update getting start page

93f12c5

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

add to toc tree

19ee0a1

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya added 4 commits August 11, 2023 01:16

add report and data integration

8fdf7c6

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

fix doc

a5e5cfe

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

Merge remote-tracking branch 'upstream/master' into train/unified-api…

8602ff2

…/add_transformers_utilities

update report callback

8ad6e11

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya requested a review from matthewdeng August 11, 2023 18:49

matthewdeng reviewed Aug 11, 2023

View reviewed changes

woshiyyya and others added 4 commits August 11, 2023 12:04

Apply suggestions from code review

1722273

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>

address comments

ba8bdd8

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

polish the basic example

4f496a3

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

address comments

f35a499

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

woshiyyya requested a review from matthewdeng August 11, 2023 20:44

woshiyyya added 2 commits August 11, 2023 13:46

minor fix

7427d98

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

fix Title underline too short

dcaca15

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

matthewdeng approved these changes Aug 11, 2023

View reviewed changes

doc/source/train/user-guides/checkpoints.rst Outdated Show resolved Hide resolved

woshiyyya added 2 commits August 11, 2023 16:20

fix lazy import and update checkpoint user guides

18cbca8

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

update emphasize lines

88c7acf

Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>

matthewdeng merged commit 3d69d15 into ray-project:master Aug 12, 2023
108 of 117 checks passed

This was referenced Aug 21, 2023

[train] Implement TorchTrainer subclass simplifications #38295

Closed

[Train][Ray 2.7] Revamp Ray Train examples with new APIs #38681

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Unified TorchTrainer] Add HF Transformers TorchTrainer Utilities #38083

[Unified TorchTrainer] Add HF Transformers TorchTrainer Utilities #38083

woshiyyya commented Aug 3, 2023 •

edited

Loading

matthewdeng left a comment

matthewdeng Aug 11, 2023

woshiyyya Aug 11, 2023

[Unified TorchTrainer] Add HF Transformers TorchTrainer Utilities #38083

[Unified TorchTrainer] Add HF Transformers TorchTrainer Utilities #38083

Conversation

woshiyyya commented Aug 3, 2023 • edited Loading

Why are these changes needed?

Related issue number

Checks

matthewdeng left a comment

Choose a reason for hiding this comment

matthewdeng Aug 11, 2023

Choose a reason for hiding this comment

woshiyyya Aug 11, 2023

Choose a reason for hiding this comment

woshiyyya commented Aug 3, 2023 •

edited

Loading