Huggingface integration #137
Conversation
Save model at the end of each iteration
Co-authored-by: David de la Iglesia Castro <daviddelaiglesiacastro@gmail.com>
Thanks for the contribution @pacifikus! It looks great.
Could you:
- Open a PR in https://github.com/iterative/dvc.org adding HuggingFace to the list at https://dvc.org/doc/dvclive/user-guide/ml-frameworks. You should be able to use some of the other pages as a reference.
dvclive/huggingface.py (outdated)

    super().__init__()
    self.model_file = model_file

    def on_evaluate(
Is there a reason why the callbacks that ship with HuggingFace use on_log instead of on_evaluate?
https://huggingface.co/transformers/_modules/transformers/integrations.html#CometCallback
This callback does look simpler than those other implementations, but I'm just curious about that.
Thank you for your comment!
PR in dvc.org: iterative/dvc.org#2718
Yes, there is a reason why the callbacks that are part of HuggingFace use on_log instead of on_evaluate. As you can see in https://huggingface.co/transformers/main_classes/callback.html#trainercallback, metrics are only available in the on_evaluate event. Thus, we can get the loss values for both train and eval from on_log, but the eval metrics only from on_evaluate.
The on_evaluate event has a disadvantage, though: we cannot get the training loss during this event, only the evaluation loss.
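To illustrate the trade-off, here is a self-contained sketch. The event names mirror TrainerCallback, but the class and the payload dicts are invented stand-ins for illustration, not real transformers objects:

```python
# Sketch of which values reach each TrainerCallback-style event.
# EventSketch and the payloads are invented stand-ins, not the real
# transformers API.

class EventSketch:
    def __init__(self):
        self.from_on_log = []       # everything passed via on_log
        self.from_on_evaluate = []  # everything passed via on_evaluate

    def on_log(self, logs):
        # Fires for both training and evaluation logging steps,
        # so the training loss arrives here.
        self.from_on_log.append(dict(logs))

    def on_evaluate(self, metrics):
        # Fires only after evaluation; eval metrics are available,
        # but the training loss is not.
        self.from_on_evaluate.append(dict(metrics))

cb = EventSketch()
cb.on_log({"loss": 0.9})                            # training step
cb.on_evaluate({"eval_loss": 0.5, "eval_f1": 0.8})  # evaluation

train_keys = {k for d in cb.from_on_log for k in d}
eval_keys = {k for d in cb.from_on_evaluate for k in d}
print(train_keys)  # {'loss'}
print(eval_keys)   # {'eval_loss', 'eval_f1'}
```

A callback that listens only to on_evaluate never sees the plain `loss` key, which is the disadvantage described above.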
Thanks for the explanation. Would it make sense to extend DvcLiveCallback to use both on_log (for train and eval losses) and on_evaluate (for eval metrics)?
Yes, I think it's a good idea. I did it.
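A hedged sketch of that combined approach, not the actual dvclive implementation: DictLogger and the callback below are invented names; losses flow in via on_log, eval metrics via on_evaluate, and both are forwarded to one logger.

```python
# Hedged sketch of a callback wired to both events. DictLogger and
# CombinedCallback are invented for illustration; this is not the
# dvclive code.

class DictLogger:
    """Minimal stand-in for a metrics logger."""
    def __init__(self):
        self.history = {}

    def log(self, name, value):
        self.history.setdefault(name, []).append(value)

class CombinedCallback:
    def __init__(self, logger):
        self.logger = logger

    def on_log(self, logs):
        # Train and eval losses arrive through the logging event.
        for name, value in logs.items():
            self.logger.log(name, value)

    def on_evaluate(self, metrics):
        # Eval metrics (accuracy, F1, ...) arrive after evaluation.
        for name, value in metrics.items():
            self.logger.log(name, value)

logger = DictLogger()
cb = CombinedCallback(logger)
cb.on_log({"loss": 0.9})
cb.on_log({"loss": 0.7})
cb.on_evaluate({"eval_f1": 0.8})
print(logger.history)  # {'loss': [0.9, 0.7], 'eval_f1': [0.8]}
```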
I think that the huggingface docs might be a little misleading. I just debugged the test locally, and it seems that the same values available as metrics inside on_evaluate are previously passed as logs to on_log.
Yes, you're right. It's strange, but the metrics are also written in the on_log event. Thank you for your comment!
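This overlap matters for the combined approach: if the same values arrive through both events (as observed above) and the callback forwards from both, every eval metric gets recorded twice. A tiny sketch with invented names and values:

```python
# Illustration of the double-logging risk when the same eval values
# arrive via both on_log and on_evaluate. All names and values here
# are invented for illustration.

history = {}

def record(name, value):
    history.setdefault(name, []).append(value)

eval_metrics = {"eval_f1": 0.8}

# Same values forwarded from both events:
for name, value in eval_metrics.items():
    record(name, value)   # via on_log
for name, value in eval_metrics.items():
    record(name, value)   # via on_evaluate

print(history)  # {'eval_f1': [0.8, 0.8]} -- recorded twice
```

So a callback that forwards from both events needs to deduplicate, or simply stick to one event once the values are known to overlap.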
Hmm, now the test obtains different len(first(logs.values())) values on different operating systems.
Well, that is strange. I will debug the macOS test case locally, since it's the one failing.
I still don't understand what the bug was. If I had to guess, it might be related to first behaving differently on macOS. Anyway, I made the test more explicit, and it now passes 🎉
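For context, here is a hedged sketch of what "more explicit" could mean here. The `logs` data is invented example data shaped like `{metric_name: [values...]}`; asserting on `len(first(logs.values()))` silently depends on which key happens to come first, while naming the key states the intent:

```python
# Hedged sketch: explicit assertions vs. an order-dependent check.
# `logs` below is invented example data, not the real test fixture.

logs = {
    "loss": [0.9, 0.7, 0.5],
    "eval_loss": [0.6, 0.4],
}

# Implicit, order-dependent version (roughly what first() amounts to):
first_series = next(iter(logs.values()))
length = len(first_series)  # depends on which key comes first

# Explicit version: name the series you mean.
assert len(logs["loss"]) == 3
assert len(logs["eval_loss"]) == 2
```

The explicit form also fails with a clearer message, since the assertion names the exact series being checked.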
Codecov Report
@@            Coverage Diff             @@
##           master     #137      +/-   ##
==========================================
+ Coverage   90.00%   90.43%   +0.43%
==========================================
  Files          12       13       +1
  Lines         310      324      +14
==========================================
+ Hits          279      293      +14
  Misses         31       31
Closes #83