Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Don't reset the dataset type + plug for rm unused columns #6683

Merged
merged 3 commits into from
Aug 24, 2020

Conversation

sgugger
Copy link
Collaborator

@sgugger sgugger commented Aug 24, 2020

This PR avoids resetting the dataset type when removing columns, and also introduces a filed in TrainingArguments to disable that behavior (in case the use wants to use some of those fields in an elaborate data collator).

@codecov
Copy link

codecov bot commented Aug 24, 2020

Codecov Report

Merging #6683 into master will decrease coverage by 0.04%.
The diff coverage is 14.28%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #6683      +/-   ##
==========================================
- Coverage   79.65%   79.60%   -0.05%     
==========================================
  Files         156      156              
  Lines       28250    28256       +6     
==========================================
- Hits        22503    22494       -9     
- Misses       5747     5762      +15     
Impacted Files Coverage Δ
src/transformers/trainer.py 55.63% <0.00%> (-0.53%) ⬇️
src/transformers/training_args.py 91.34% <100.00%> (+0.08%) ⬆️
src/transformers/generation_tf_utils.py 84.21% <0.00%> (-2.51%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1a779ad...eea7e8d. Read the comment docs.

src/transformers/trainer.py Outdated Show resolved Hide resolved
@sgugger sgugger merged commit b30879f into master Aug 24, 2020
@sgugger sgugger deleted the remove_columns_fix_and_plug branch August 24, 2020 13:22
Zigur pushed a commit to Zigur/transformers that referenced this pull request Oct 26, 2020
…e#6683)

* Don't reset the type of the dataset

* Formatting

* Update trainer.py

Co-authored-by: Teven <teven.lescao@gmail.com>
fabiocapsouza pushed a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020
…e#6683)

* Don't reset the type of the dataset

* Formatting

* Update trainer.py

Co-authored-by: Teven <teven.lescao@gmail.com>
fabiocapsouza added a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants