seq2seq trainer test refactor. #66

devrimcavusoglu · 2022-10-04T12:59:58Z

After jury version 2.2.2 (migrating package from datasets (to be deprecated) to evaluate), it turns out there's a little discrepancy between these two HF packages. Formerly, with datasets the arrow table schema was somehow bypassed with dtypes that are not conforming the table schema, this got unnoticed on the test cases on trapper. Currently (after switching to evaluate), the test fails as the inputs for metrics does not conform the table schema (int passed instead of string), and hence the test fails.

This PR addresses the issue above by adding an InputHandler for language generation tasks metrics that require string inputs.

Sophylax

LGTM!

devrimcavusoglu added 2 commits October 4, 2022 15:44

Refactor on test_seq2seq_trainer.py (input handler refactored).

56ebfeb

Code formatting.

d2e04dc

devrimcavusoglu added bug Something isn't working do not merge Don't merge the PR, yet. refactor labels Oct 4, 2022

devrimcavusoglu requested review from Sophylax, SecilOzkSen and cemilcengiz October 4, 2022 12:59

devrimcavusoglu marked this pull request as ready for review October 5, 2022 09:43

devrimcavusoglu removed the do not merge Don't merge the PR, yet. label Oct 5, 2022

Docstring refactored, comment added.

8b0d1ba

Sophylax approved these changes Oct 13, 2022

View reviewed changes

devrimcavusoglu merged commit 15b1f5e into main Oct 13, 2022

devrimcavusoglu deleted the eval-prediction-refactor branch October 13, 2022 13:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

seq2seq trainer test refactor. #66

seq2seq trainer test refactor. #66

devrimcavusoglu commented Oct 4, 2022

Sophylax left a comment

seq2seq trainer test refactor. #66

seq2seq trainer test refactor. #66

Conversation

devrimcavusoglu commented Oct 4, 2022

Sophylax left a comment

Choose a reason for hiding this comment