seq2seq trainer test refactor. #66

Merged
devrimcavusoglu merged 3 commits into main from eval-prediction-refactor on Oct 13, 2022

devrimcavusoglu (Member)

After jury version 2.2.2 (which migrated the package from datasets (to be deprecated) to evaluate), a small discrepancy between these two HF packages surfaced. Previously, with datasets, inputs whose dtypes did not conform to the Arrow table schema were somehow let through, so the problem went unnoticed in the test cases on trapper. Now, after the switch to evaluate, the test fails because the inputs to the metrics do not conform to the table schema (int passed instead of string).

This PR addresses the issue above by adding an InputHandler for language generation metrics that require string inputs.
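
To illustrate the idea, here is a minimal sketch of what such a handler could do, assuming the fix amounts to casting non-string predictions and references to strings before they reach the metric. The names `StringInputHandler` and `to_text` are hypothetical and are not taken from trapper's code; the actual InputHandler in this PR may differ.

```python
from typing import Any, Tuple


def to_text(item: Any) -> Any:
    """Recursively cast non-string leaves to str, keeping list nesting."""
    if isinstance(item, (list, tuple)):
        return [to_text(sub) for sub in item]
    return item if isinstance(item, str) else str(item)


class StringInputHandler:
    """Hypothetical handler (not trapper's API): makes metric inputs
    conform to a string-typed schema by casting ints and other values."""

    def __call__(self, predictions: Any, references: Any) -> Tuple[Any, Any]:
        return to_text(predictions), to_text(references)


handler = StringInputHandler()
# Integer inputs, as in the failing test, become strings.
preds, refs = handler([1, 2], [[3], [4, 5]])
```

With inputs normalized this way, a metric that declares string features would receive values matching its schema instead of ints.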

devrimcavusoglu added the bug, do not merge, and refactor labels on Oct 4, 2022
devrimcavusoglu marked this pull request as ready for review on October 5, 2022 09:43
devrimcavusoglu removed the do not merge label on Oct 5, 2022

Sophylax (Member) left a comment:

LGTM!

devrimcavusoglu merged commit 15b1f5e into main on Oct 13, 2022
devrimcavusoglu deleted the eval-prediction-refactor branch on October 13, 2022 13:17
Labels: bug, refactor