-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate category value across features crash during training evaluation #935
Comments
Error has the first shape member set to the batch size. Changing the batch size of course changes the error details but presents the same issue. |
Both output features to text DOES solve the issue I do not have a dataset and time to test 2 category + 1 text, but it seems almost like the combination of 2 category output features is breaking something. As long as there is only one category output, it works fine |
Hey @carlogrisetti, is it possible that training and eval sets have a different number of distinct values for the categorical features? Looks like there is an off-by-one error between the output and target, suggesting different vocab sizes. |
Ok, found out how to repro, and can include full sample dataset. Here it is: Once you get different category names across different columns, it all works (just tested on the original dataset where I found it, i had the same category name for 2 output features. Once one of them was renamed to be different, it all worked fine) |
I figured out what the issue is, working on a fix. Very good catch, thank you, this only show up when you have two output features of the same type. |
Should be solved, please confirm if that's true also for your real usecase. |
Confirmed, thanks! |
Training a model with multiple output features results in a valueerror in the evaluation phase of the first epoch.
It's not related to the tied category value, even two "simple" categorical outputs gave the error.
Switching to a single categorical value (one or the other) solves the issue and allows for the training to continue.
multiple_output.zip
I don't know if it's only related to this specific combination or it happens with multiple outputs of any type. Will try to investigate further...
The text was updated successfully, but these errors were encountered: