-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hdf5 being rebuilt too often #1078
Comments
Just happened with the same exact data, config, etc. I will find the time to debug this |
The logic is the following: https://github.com/ludwig-ai/ludwig/blob/master/ludwig/data/preprocessing.py#L1798-L1811
If a feature parameter like dropout or a training parameter like batch size are changed, the preprocessing should not be triggered again (they are not used to compute the checksum, so checksum is identical and so the preprocessing should not be triggered, the relevant piece of code is: ludwig/ludwig/data/preprocessing.py Lines 1313 to 1334 in 576b74b
Let me know if you can figure out what triggers the spurious recreation of the cache. |
Here it is... You should be able to repro with a model definition like this one |
Ok... needed some sleep :)
ludwig/ludwig/data/preprocessing.py Line 1313 in 576b74b
vs
ludwig/ludwig/data/preprocessing.py Line 1591 in 576b74b
Since that The "merging" happens here: ludwig/ludwig/data/preprocessing.py Lines 1132 to 1161 in 576b74b
|
@w4nderlust I don't know how to manage this without breaking that last code... I'd say I'm passing this to you guys hehe :) |
Looks like a bug then, will look into it! |
Working on it #1114 |
Merged the PR, @carlogrisetti could you confirm this solves the issue in your specific use case? |
Never saw this message, sorry @w4nderlust . It does indeed fix the issue. I just had it resurface in a non-master updated install... and as soon as I updated to master it worked flawessly |
great to hear! |
ludwig from master (and since 0.3.2 at least), on different systems
when changing the config.yaml file, even just changing the dropout values (which nothing has to do with the hdf5 preprocessed file), Ludwig says that the checksum has changed, hence it has to rebuild the hdf5 file.
This happens also switching the batch size parameter, for example. I am suspecting it happened to me also without changing any parameter whatsoever.
I will look into that, but wanted to keep track of this, since I don't know how much time I will have to do this in the next few days, and maybe that's already a known issue (i have found no issues regarding this, anyway).
The text was updated successfully, but these errors were encountered: