-
Notifications
You must be signed in to change notification settings - Fork 170
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training on multiple csvs #1
Comments
Hi armheb, Yes concatenation is one possible solution. `class DataGenerator(tf.keras.utils.Sequence):
train_gen = DataGenerator(train_seq_paths, seq_per_file, seq_len, batch_size) Then when training the model the fit function looks as follows
I hope this helps. |
Thank you so much for the great explanation, I will try that and share the results here. |
I'm looking forward to the results. If you have any additional questions just let me know |
Hi, thanks you've been very helpful, I modified your DataGenerator code a bit and got it to start training as you said, but now at the end of the first epoch, I get out memory errors for the GPU! Here is the error: Function call stack: I'm training on a Titan XP with 12GB memory, and also decreased the batch size and seq_len but still getting the same error. |
This is my DataGenerator class: `class DataGenerator(tf.keras.utils.Sequence):
|
If you are not shuffling ur files during training, it looks like that the last files that go into the generator have a lot of entries. What I can deriving from shape[3736448,256] is that ur are passing 3736448 sequences with a length of 256 into the model. The 3736448 is the aggregated batch size of that file batch. Just check whether u have very large file in ur dataset and potentially exclude it for now. |
Thanks for your answer, I don't have a large file in the dataset, in the |
By adding all sequences together, the model can train, although it takes about 4.5 hours per epoch! |
Hi, I trained the model for about a week but unfortunately, the final result was a straight line in the middle. do you have plans to update the repo? can please share your weight to finetune the model based on that? |
Hi, thanks for sharing your awesome work, I wanted to train the transformer model in multiple csvs which have the same time span, should I just contact them and make one big dataframe and train the model?
Thanks
The text was updated successfully, but these errors were encountered: