Training loss curve on MMC4 dataset? #226
I'm seeing the same problem: the training loss on MMC4 seems hard to converge.
Thanks. What similarity threshold (sim thresh) was used for this figure?
These curves use a threshold of 0.24.
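For readers unfamiliar with this kind of filtering, here is a minimal sketch of what applying a similarity threshold could look like. It assumes each image comes with a precomputed image-text similarity score. The function name `filter_by_similarity` and the `(image, score)` pair layout are hypothetical, chosen for illustration only, and are not the project's actual preprocessing code.

```python
from typing import List, Tuple

# Hypothetical helper: keep only images whose precomputed image-text
# similarity score is at or above the threshold. The (image_name, score)
# layout is an assumption for this sketch, not the real MMC4 schema.
def filter_by_similarity(
    pairs: List[Tuple[str, float]], threshold: float = 0.24
) -> List[Tuple[str, float]]:
    return [(image, score) for image, score in pairs if score >= threshold]


# Made-up scores, purely to show the behaviour at and around the cutoff.
example = [("img_a.jpg", 0.31), ("img_b.jpg", 0.19), ("img_c.jpg", 0.24)]
kept = filter_by_similarity(example)
print(kept)
```

A higher threshold keeps fewer but better-aligned images, so curves produced with different thresholds are not directly comparable.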
Hmm, we haven't plotted a validation loss like this before, so this behavior is pretty surprising to me! Do you know whether your downstream performance on task benchmarks improves or degrades with training?
The downstream performance is also unstable. A checkpoint from the middle of training is sometimes better than the final one. I guess that's because I trained the model with only 1M LAION and 1M MMC4 samples, so the data scale is too small. How many LAION and MMC4 samples did you use for the above figure? @i-gao
Ah, okay! The x-axis in the training curve plots refers to the number of interleaved (MMC4) samples.
Hi, thanks for the great work! I tried training on a subset of MMC4-core, but the LM loss does not go down much. Would it be possible to share the MMC4 loss curve for reference, so that I can tell whether this is expected or potentially a bug? Thanks so much!