
Version dolma flan change #710

Merged
merged 7 commits into from
Aug 26, 2024

Conversation

IanMagnusson
Contributor

I'm working on sorting out which model ladder runs are comparable to one another. A crucial requirement for this is versioning our data mixes. We should lock in a name for the version of Dolma 1.7 that uses preprocessed/tulu_flan/v1-decontaminated-60M-shots_all-upweight_1-dialog_false-sep_rulebased instead of preprocessed/tulu_flan/v2-decontaminated-60M-shots_all-upweight_1-dialog_false-sep_newline/ (introduced to the model ladder in this PR). I think we should not call this dolma17, as we currently do, unless there is a plan to update the HF-hosted version of Dolma 1.7 to include this change as well. At a minimum, I'd like there to be two differently named named_data_mixes for Dolma 1.7, one for each of these FLAN versions, so that the exact dataset a run uses can be determined from the data mix name alone, without having to check out the code used to train the run to see which overloaded version of a named mix it is.

The implementation here is a hotfix that differentiates the FLAN versions in dolma17 for the model ladder. Later we can clean up the data mix definition system more thoroughly, but right now we just need to make sure that new runs do not have a mislabeled data mix.
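
To illustrate the idea of one mix name per FLAN version, here is a minimal sketch. The mix names `dolma17-flan-v1` and `dolma17-flan-v2`, the `FLAN_BY_MIX` table, and the `flan_source_for` helper are all hypothetical and are not the names or code used in this PR; only the two FLAN paths come from the description above.

```python
# Hypothetical sketch: the mix names and helper below are illustrative only,
# not the repository's actual data mix definitions.

# The two FLAN variants named in the PR description.
FLAN_V1 = "preprocessed/tulu_flan/v1-decontaminated-60M-shots_all-upweight_1-dialog_false-sep_rulebased"
FLAN_V2 = "preprocessed/tulu_flan/v2-decontaminated-60M-shots_all-upweight_1-dialog_false-sep_newline"

# One distinct name per FLAN variant, so a mix name never maps to two datasets.
FLAN_BY_MIX = {
    "dolma17-flan-v1": FLAN_V1,  # hypothetical name
    "dolma17-flan-v2": FLAN_V2,  # hypothetical name
}


def flan_source_for(mix_name: str) -> str:
    """Return the FLAN path for a versioned mix name; reject any other name."""
    try:
        return FLAN_BY_MIX[mix_name]
    except KeyError:
        raise ValueError(f"Unknown or unversioned data mix: {mix_name!r}") from None
```

With a table like this, an ambiguous name such as `dolma17` is rejected instead of silently resolving to whichever FLAN version the checked-out code happens to use.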

@IanMagnusson IanMagnusson marked this pull request as ready for review August 22, 2024 20:33
@soldni soldni requested a review from epwalsh August 22, 2024 20:57
@IanMagnusson IanMagnusson merged commit cee1a5d into main Aug 26, 2024
11 of 12 checks passed
@IanMagnusson IanMagnusson deleted the version-dolma-flan-change branch August 26, 2024 18:44
2 participants