Add preliminary Dolma v1.7 configurations, fix corner case in tokens. #120

Merged 6 commits on Feb 13, 2024

Commits on Feb 10, 2024

  1. data

    soldni committed Feb 10, 2024
    9312ad5
  2. added configs

    soldni committed Feb 10, 2024
    fd035ba

Commits on Feb 11, 2024

  1. a059338

Commits on Feb 13, 2024

  1. fixed bug in tokenizer

    soldni committed Feb 13, 2024
    9ce2e51
  2. dcbf75d
  3. removed models dir for now

    soldni committed Feb 13, 2024
    15a1e5a