Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.7k 628

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 146

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.4k 248

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 13.1k 953

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 800 76

Repositories

Showing 10 of 506 repositories
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 36 Apache-2.0 7 0 19 Updated Jul 7, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    allenai/ai2thor’s past year of commit activity
    C# 1,439 Apache-2.0 248 263 5 Updated Jul 7, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 253 Apache-2.0 48 0 29 Updated Jul 7, 2025
  • allenai/rslearn_projects’s past year of commit activity
    Python 9 Apache-2.0 2 11 13 Updated Jul 7, 2025
  • S2AND Public

    Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

    allenai/S2AND’s past year of commit activity
    Python 94 19 6 0 Updated Jul 7, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 40 Apache-2.0 5 10 6 Updated Jul 7, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,045 Apache-2.0 411 16 11 Updated Jul 7, 2025
  • ai2-scholarqa-lib Public

    Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

    allenai/ai2-scholarqa-lib’s past year of commit activity
    Python 191 Apache-2.0 35 3 0 Updated Jul 7, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    allenai/OLMo’s past year of commit activity
    Python 5,745 Apache-2.0 628 10 58 Updated Jul 7, 2025
  • beaker-gantry Public

    Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you

    allenai/beaker-gantry’s past year of commit activity
    Python 24 Apache-2.0 6 2 3 Updated Jul 7, 2025