Skip to content

Pinned Loading

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.3k 570

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.1k 128

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 231

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 9.6k 633

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 667 55

Repositories

Showing 10 of 493 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 73 Apache-2.0 19 1 20 Updated Mar 13, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5,329 Apache-2.0 570 46 54 Updated Mar 13, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 13 Apache-2.0 5 1 4 Updated Mar 12, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,295 Apache-2.0 231 244 4 Updated Mar 12, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 9,626 Apache-2.0 633 58 17 Updated Mar 12, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 2,789 Apache-2.0 358 14 11 Updated Mar 13, 2025
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1,145 Apache-2.0 128 25 18 Updated Mar 13, 2025
  • Python 6 Apache-2.0 0 0 0 Updated Mar 12, 2025
  • OLMoE.swift Public
    Swift 256 Apache-2.0 29 6 1 Updated Mar 12, 2025
  • SciRIFF Public

    Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.

    Python 36 Apache-2.0 5 2 0 Updated Mar 12, 2025