Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.5k 591

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.2k 136

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 232

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 11.5k 778

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 716 63

Repositories

Showing 10 of 497 repositories
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 11,517 Apache-2.0 778 77 17 Updated Apr 21, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 197 Apache-2.0 35 1 17 Updated Apr 21, 2025
  • Python 7 Apache-2.0 2 6 2 Updated Apr 21, 2025
  • DrawEduMath Public

    Can VLMs understand students' hand-drawn math work?

    Python 8 Apache-2.0 0 0 13 Updated Apr 20, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 2,911 Apache-2.0 375 13 16 Updated Apr 20, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5,508 Apache-2.0 591 51 58 Updated Apr 19, 2025
  • regmixer Public
    Python 4 0 0 2 Updated Apr 19, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 22 Apache-2.0 5 0 6 Updated Apr 19, 2025
  • datamap-rs Public

    Data mapping framework for Rust stuff

    Rust 2 0 0 0 Updated Apr 18, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,347 Apache-2.0 232 250 4 Updated Apr 18, 2025