Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.4k 582

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.2k 130

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 232

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 10.5k 702

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 693 58

Repositories

Showing 10 of 495 repositories
  • open-instruct Public

    AllenAI's post-training codebase

    Python 2,841 Apache-2.0 367 15 13 Updated Mar 26, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 16 Apache-2.0 5 1 6 Updated Mar 26, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 175 Apache-2.0 31 1 19 Updated Mar 26, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 10,459 Apache-2.0 702 71 18 Updated Mar 26, 2025
  • lighthouse Public
    Python 0 Apache-2.0 0 0 0 Updated Mar 25, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,312 Apache-2.0 232 247 4 Updated Mar 25, 2025
  • olmes Public

    Reproducible, flexible LLM evaluations

    Python 180 Apache-2.0 19 6 1 Updated Mar 25, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 30 Apache-2.0 2 5 0 Updated Mar 25, 2025
  • Python 6 Apache-2.0 2 6 3 Updated Mar 25, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5,435 Apache-2.0 582 49 53 Updated Mar 24, 2025