Skip to content

Pinned Loading

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.9k 647

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 146

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.5k 253

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 13.9k 1k

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 842 79

Repositories

Showing 10 of 517 repositories
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,490 Apache-2.0 253 267 5 Updated Aug 20, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 3,118 Apache-2.0 428 18 18 Updated Aug 20, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 43 Apache-2.0 8 1 30 Updated Aug 20, 2025
  • agent-eval Public
    Python 2 1 0 7 Updated Aug 20, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 273 Apache-2.0 52 0 37 Updated Aug 20, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 13,856 Apache-2.0 1,023 16 7 Updated Aug 20, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 41 Apache-2.0 6 9 4 Updated Aug 20, 2025
  • Python 13 Apache-2.0 2 14 16 Updated Aug 19, 2025
  • signal-and-noise Public

    Measuring the Signal to Noise Ratio in Language Model Evaluation

    Python 9 Apache-2.0 0 0 0 Updated Aug 19, 2025
  • asta-paper-finder Public

    frozen-in-time version of our Paper Finder agent for reproducing results on ASTA-Bench

    Python 0 0 0 0 Updated Aug 19, 2025