Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python · 5.5k stars · 592 forks

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python · 1.2k stars · 139 forks

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# · 1.4k stars · 234 forks

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python · 11.9k stars · 809 forks

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook · 719 stars · 63 forks

Repositories

Showing 10 of 498 repositories
  • open-instruct Public

    AllenAI's post-training codebase

    Python · 2,925 stars · Apache-2.0 · 377 forks · 13 issues · 14 pull requests · Updated Apr 24, 2025
  • atlantes Public

    Efficient and low latency real-time global-scale GPS trajectory modeling

    Python · 1 star · 0 forks · 0 issues · 4 pull requests · Updated Apr 24, 2025
  • beaker-py Public

    A pure-Python Beaker client

    Python · 15 stars · Apache-2.0 · 2 forks · 1 issue · 5 pull requests · Updated Apr 23, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python · 197 stars · Apache-2.0 · 35 forks · 1 issue · 16 pull requests · Updated Apr 23, 2025
  • ai2-scholarqa-lib Public

    Open-source code for the Ai2 Scholar QA app and its corresponding library

    Python · 156 stars · Apache-2.0 · 26 forks · 2 issues · 1 pull request · Updated Apr 24, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# · 1,351 stars · Apache-2.0 · 234 forks · 251 issues · 4 pull requests · Updated Apr 23, 2025
  • beaker-gantry Public

    Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you

    Python · 23 stars · Apache-2.0 · 5 forks · 1 issue · 2 pull requests · Updated Apr 23, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python · 11,892 stars · Apache-2.0 · 809 forks · 77 issues · 18 pull requests · Updated Apr 23, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python · 23 stars · Apache-2.0 · 5 forks · 0 issues · 7 pull requests · Updated Apr 23, 2025
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python · 1,200 stars · Apache-2.0 · 139 forks · 26 issues · 19 pull requests · Updated Apr 23, 2025