Skip to content

Pinned Loading

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.5k 585

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.2k 130

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 233

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 10.7k 716

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 698 60

Repositories

Showing 10 of 494 repositories
  • python-package-template Public template

    A template repo for Python packages

    Python 484 Apache-2.0 76 2 9 Updated Mar 31, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5,450 Apache-2.0 585 51 54 Updated Mar 31, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 177 Apache-2.0 31 1 18 Updated Mar 30, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 16 Apache-2.0 5 1 5 Updated Mar 30, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 2,849 Apache-2.0 368 17 12 Updated Mar 30, 2025
  • Python 6 Apache-2.0 2 6 3 Updated Mar 28, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 10,720 Apache-2.0 716 74 17 Updated Mar 28, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,318 Apache-2.0 233 248 4 Updated Mar 28, 2025
  • ai2-scholarqa-lib Public

    Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

    Python 138 Apache-2.0 21 1 0 Updated Mar 27, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 30 Apache-2.0 2 5 0 Updated Mar 26, 2025