Skip to content
@bigscience-workshop

BigScience Workshop

Research workshop on large language models - The Summer of Language Models 21

Popular repositories Loading

  1. petals petals Public

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Python 9.5k 548

  2. promptsource promptsource Public

    Toolkit for creating, sharing and using natural language prompts.

    Python 2.8k 365

  3. Megatron-DeepSpeed Megatron-DeepSpeed Public

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1.4k 223

  4. bigscience bigscience Public

    Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

    Shell 989 101

  5. xmtf xmtf Public

    Crosslingual Generalization through Multitask Finetuning

    Jupyter Notebook 529 38

  6. biomedical biomedical Public

    Tools for curating biomedical training data for large-scale language modeling

    Python 472 117

Repositories

Showing 10 of 35 repositories
  • data_tooling Public

    Tools for managing datasets for governance and training.

    HTML 83 Apache-2.0 48 138 (2 issues need help) 3 Updated Feb 3, 2025
  • biomedical Public

    Tools for curating biomedical training data for large-scale language modeling

    Python 472 117 163 (6 issues need help) 16 Updated Dec 9, 2024
  • xmtf Public

    Crosslingual Generalization through Multitask Finetuning

    Jupyter Notebook 529 Apache-2.0 38 11 0 Updated Sep 22, 2024
  • petals Public

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Python 9,512 MIT 548 90 (9 issues need help) 19 Updated Sep 7, 2024
  • bigscience Public

    Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

    Shell 989 101 13 8 Updated Jul 29, 2024
  • Megatron-DeepSpeed Public

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1,380 223 74 (10 issues need help) 45 Updated Mar 20, 2024
  • multilingual-modeling Public

    BLOOM+1: Adapting BLOOM model to support a new unseen language

    Python 71 Apache-2.0 15 13 6 Updated Mar 2, 2024
  • promptsource Public

    Toolkit for creating, sharing and using natural language prompts.

    Python 2,804 Apache-2.0 365 11 32 Updated Oct 23, 2023
  • massive-probing-framework Public Forked from AIRI-Institute/Probing_framework

    Framework for BLOOM probing

    Python 8 8 0 0 Updated Oct 17, 2023
  • Python 95 Apache-2.0 319 4 5 Updated Jul 25, 2023

Top languages

Loading…

Most used topics

Loading…