Skip to content
@bigscience-workshop

BigScience Workshop

Research workshop on large language models - The Summer of Language Models 21

Popular repositories Loading

  1. petals petals Public

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Python 9.5k 542

  2. promptsource promptsource Public

    Toolkit for creating, sharing and using natural language prompts.

    Python 2.8k 363

  3. Megatron-DeepSpeed Megatron-DeepSpeed Public

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1.4k 224

  4. bigscience bigscience Public

    Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

    Shell 986 101

  5. xmtf xmtf Public

    Crosslingual Generalization through Multitask Finetuning

    Jupyter Notebook 528 38

  6. biomedical biomedical Public

    Tools for curating biomedical training data for large-scale language modeling

    Python 470 117

Repositories

Showing 10 of 35 repositories
  • data_tooling Public

    Tools for managing datasets for governance and training.

    HTML 82 Apache-2.0 48 138 (2 issues need help) 3 Updated Feb 3, 2025
  • biomedical Public

    Tools for curating biomedical training data for large-scale language modeling

    Python 470 117 163 (6 issues need help) 16 Updated Dec 9, 2024
  • xmtf Public

    Crosslingual Generalization through Multitask Finetuning

    Jupyter Notebook 528 Apache-2.0 38 11 0 Updated Sep 22, 2024
  • petals Public

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Python 9,497 MIT 542 90 (9 issues need help) 19 Updated Sep 7, 2024
  • bigscience Public

    Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

    Shell 986 101 13 8 Updated Jul 29, 2024
  • Megatron-DeepSpeed Public

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1,373 224 74 (10 issues need help) 45 Updated Mar 20, 2024
  • multilingual-modeling Public

    BLOOM+1: Adapting BLOOM model to support a new unseen language

    Python 71 Apache-2.0 15 13 6 Updated Mar 2, 2024
  • promptsource Public

    Toolkit for creating, sharing and using natural language prompts.

    Python 2,792 Apache-2.0 363 11 32 Updated Oct 23, 2023
  • massive-probing-framework Public Forked from AIRI-Institute/Probing_framework

    Framework for BLOOM probing

    Python 8 8 0 0 Updated Oct 17, 2023
  • Python 95 Apache-2.0 319 4 5 Updated Jul 25, 2023

Top languages

Loading…

Most used topics

Loading…