Skip to content
Change the repository type filter

Forks

    Repositories list

    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      1.2k000Updated Feb 12, 2025Feb 12, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      6.2k001Updated Feb 5, 2025Feb 5, 2025
    • llama.cpp

      Public
      LLM inference in C/C++
      C++
      MIT License
      11k000Updated Jan 31, 2025Jan 31, 2025
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.3k000Updated Aug 27, 2024Aug 27, 2024
    • Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
      Python
      MIT License
      55100Updated Aug 23, 2024Aug 23, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.2k100Updated Jun 24, 2024Jun 24, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.2k000Updated May 31, 2024May 31, 2024
    • Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
      Python
      Apache License 2.0
      49300Updated May 3, 2024May 3, 2024
    • A lightweight python gRPC client to communicate with TensorFlow Serving
      C++
      Apache License 2.0
      15000Updated Apr 8, 2024Apr 8, 2024
    • peft

      Public
      🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
      Python
      Apache License 2.0
      1.8k000Updated Mar 6, 2024Mar 6, 2024
    • Python
      8100Updated Jan 26, 2024Jan 26, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      28k100Updated Jan 13, 2024Jan 13, 2024
    • Puzzle Generator; Einstein's Riddle, Zebra Puzzle and Blood Donation Puzzle Solver. For non-commercial use only!
      Python
      9000Updated Dec 8, 2023Dec 8, 2023
    • xformers

      Public
      Hackable and optimized Transformers building blocks, supporting a composable construction.
      Python
      Other
      650100Updated Nov 29, 2023Nov 29, 2023
    • An extension for flake8 that forbids some imports statements in some modules.
      Python
      MIT License
      3000Updated May 17, 2023May 17, 2023
    • Fast Inference Solutions for BLOOM
      Python
      Apache License 2.0
      113000Updated Nov 21, 2022Nov 21, 2022
    16 repositories found. List is sorted by Last pushed in descending order.