Skip to content
Change the repository type filter

All

    Repositories list

    • Efficient Track Anything for Kinara internal
      Python
      33000Updated Jan 6, 2025Jan 6, 2025
    • qwen.cpp

      Public
      C++ implementation of Qwen-LM
      C++
      61000Updated Jul 30, 2024Jul 30, 2024
    • LLM-QAT

      Public
      Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
      Python
      25000Updated Jul 12, 2024Jul 12, 2024
    • OmniQuant

      Public
      [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
      Python
      72000Updated Jul 11, 2024Jul 11, 2024
    • x280

      Public
      0000Updated Jul 8, 2024Jul 8, 2024
    • aimet

      Public
      AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
      Python
      438000Updated Jun 18, 2024Jun 18, 2024
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      C++
      27k000Updated Mar 29, 2023Mar 29, 2023
    • cereal

      Public
      A C++11 library for serialization
      C++
      821000Updated Aug 10, 2022Aug 10, 2022