Skip to content
@neuralmagic

Neural Magic

Neural Magic helps developers in accelerating machine learning performance using automated model sparsification techniques and inference technologies.

Pinned

  1. nm-vllm nm-vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 177 7

  2. deepsparse deepsparse Public

    Sparsity-aware deep learning inference runtime for CPUs

    Python 2.9k 168

  3. sparseml sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2k 140

  4. docs docs Public

    Top-level directory for documentation and general content

    MDX 120 7

  5. examples examples Public

    Notebooks using the Neural Magic libraries 📓

    Jupyter Notebook 38 6

  6. sparsezoo sparsezoo Public

    Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

    Python 357 23

Repositories

Showing 10 of 38 repositories