Skip to content
@NolanoOrg

Nolano.ai

Compressing Foundation models for deployment on clouds, phones and laptops

Popular repositories

  1. cformers cformers Public

    SoTA Transformers with C-backend for fast inference on your CPU.

    C 315 29

  2. smol-gpt smol-gpt Public

    Smol but mighty language model

    C 62 3

  3. sparse_quant_llms sparse_quant_llms Public

    SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia

    Python 38 3

  4. llama-int4-quant llama-int4-quant Public archive

    C 26 2

  5. InstructLLaMa.cpp InstructLLaMa.cpp Public

    Fast inference of Instruct tuned LLaMa on your personal devices.

    C 23 1

  6. pydalai pydalai Public

    Python 10 2

Repositories

Showing 9 of 9 repositories

Top languages

Loading…

Most used topics

Loading…