@ModelCloud

ModelCloud.ai

Our mission is to give everyone, including bots, unlimited and free access to LLM/AI models.

Pinned

  1. GPTQModel (Public)

    GPTQ-based LLM compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.

    Python, 114 stars, 26 forks

Repositories

  • GPTQModel (Public)

    GPTQ-based LLM compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.

    Python, 114 stars, Apache-2.0 license, 26 forks, 4 open issues, 1 open pull request, updated Nov 1, 2024
