[Tracker] General feature requests for torchao #65

Open
9 tasks
supriyar opened this issue Mar 19, 2024 · 1 comment
supriyar commented Mar 19, 2024

This issue tracks outstanding feature requests for torchao. If you'd like a specific feature to be added to torchao, please comment directly here.

Quantization Techniques (planned and newly requested)

  • GPTQ
  • HQQ

DTypes

  • fp8
  • mx format

Sparsity APIs

  • int8 + 2:4 sparsity
  • fp8 + 2:4 sparsity
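
For readers unfamiliar with the term, 2:4 (semi-structured) sparsity means that at most two values in every contiguous group of four are non-zero. The sketch below is plain Python and not a torchao API: the function name `prune_2_4` and the magnitude-based selection rule are assumptions chosen for illustration.

```python
def prune_2_4(weights):
    """Zero the two smallest-magnitude values in every group of four.

    Illustrative helper only (hypothetical name, not part of torchao).
    """
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    pruned = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two largest magnitudes; on ties the earlier index wins
        # because sorted() is stable.
        keep = sorted(range(4), key=lambda i: -abs(group[i]))[:2]
        pruned.extend(v if i in keep else 0.0 for i, v in enumerate(group))
    return pruned


row = [0.1, -0.9, 0.5, 0.2, 1.0, 2.0, -3.0, 0.0]
print(prune_2_4(row))
```

Combining this pattern with int8 or fp8 would mean quantizing the surviving values; the sketch leaves them in floating point.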

Kernels

cc @cpuhrsch

@msaroufim msaroufim pinned this issue Mar 19, 2024
@mergennachin

GGUF conversion and kernels

  • Add gguf conversion scripts, using gpt-fast/model.py as the source
  • Add a torch subclass to ao that does the quantization
  • Add a ggml kernel with a wrapper so that it executes in eager mode but calls the specific kernel
  • Add a reference Python implementation
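
As a rough idea of what a reference Python implementation could look like, here is a minimal sketch of symmetric per-block int8 quantization, the general scheme behind 8-bit block formats. It is not the GGUF on-disk layout and not a torchao API: the function names, the default block size of 32, and keeping scales as Python floats are all assumptions made for illustration.

```python
def quantize_blocks(values, block_size=32):
    """Quantize a flat list of floats to (scale, int8 values) per block.

    Hypothetical helper: block_size=32 is an assumed default.
    """
    blocks = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        amax = max(abs(v) for v in block)
        # One scale per block maps the largest magnitude to +/-127.
        scale = amax / 127.0 if amax > 0 else 1.0
        quants = [int(round(v / scale)) for v in block]
        blocks.append((scale, quants))
    return blocks


def dequantize_blocks(blocks):
    """Reconstruct approximate floats from (scale, quants) pairs."""
    return [scale * q for scale, quants in blocks for q in quants]


data = [0.0, 1.0, -2.0, 0.5, 0.25, -0.75, 1.5, 2.0]
packed = quantize_blocks(data, block_size=4)
restored = dequantize_blocks(packed)
print(max(abs(a - b) for a, b in zip(data, restored)))
```

A reference like this is mainly useful for checking a fused kernel's output against a slow but obviously correct path.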

cc @larryliu0820

@msaroufim msaroufim unpinned this issue May 18, 2024
