Docs Revamp #181

Open · 21 of 44 tasks · Tracked by #252
msaroufim opened this issue Apr 26, 2024 · 0 comments
Labels: documentation (Improvements or additions to documentation)


Just listing out all the issues I'm seeing with our docs; feel free to pick something up and fix it. As a first step, add your documentation to a relevant subfolder in the repo and tag me for review.

For API docstrings and end-user usage instructions that won't change much, please put them in https://github.com/pytorch/ao/tree/main/docs so they get rendered on pytorch.org/docs.
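
For reference, a sketch of a docstring shape that renders well under Sphinx there; `quantize_weights` is a hypothetical helper, shown only to illustrate the structure:

```python
def quantize_weights(model, bits=8):
    """Quantize the Linear weights of ``model`` in place.

    (Hypothetical helper; only the docstring layout matters here.)

    Args:
        model (torch.nn.Module): model whose Linear weights get quantized.
        bits (int): target bit width, e.g. ``8`` or ``4``.

    Returns:
        torch.nn.Module: the same model, mutated in place.

    Example::

        >>> model = quantize_weights(model, bits=8)
    """
    return model  # body elided; this sketch is about the docstring
```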

Numbers

The repo is primarily about performance, so we should share performance tables directly in the README until we figure out a dashboard-like solution.

For each sparsity or quantization technique you're working on, feel free to add another subsection. For the usage-instruction items, see the sketch after the list below.

  • Autotuner
  • Quantization
    • Usage instructions
    • Performance benchmarks on llama2 or llama3 @HDCharles
    • Accuracy
  • Sparsity
    • Usage instructions
    • Performance benchmarks on llama2 or llama3
    • Accuracy
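
As a starting point for the quantization usage instructions, a minimal sketch assuming torchao's `quantize_` API with the `int8_weight_only` config (adapt per technique):

```python
import torch
from torchao.quantization import quantize_, int8_weight_only

# Any nn.Module works; a single Linear keeps the sketch small.
model = torch.nn.Sequential(torch.nn.Linear(1024, 1024)).cuda().to(torch.bfloat16)

# Swap each Linear's weight for an int8 weight-only quantized tensor, in place.
quantize_(model, int8_weight_only())

# Compile afterwards so inductor can fuse dequantization into the matmul.
model = torch.compile(model, mode="max-autotune")
```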

End-to-end tutorials

  • Revamp the main README.md to highlight the features we most want to advertise
  • How to configure compile for consumer GPUs (see sketch after this list)
  • End to end tutorial with llama3
  • Run an evaluation with eleuther eval @andrewor14
  • torch.ao.pruning accuracy benchmarks on llama2 or llama3
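
For the consumer-GPU compile item above, a sketch of the kind of settings the tutorial could cover; which flags to actually recommend is an open question for the tutorial:

```python
import torch

# Search more kernel configurations during autotuning; compilation is
# slower, but the resulting kernels are often faster, which matters most
# on consumer GPUs with limited memory bandwidth.
torch._inductor.config.coordinate_descent_tuning = True

# Cache compiled graphs on disk so repeat runs skip the cold start.
torch._inductor.config.fx_graph_cache = True

model = torch.nn.Linear(512, 512)
compiled = torch.compile(model, mode="max-autotune")
```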

Core concepts

  • Sparsity patterns and how they work (see sketch after this list)
  • What are the different kinds of quantization algorithms
  • How to make quantization/sparsity kernels faster
  • Sparsity for LLMs overview @jcaip
  • docs for AffineQuantizedTensor in quantization.md @jerryzh168
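
For the sparsity-patterns doc above, a minimal sketch of 2:4 semi-structured sparsity using PyTorch's built-in support. It needs an Ampere-or-newer GPU, and the keep-top-2-per-group pruning rule is a toy illustration, not a recommended algorithm:

```python
import torch
from torch.sparse import to_sparse_semi_structured

# 2:4 semi-structured sparsity: in every contiguous group of 4 weights,
# at most 2 are nonzero. Toy rule: keep the 2 largest magnitudes per group.
w = torch.randn(128, 128, device="cuda", dtype=torch.float16)
groups = w.view(-1, 4)
keep = groups.abs().topk(2, dim=-1).indices
mask = torch.zeros_like(groups, dtype=torch.bool).scatter_(-1, keep, True)
w_pruned = (groups * mask).view_as(w)

# Compress into the layout that sparse tensor cores consume.
w_sparse = to_sparse_semi_structured(w_pruned)

x = torch.randn(64, 128, device="cuda", dtype=torch.float16)
y = torch.nn.functional.linear(x, w_sparse)  # x @ w_pruned.T with ~2x less weight traffic
```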

Contributing

  • How to test
  • Version guards (see sketch after this list)
  • How to benchmark
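
For the version-guards doc above, a minimal sketch of the pattern, assuming plain version parsing rather than any torchao-specific helper:

```python
import unittest

import torch
from packaging.version import parse

# Guard tests (and imports) on the PyTorch version they require.
TORCH_AT_LEAST_2_3 = parse(torch.__version__) >= parse("2.3.0")

class TestNewKernel(unittest.TestCase):
    @unittest.skipIf(not TORCH_AT_LEAST_2_3, "needs a PyTorch 2.3+ API")
    def test_kernel(self):
        ...
```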

Features

  • AOT inductor and no python overhead tutorial @jerryzh168
  • Update the autoquant tutorial to work out of the box with llama2/3; it should be a copy-pastable snippet using our llama model in torchao (see sketch after this list)
  • The SmoothQuant tutorial is placeholder code; it needs an actual runnable snippet or should be moved to prototype
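
For the autoquant item above, the copy-pastable snippet could look like this sketch; it uses a stand-in module, with wiring in our llama model left to the tutorial:

```python
import torch
import torchao

model = torch.nn.Sequential(torch.nn.Linear(1024, 1024)).cuda().to(torch.bfloat16)

# autoquant wraps the compiled model, benchmarks several quantization
# techniques per layer on the observed inputs, and keeps the fastest.
model = torchao.autoquant(torch.compile(model, mode="max-autotune"))

# The first real input triggers benchmarking and finalizes the choices.
x = torch.randn(16, 1024, device="cuda", dtype=torch.bfloat16)
model(x)
```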

Composability

  • We don't have an NF4 tutorial @drisspg
    • How it works
    • Benchmarks
    • FSDP 2 composition
  • How to support new smaller dtypes
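
For the smaller-dtypes item, the doc will presumably center on tensor subclasses; below is a heavily simplified `__torch_dispatch__` skeleton (names are illustrative, not torchao APIs):

```python
import torch
from torch.utils._pytree import tree_map

class MyLowBitTensor(torch.Tensor):
    """Illustrative skeleton: packed payload plus per-row scales,
    dequantized on the fly inside __torch_dispatch__."""

    @staticmethod
    def __new__(cls, packed, scales, shape):
        # Wrapper subclass: advertises shape/dtype but owns no storage.
        return torch.Tensor._make_wrapper_subclass(cls, shape, dtype=torch.bfloat16)

    def __init__(self, packed, scales, shape):
        self.packed = packed    # e.g. a uint8 tensor holding packed values
        self.scales = scales    # one scale per row in this toy scheme

    def dequantize(self):
        return self.packed.to(torch.bfloat16) * self.scales[:, None]

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        # Fallback path: dequantize our tensors and run the op densely.
        # A real dtype intercepts hot ops (mm, linear) with fused kernels.
        def unwrap(t):
            return t.dequantize() if isinstance(t, cls) else t
        args = tree_map(unwrap, args)
        kwargs = tree_map(unwrap, kwargs or {})
        return func(*args, **kwargs)
```

A dequantize-everything fallback like this is slow but correct; the doc can then show how overriding the hot ops recovers performance.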

Docstrings

e.g. https://pytorch.org/ao/stable/generated/torchao.sparsity.WandaSparsifier.html#torchao.sparsity.WandaSparsifier

Confirm they're visible on pytorch.org

Completed

  • We don't have a wanda tutorial @jcaip
  • For sparsity, we mention tons of algorithms but should suggest a simple one people can start with @jcaip
  • Our main goals are performance, composability with torch.compile and FSDP, and easy packaging for wide reach @msaroufim
  • In the main README when we talk about features we should link to usage instructions and code not papers @msaroufim
  • Mention HQQ, GaLore and prototype folder somewhere in main docs @msaroufim
  • A doc for how to register a new custom OP for both C++ and Triton @msaroufim
  • Mention tinygemm @msaroufim