Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOC] Documentation on Quantization #4435

Closed
masahi opened this issue Nov 27, 2019 · 4 comments
Closed

[DOC] Documentation on Quantization #4435

masahi opened this issue Nov 27, 2019 · 4 comments

Comments

@masahi
Copy link
Member

masahi commented Nov 27, 2019

With v0.6 adding quantization support, I think it is good time to add documentation on our quantization story.

There have been many questions on the forum, some of which are listed at the bottom. I myself have recently become interested in the topic, but I'm having hard time digging through the forum, github issues, PRs etc.

It would be great if we could add an end to end quantization usage tutorial or design document to our developer guide.

Current state of quantization effort
Is there a complete quantization tutorial including the use of various quantization algorithms and the addition of operators?
Quantization Configuration Documentation?
How to choose different quantization methods?
Int8 tutorial request
Understanding the quantization passes
Which operators are quantized in TVM?
Does tvm support quantization

@vinx13
Copy link
Member

vinx13 commented Nov 27, 2019

I'm working on cuda quantization tutorial, it will be out soon

@masahi
Copy link
Member Author

masahi commented May 16, 2020

#4667, #5321, #5362, #5595 added tutorials for quantization.

@masahi masahi closed this as completed May 16, 2020
@junrushao
Copy link
Member

@masahi Hey the last link should be #5595, not #5585

@masahi
Copy link
Member Author

masahi commented May 16, 2020

@junrushao1994 Thanks, fixed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants