We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
0.0.40 shipped the first version of embedding quant.
--embedding-dtype int8
This Issue is looking for testers, to verify the real life performance of these features at real datasets.
E.g. delivering a benchmark of int8 quantization - can be added under ./docs
./docs
The text was updated successfully, but these errors were encountered:
@mahiro72 Expressed interest in working on this fyi!
Sorry, something went wrong.
No branches or pull requests
System Info
0.0.40 shipped the first version of embedding quant.
--embedding-dtype int8
This Issue is looking for testers, to verify the real life performance of these features at real datasets.
Information
Tasks
Reproduction
Expected behavior
E.g. delivering a benchmark of int8 quantization - can be added under
./docs
The text was updated successfully, but these errors were encountered: