Skip to content

Conversation

matthewdouglas
Copy link
Member

This PR loosens the fp32 tolerance requirements for the 4bit CPU quantization tests from those introduced in #1721. The tolerances for the default blocksize of 64, as well as for all fp16/bf16 tests, remains the same.

@matthewdouglas matthewdouglas added this to the v0.48.0 milestone Sep 8, 2025
Copy link

github-actions bot commented Sep 8, 2025

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@matthewdouglas matthewdouglas merged commit d731fc4 into main Sep 8, 2025
120 of 121 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant