Advanced Quantization Algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".
Updated Nov 12, 2024 - Python
A JAX implementation of stochastic addition.
Intelligent interface between Python-computed values and your LaTeX work.
Convex body sampling algorithms in Python
Round percentages that add up to 100 so that the rounded values also add up to 100, using the largest remainder method.
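As a rough illustration of the largest remainder method, here is a minimal sketch. The function name `round_percentages` and its `target` parameter are hypothetical and are not taken from the listed repository. It floors every value, then hands the missing units to the entries with the largest fractional parts.

```python
import math


def round_percentages(values, target=100):
    """Round values to integers that sum to `target` (largest remainder method).

    Assumes the inputs are non-negative and already sum to roughly `target`.
    """
    floors = [math.floor(v) for v in values]
    # Units still missing after flooring every value.
    leftover = target - sum(floors)
    # Indices ordered by fractional part, largest first (ties keep input order).
    order = sorted(
        range(len(values)),
        key=lambda i: values[i] - floors[i],
        reverse=True,
    )
    for i in order[:leftover]:
        floors[i] += 1
    return floors


result = round_percentages([13.626332, 47.989636, 9.596008, 28.788024])
```

Naively rounding each of those four inputs independently would give a total of 101, which is the problem this method avoids.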
Explore direct neighbors and limits of IEEE floating-point values.
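For readers who want to try the idea without the listed package, the standard library can already show the direct neighbors of a double. The sketch below assumes Python 3.9 or newer (for `math.nextafter` and `math.ulp`); the helper name `neighbors` is hypothetical.

```python
import math


def neighbors(x):
    """Return the closest representable doubles below and above x."""
    return math.nextafter(x, -math.inf), math.nextafter(x, math.inf)


below, above = neighbors(1.0)
# The gap above 1.0 is one unit in the last place; the gap below is half that,
# because the exponent drops when crossing a power of two.
gap_above = above - 1.0
gap_below = 1.0 - below
```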
Python 3 port: Stochastic Rounding
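Stochastic rounding, in its usual form, rounds a value up with probability equal to its fractional part, so the result is unbiased on average. The following is a minimal sketch of that general idea for rounding to integers; `stochastic_round` is a hypothetical name, not the API of any repository listed here.

```python
import math
import random


def stochastic_round(x, rng=random):
    """Round x to an adjacent integer, rounding up with probability frac(x)."""
    lower = math.floor(x)
    frac = x - lower
    # rng.random() is uniform on [0, 1), so this is true with probability frac.
    return lower + (1 if rng.random() < frac else 0)


# Averaged over many draws, the result approaches the unrounded value.
rng = random.Random(0)
samples = [stochastic_round(2.3, rng) for _ in range(10_000)]
mean = sum(samples) / len(samples)
```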
Opinionated pretty output of values and their errors rounded based on error.
Python module providing an easy way to set the precision of a floating-point number to a desired number of decimal places or a total number of significant digits.
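The difference between the two precision modes can be sketched with the built-in `round`. The helper `round_sig` below is a hypothetical illustration and not the listed module's interface.

```python
import math


def round_sig(x, digits):
    """Round x to `digits` significant digits."""
    if x == 0:
        return 0.0
    # Position of the leading digit, e.g. 5 for 123456.789 and -3 for 0.00123.
    exponent = math.floor(math.log10(abs(x)))
    return round(x, digits - 1 - exponent)


two_places = round(3.14159, 2)         # fixed number of decimal places
three_sig = round_sig(123456.789, 3)   # fixed number of significant digits
two_sig = round_sig(0.00123456, 2)
```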