
Conversation

@Edwardf0t1 (Contributor) commented Nov 21, 2025

Add modelopt quantization blog

Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
@eduand-alvarez (Contributor) left a comment


Update instances of ModelOpt, capitalize Blackwell, and make sure the perf chart is rendering properly.

The models optimized through this new API deliver a significant performance boost. Better yet, these optimizations can be stacked with other software components in the NVIDIA software-hardware stack and across the various embodiments of the latest blackwell architecture, from the DGX Spark to the GB300 NVL72.


![DSR1-nvfp4-perf.jpg](/images/blog/nvidia-modelopt-quantization/DSR1-nvfp4-perf.jpg)
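
For readers of this thread, the "new API" above refers to ModelOpt's post-training quantization entry point. Below is a minimal sketch of what that flow typically looks like; the model name, calibration data, and the `NVFP4_DEFAULT_CFG` config name are illustrative assumptions rather than details taken from the blog under review, so check the ModelOpt docs for the exact configuration to use.

```python
# Hypothetical sketch of post-training quantization with NVIDIA TensorRT
# Model Optimizer (ModelOpt). Model, calibration data, and config name are
# placeholders for illustration only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
import modelopt.torch.quantization as mtq

model_id = "meta-llama/Llama-3.1-8B"  # placeholder model
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16).cuda()
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Calibration data: a handful of representative prompts so ModelOpt can
# collect activation statistics during quantization.
calib_texts = ["Example calibration prompt."] * 8  # placeholder data

def forward_loop(model):
    # ModelOpt calls this to run calibration forward passes.
    with torch.no_grad():
        for text in calib_texts:
            inputs = tokenizer(text, return_tensors="pt").to(model.device)
            model(**inputs)

# NVFP4 targets the FP4 format supported on Blackwell GPUs; the exact
# config name is an assumption here.
model = mtq.quantize(model, mtq.NVFP4_DEFAULT_CFG, forward_loop)
```

The quantized model can then be exported and served through an inference stack such as SGLang, which is what the performance chart above is meant to illustrate.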
@eduand-alvarez (Contributor) commented:

Image isn't rendering, please make sure it is embedded properly.

@Edwardf0t1 (Contributor, Author) replied:

Thanks @eduand-alvarez, I checked that the path is correct.

@sglang-bot @JustinTong0323 @merrymercy I assume this should be fine, right?

Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
@Ying1123 merged commit 9a872a0 into lm-sys:main on Dec 2, 2025
