Skip to content

Conversation

@vkuzo
Copy link
Contributor

@vkuzo vkuzo commented Dec 3, 2025

Summary:

with-proxy python
benchmarks/mx_formats/vllm/create_quantized_hf_model.py
~/local/tmp/20251203_test_model_mxfp8

with-proxy vllm bench throughput --model
~/local/tmp/20251203_test_model_mxfp8/ --dataset-name sonnet
--dataset-path ~/local/vllm/benchmarks/sonnet.txt --num-prompts 1024
--tensor-parallel-size 1 --max-model-len 2048 --gpu-memory-utilization
0.8

currently fails with compile error (PyTorch 2.9)

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]
@vkuzo
Copy link
Contributor Author

vkuzo commented Dec 3, 2025

Stack from ghstack (oldest at bottom):

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 3, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3426

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 2 Unrelated Failures

As of commit ec96c24 with merge base 16aad7c (image):

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vkuzo added a commit that referenced this pull request Dec 3, 2025
Summary:

```
with-proxy python
benchmarks/mx_formats/vllm/create_quantized_hf_model.py
~/local/tmp/20251203_test_model_mxfp8

with-proxy vllm bench throughput --model
~/local/tmp/20251203_test_model_mxfp8/ --dataset-name sonnet
--dataset-path ~/local/vllm/benchmarks/sonnet.txt --num-prompts 1024
--tensor-parallel-size 1 --max-model-len 2048 --gpu-memory-utilization
0.8
```

currently fails with compile error (PyTorch 2.9)

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 620d504
ghstack-comment-id: 3608599132
Pull-Request: #3426
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants