Conversation
Code Review
This pull request introduces FP8 quantization support for the NeoPP model, including new configuration files and example scripts for 1k and 2k resolutions. It also updates weight registration in the transformer architecture to explicitly handle null biases and use default weight types for specific heads. Feedback is provided regarding several inconsistencies in the new 2k resolution example scripts where the index_offset_cond parameter does not match the provided KV cache filenames, which would likely cause incorrect RoPE indexing during inference.
| "/data/nvme1/yongyang/FL/neo_9b_new/vlm_tensor_44000_ema_2k/to_x2v_uncond_kv_1_12.pt", | ||
| ) | ||
| pipe.runner.set_inference_params( | ||
| index_offset_cond=366, |
The index_offset_cond value (366) does not match the offset indicated in the filename of the KV cache being loaded on line 48 (..._1_360.pt). This inconsistency will likely lead to incorrect RoPE indexing during inference. It appears this value was copy-pasted from the 1k example without adjustment.
Suggested change:

```diff
-    index_offset_cond=366,
+    index_offset_cond=360,
```
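The mismatch the reviewer flags could be caught automatically. Assuming the trailing number in the KV cache filename encodes the intended RoPE offset (a convention inferred from the `..._1_360.pt` reference above, not confirmed by the PR), a minimal sanity check might look like this; both helper names are hypothetical, not part of the codebase:

```python
import re


def offset_from_kv_filename(path: str) -> int:
    """Parse the trailing number from a KV cache filename,
    e.g. 'to_x2v_uncond_kv_1_360.pt' -> 360.

    Assumes the naming convention '<name>_<idx>_<offset>.pt'; adjust
    the pattern if the actual convention differs."""
    m = re.search(r"_(\d+)_(\d+)\.pt$", path)
    if m is None:
        raise ValueError(f"unrecognized KV cache filename: {path}")
    return int(m.group(2))


def check_index_offset(kv_path: str, index_offset_cond: int) -> None:
    """Raise if index_offset_cond disagrees with the offset encoded
    in the KV cache filename, instead of silently mis-indexing RoPE."""
    expected = offset_from_kv_filename(kv_path)
    if index_offset_cond != expected:
        raise ValueError(
            f"index_offset_cond={index_offset_cond} does not match "
            f"offset {expected} encoded in {kv_path}"
        )
```

Calling `check_index_offset(...)` right before `pipe.runner.set_inference_params(...)` in each example script would have surfaced the copy-paste error at startup rather than as silently wrong inference output.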
| "/data/nvme1/yongyang/FL/neo_9b_new/vlm_tensor_44000_ema_2k/to_x2v_uncond_kv_2_15.pt", | ||
| ) | ||
| pipe.runner.set_inference_params( | ||
| index_offset_cond=441, |