[TRTLLM-11339][fix] Wan tests refactor + small transformer fix #12128

chang-l merged 10 commits into NVIDIA:main
Conversation
/bot run --disable-fail-fast
📝 Walkthrough

The PR modifies WAN transformer cross-attention handling to use QKV-based computation with optional normalization, adjusts LayerNorm epsilon values, and improves dtype/device handling in final projections. Test coverage is reorganized by replacing a large I2V test suite with new focused integration and feature tests for Wan 2.1/2.2 models across T2V/I2V variants and quantization modes.
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~75 minutes

🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 1
🧹 Nitpick comments (5)
tests/unittest/_torch/visual_gen/test_wan21_t2v_teacache.py (1)
141-148: Use explicit `float | None` type annotation.

Same as in the I2V teacache test: `expected_hit_rate: float = None` should use explicit union syntax.

♻️ Proposed fix

```diff
 def _assert_single_stage_teacache(
     pipeline,
     height: int,
     width: int,
     model: str = "",
-    expected_hit_rate: float = None,
+    expected_hit_rate: float | None = None,
     atol: float = 0.02,
 ) -> None:
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `tests/unittest/_torch/visual_gen/test_wan21_t2v_teacache.py` around lines 141-148: update the function signature for `_assert_single_stage_teacache` so the optional `expected_hit_rate` parameter uses explicit union syntax (`expected_hit_rate: float | None`) instead of assigning a bare None to a plain float; keep the default value as None and update any other occurrences or type hints referring to `expected_hit_rate` in that function to match the new `float | None` annotation.

tests/unittest/_torch/visual_gen/test_wan22_t2v_pipeline.py (1)
275-275: Minor inconsistency: string literals vs PipelineComponent enum.

`_SKIP_AUX` uses string literals ("text_encoder", "vae", etc.) while `test_wan_features.py` uses `PipelineComponent` enum values. Both work, but using the enum would be more type-safe and consistent.

♻️ Optional: Use PipelineComponent enum

```diff
+from tensorrt_llm._torch.visual_gen.config import PipelineComponent

-_SKIP_AUX = ["text_encoder", "vae", "tokenizer", "scheduler"]
+_SKIP_AUX = [
+    PipelineComponent.TEXT_ENCODER,
+    PipelineComponent.VAE,
+    PipelineComponent.TOKENIZER,
+    PipelineComponent.SCHEDULER,
+]
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `tests/unittest/_torch/visual_gen/test_wan22_t2v_pipeline.py` at line 275: the test defines `_SKIP_AUX` as string literals ("text_encoder", "vae", "tokenizer", "scheduler"), which is inconsistent with other tests that use the `PipelineComponent` enum; change `_SKIP_AUX` to use the `PipelineComponent` enum members (e.g., PipelineComponent.TEXT_ENCODER, PipelineComponent.VAE, PipelineComponent.TOKENIZER, PipelineComponent.SCHEDULER) so the test is type-safe and consistent with test_wan_features.py, and ensure you import PipelineComponent if not already imported.

tests/unittest/_torch/visual_gen/test_wan_features.py (1)
380-396: Consider consolidating FP8 op availability checks.

The same availability check pattern appears in `test_fp8_weights_loaded` and `test_fp8_block_scales_weights_loaded`. A shared helper or pytest fixture could reduce duplication.

♻️ Optional: Extract shared FP8 availability check

```python
def _skip_if_fp8_ops_unavailable():
    """Skip test if FP8 quantization ops are not available."""
    try:
        if not hasattr(torch.ops, "tensorrt_llm"):
            pytest.skip("tensorrt_llm torch ops not available")
        _ = torch.ops.tensorrt_llm.quantize_e4m3_per_tensor
        _ = torch.ops.tensorrt_llm.quantize_e4m3_activation
    except (AttributeError, RuntimeError) as e:
        pytest.skip(f"FP8 quantization ops not available: {e}")
```
_skip_if_fp8_ops_unavailable()at the start of each FP8-related test.🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `tests/unittest/_torch/visual_gen/test_wan_features.py` around lines 380-396: extract the repeated FP8-op availability logic into a shared helper (e.g., `_skip_if_fp8_ops_unavailable`) and call it at the start of FP8-related tests like `test_fp8_weights_loaded` and `test_fp8_block_scales_weights_loaded`; the helper should perform the same checks currently in `test_fp8_weights_loaded` (check `torch.ops.tensorrt_llm` presence and access `quantize_e4m3_per_tensor` and `quantize_e4m3_activation`, catching AttributeError/RuntimeError and calling `pytest.skip` with the error message) so the tests simply invoke `_skip_if_fp8_ops_unavailable()` before proceeding.

tests/unittest/_torch/visual_gen/test_wan21_i2v_teacache.py (1)
154-161: Use explicit `float | None` type annotation.

The static analysis tool correctly flags that `expected_hit_rate: float = None` implicitly creates an `Optional[float]`. Python 3.10+ supports the union syntax directly.

♻️ Proposed fix

```diff
 def _assert_i2v_teacache(
     pipeline,
     height: int,
     width: int,
     model: str = "",
-    expected_hit_rate: float = None,
+    expected_hit_rate: float | None = None,
     atol: float = 0.02,
 ) -> None:
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `tests/unittest/_torch/visual_gen/test_wan21_i2v_teacache.py` around lines 154-161: the parameter type for `expected_hit_rate` in `_assert_i2v_teacache` should use explicit union syntax instead of assigning None to a plain float; change the annotation to `expected_hit_rate: float | None` so the type is correctly `Optional[float]` under Python 3.10+, while leaving the default value as None and keeping the other parameters (height, width, model, atol) unchanged.

tests/unittest/_torch/visual_gen/test_wan22_i2v_pipeline.py (1)
39-43: Consider importing modules rather than classes directly.

Per coding guidelines, prefer importing the module and accessing classes through the namespace (e.g., `import diffusers` then `diffusers.DiffusionPipeline`). However, `from PIL import Image` and similar patterns are idiomatic in the ML ecosystem and widely used in this codebase.

♻️ Example if aligning strictly with guidelines

```diff
-from diffusers import DiffusionPipeline
-from PIL import Image
+import diffusers
+from PIL import Image  # PIL.Image is the idiomatic import

-from tensorrt_llm._torch.visual_gen.config import AttentionConfig, TorchCompileConfig, VisualGenArgs
-from tensorrt_llm._torch.visual_gen.pipeline_loader import PipelineLoader
+from tensorrt_llm._torch.visual_gen import config, pipeline_loader
```
diffusers.DiffusionPipeline,config.AttentionConfig,pipeline_loader.PipelineLoader, etc.As per coding guidelines: "Import the module, not individual classes or functions (e.g., use
from package.subpackage import foothenfoo.SomeClass()instead offrom package.subpackage.foo import SomeClass)".🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tests/unittest/_torch/visual_gen/test_wan22_i2v_pipeline.py` around lines 39 - 43, Replace direct class imports with module imports for third-party and local packages: change "from diffusers import DiffusionPipeline" to "import diffusers" and update usages to diffusers.DiffusionPipeline; change "from tensorrt_llm._torch.visual_gen.config import AttentionConfig, TorchCompileConfig, VisualGenArgs" to "from tensorrt_llm._torch import visual_gen as visual_gen" (or "import tensorrt_llm._torch.visual_gen as config") and update references to config.AttentionConfig, config.TorchCompileConfig, config.VisualGenArgs; similarly import the pipeline loader module (e.g., "import tensorrt_llm._torch.visual_gen.pipeline_loader as pipeline_loader") and use pipeline_loader.PipelineLoader; keep "from PIL import Image" as-is since PIL.Image is idiomatic.
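The convention is easiest to see with a stdlib module (purely illustrative; not the project code):

```python
# Preferred: import the module and reference names through its namespace.
import json

data = json.loads('{"frames": 16}')
print(data["frames"])  # 16

# Discouraged by the guideline: `from json import loads` would pull the
# function into the local namespace, losing the `json.` prefix that tells
# readers at a glance where `loads` comes from.
```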
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@tests/unittest/_torch/visual_gen/test_wan22_i2v_pipeline.py`:
- Around line 57-74: The module currently resolves WAN22_I2V_PATH at import time
by calling _llm_models_root() via the module-level WAN22_I2V_PATH =
_checkpoint(...) which can raise an AssertionError during import and break
pytest collection; change this to lazy resolution by removing the module-level
WAN22_I2V_PATH assignment and instead provide a function (e.g.,
_get_wan22_i2v_path()) or a pytest fixture that calls
_checkpoint(...)/_llm_models_root() at test runtime, and when the path is
missing use pytest.skip(...) or return None so tests can skip gracefully rather
than failing at import.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: c4435127-7394-48ae-8493-d2c90ed94082
📒 Files selected for processing (11)

- tensorrt_llm/_torch/visual_gen/models/wan/transformer_wan.py
- tests/unittest/_torch/visual_gen/test_wan.py
- tests/unittest/_torch/visual_gen/test_wan21_i2v_pipeline.py
- tests/unittest/_torch/visual_gen/test_wan21_i2v_teacache.py
- tests/unittest/_torch/visual_gen/test_wan21_t2v_pipeline.py
- tests/unittest/_torch/visual_gen/test_wan21_t2v_teacache.py
- tests/unittest/_torch/visual_gen/test_wan22_i2v_pipeline.py
- tests/unittest/_torch/visual_gen/test_wan22_t2v_pipeline.py
- tests/unittest/_torch/visual_gen/test_wan_features.py
- tests/unittest/_torch/visual_gen/test_wan_i2v.py
- tests/unittest/_torch/visual_gen/test_wan_transformer.py
💤 Files with no reviewable changes (1)
- tests/unittest/_torch/visual_gen/test_wan_i2v.py
/bot run --disable-fail-fast

PR_Github #38733 [ run ] triggered by Bot. Commit:

PR_Github #38733 [ run ] completed with state

@chang-l do you mind giving a review when you get a chance? Thank you!

47bb15c to bb25d9d (Compare)
/bot run --disable-fail-fast

PR_Github #39515 [ run ] triggered by Bot. Commit:

PR_Github #39515 [ run ] completed with state

dc232cd to 771ebb7 (Compare)
/bot run --disable-fail-fast

1 similar comment

/bot run --disable-fail-fast

PR_Github #42175 [ run ] triggered by Bot. Commit:

PR_Github #42175 [ run ] completed with state

2543edf to 142602f (Compare)
/bot run --disable-fail-fast

PR_Github #42590 [ run ] triggered by Bot. Commit:

PR_Github #42590 [ run ] completed with state

142602f to bb2bcf6 (Compare)
/bot run --disable-fail-fast

PR_Github #43845 [ run ] triggered by Bot. Commit:

PR_Github #43845 [ run ] completed with state

/bot run --disable-fail-fast

PR_Github #43949 [ run ] triggered by Bot. Commit:

PR_Github #43949 [ run ] completed with state
Signed-off-by: Olivia Stoner <245287810+o-stoner@users.noreply.github.com>

bb2bcf6 to 22d7f92 (Compare)
/bot run --disable-fail-fast

Signed-off-by: Olivia Stoner <245287810+o-stoner@users.noreply.github.com>

/bot run --disable-fail-fast

PR_Github #44062 [ run ] triggered by Bot. Commit:

PR_Github #44063 [ run ] triggered by Bot. Commit:

PR_Github #44062 [ run ] completed with state

PR_Github #44063 [ run ] completed with state
Signed-off-by: o-stoner <245287810+o-stoner@users.noreply.github.com>
c2e1cfa to d3f0a94 (Compare)
/bot run --disable-fail-fast

PR_Github #44502 [ run ] triggered by Bot. Commit:

PR_Github #44502 [ run ] completed with state
/bot run --disable-fail-fast

PR_Github #44535 [ run ] triggered by Bot. Commit:

PR_Github #44535 [ run ] completed with state
tburt-nv left a comment:

Looks good to me; ftfy and wcwidth are Apache and MIT licensed, respectively.
Description
Refactors the Wan tests to be more modular, aligning with the structure of the current Flux tests, and updates the Wan 2.1 I2V tests to use the checkpoint now available in /home/scratch.trt_llm_data_ci/llm-models/ after this merge request was merged: https://gitlab-master.nvidia.com/ftp/llm-models/-/merge_requests/501#23b957b075082ab7ca8edec0db934ac9aa8f8ced
Includes Wan pipeline tests for each model, an isolated Wan transformer test, Wan TeaCache tests, and a feature test covering sanity checks plus quantization checks. These tests ensure the Wan checkpoints are not loaded more times than necessary, cutting each file's runtime (all should be under 10 minutes), and they allow for better separation of concerns. The original, long test files for Wan I2V/T2V are deleted.
This PR was initially part of this larger PR: #11923 (to be closed), but we broke the functionality of the previous PR into 2 parts (TeaCache update, then Wan test update) to merge TeaCache changes quickly.
This PR includes a transformer fix which is necessary to treat text+image cross-attention properly per the HF reference (https://github.com/huggingface/diffusers/blob/main/src/diffusers/models/transformers/transformer_wan.py#L160), and fixes the eps value to align with nn.LayerNorm (https://github.com/pytorch/pytorch/blob/main/torch/nn/modules/normalization.py#L191), since that is the default eps for FP32LayerNorm, which subclasses nn.LayerNorm.
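For reference, `torch.nn.LayerNorm` defaults to `eps=1e-5`. The pure-Python sketch below (not the project code) shows where eps enters the computation, which is why a mismatched eps shifts normalized outputs slightly:

```python
import math

def layer_norm(x, eps=1e-5):
    """Minimal LayerNorm over a flat list; eps matches torch.nn.LayerNorm's default."""
    mean = sum(x) / len(x)
    # Biased (population) variance, as LayerNorm uses.
    var = sum((v - mean) ** 2 for v in x) / len(x)
    # eps is added inside the sqrt to keep the division stable when var ~ 0.
    return [(v - mean) / math.sqrt(var + eps) for v in x]

out = layer_norm([1.0, 2.0, 3.0, 4.0])
print([round(v, 4) for v in out])
```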
Test Coverage
PR Checklist
Please review the following before submitting your PR:
PR description clearly explains what and why. If using CodeRabbit's summary, please make sure it makes sense.
PR Follows TRT-LLM CODING GUIDELINES to the best of your knowledge.
Test cases are provided for new code paths (see test instructions)
Any new dependencies have been scanned for license and vulnerabilities
CODEOWNERS updated if ownership changes
Documentation updated as needed
Update tava architecture diagram if there is a significant design change in PR.
The reviewers assigned automatically/manually are appropriate for the PR.
Please check this after reviewing the above items as appropriate for this PR.
GitHub Bot Help

To see a list of available CI bot commands, please comment `/bot help`.