Update cache position population and arg order for multimodal runner #14225
base: main
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14225
Note: Links to docs will display an error until the docs builds have been completed.
❌ 12 New Failures, 3 Cancelled Jobs as of commit cb633cf with merge base 10e93fb.
NEW FAILURES - The following jobs have failed:
CANCELLED JOBS - The following jobs were cancelled. Please retry:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
(Synced offline)
@@ -94,6 +94,11 @@ Result<uint64_t> MultimodalPrefiller::prefill(
// `cache_position` goes from start_pos to start_pos + encoder_output.size(1).
// e.g. if start_pos = 2 and encoder_output.size(1) = 5,
// cache_position_tensor should be [2, 3, 4, 5, 6].
auto method_meta = ET_UNWRAP(module_->method_meta(kTextModelMethod));
auto first_input_info = ET_UNWRAP(method_meta.input_tensor_meta(0));
Change to second_input_info
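i.e., rename it and read input 1 instead of input 0, as in the later hunk:

auto second_input_info = ET_UNWRAP(method_meta.input_tensor_meta(1));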
    cache_positions.data(),
    {static_cast<int>(seq_len)},
    executorch::aten::ScalarType::Long);
auto cache_position_tensor = (numel > 1)
Can you do something like

// Declared outside the branches so it stays in scope for the execute() call.
TensorPtr cache_position_tensor;
if (numel > 1) {
  // `cache_position` goes from start_pos to start_pos + encoder_output.size(1).
  // e.g. if start_pos = 2 and encoder_output.size(1) = 5,
  // cache_position_tensor should be [2, 3, 4, 5, 6].
  for (int64_t i = 0; i < seq_len; ++i) {
    cache_positions[i] = start_pos + i;
  }
  cache_position_tensor = ::executorch::extension::from_blob(
      cache_positions.data(),
      {static_cast<int>(seq_len)},
      executorch::aten::ScalarType::Long);
} else {
  // Cache position is size 1.
  cache_position_tensor = ::executorch::extension::from_blob(
      &start_pos, {1}, executorch::aten::ScalarType::Long);
}
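This sketch assumes cache_positions is a caller-owned buffer that outlives the tensor (from_blob does not copy), presumably declared earlier in prefill() along the lines of:

std::vector<int64_t> cache_positions(seq_len);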
@@ -94,6 +94,11 @@ Result<uint64_t> MultimodalPrefiller::prefill(
// `cache_position` goes from start_pos to start_pos + encoder_output.size(1).
// e.g. if start_pos = 2 and encoder_output.size(1) = 5,
// cache_position_tensor should be [2, 3, 4, 5, 6].
auto method_meta = ET_UNWRAP(module_->method_meta(kTextModelMethod));
Add a comment like:
// Get expected shape of cache position tensor, which should be the second argument
auto second_input_info = ET_UNWRAP(method_meta.input_tensor_meta(1));
auto second_input_sizes = second_input_info.sizes();
auto numel = second_input_sizes[0];
Please reuse the logic here https://github.com/pytorch/executorch/blob/main/extension/llm/runner/text_decoder_runner.cpp#L44-L69
A bit later?
auto cache_position_tensor = ::executorch::extension::from_blob(
    cache_positions.data(),
    {static_cast<int>(seq_len)},
    executorch::aten::ScalarType::Long);
auto prefill_result = module_->execute(
    kTextModelMethod, {cache_position_tensor, encoder_output});
Swap these two
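i.e., pass the encoder output first and the cache position second, consistent with the input_tensor_meta(1) lookup above. A sketch of the swapped call:

auto prefill_result = module_->execute(
    kTextModelMethod, {encoder_output, cache_position_tensor});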
extension/llm/runner/util.h (Outdated)
@@ -99,6 +102,37 @@ ET_EXPERIMENTAL size_t inline get_rss_bytes() {
// when this changed.
return 0;
}

inline runtime::Result<TensorPtr>
populate_start_pos_tensor(Module* module, int64_t& start_pos, int seq_len) {
Add a docstring please, e.g. explaining that we assume the second argument is the cache position / start pos, and that the tensor is populated based on its shape.
Also the name should be populate_start_pos_or_cache_position
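Putting the two comments together, the helper might end up looking roughly like this. This is a sketch, not the landed code: the kTextModelMethod lookup inside the helper, the use of ::executorch::extension::empty for the owning buffer, and the docstring wording are all assumptions.

// Builds the second argument of the text model: either a single start_pos
// value or a full cache_position tensor, depending on the expected shape of
// input 1. Assumes input 1 of the method is the cache position / start pos.
inline runtime::Result<TensorPtr> populate_start_pos_or_cache_position(
    Module* module, int64_t& start_pos, int seq_len) {
  // Assumption: kTextModelMethod is visible here; the landed code may pass
  // the method name in instead.
  auto method_meta = ET_UNWRAP(module->method_meta(kTextModelMethod));
  // Get expected shape of cache position tensor, which should be the second argument.
  auto second_input_info = ET_UNWRAP(method_meta.input_tensor_meta(1));
  auto numel = second_input_info.sizes()[0];
  if (numel > 1) {
    // Model expects a full cache_position tensor:
    // [start_pos, start_pos + 1, ..., start_pos + seq_len - 1].
    // An owning tensor avoids the lifetime issue a from_blob over a local
    // buffer would have.
    auto cache_position_tensor = ::executorch::extension::empty(
        {seq_len}, executorch::aten::ScalarType::Long);
    auto* data = cache_position_tensor->mutable_data_ptr<int64_t>();
    for (int64_t i = 0; i < seq_len; ++i) {
      data[i] = start_pos + i;
    }
    return cache_position_tensor;
  }
  // Model expects a single start_pos value; start_pos is passed by reference
  // so the caller's variable backs the (non-owning) tensor.
  return ::executorch::extension::from_blob(
      &start_pos, {1}, executorch::aten::ScalarType::Long);
}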
Thank you! I'll need to land #14238 first
Summary
For voxtral, we construct the cache_position_tensor as before; for llava, the cache position is constructed internally, so we pass in a size-1 tensor.
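A rough sketch of the resulting call site in prefill() (a sketch only; whether module_ needs .get() depends on how the runner stores it):

auto cache_position_tensor = ET_UNWRAP(
    populate_start_pos_or_cache_position(module_.get(), start_pos, seq_len));

which then feeds the execute call with the encoder output first.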
Test plan
CI