Pull requests: quic/efficient-transformers

Pull requests list

Hybrid chunked cache update
#500 opened Jul 8, 2025 by quic-amitraj Draft
default NPI file added (labels: 1.20.0)
#498 opened Jul 7, 2025 by quic-akuruvil
Fixes in QNN compilation for data format config
#497 opened Jul 7, 2025 by shubhagr-qc
Dynamic cache support on llama4
#494 opened Jul 7, 2025 by quic-rishinr
Added env var- DYNAMIC_CACHE to switch from HCC to DC
#489 opened Jul 3, 2025 by asmigosw
[Llama4]: Add support for padding num_patches (labels: 1.20.0, enhancement)
#486 opened Jul 1, 2025 by vbaddi
Unit Tests for On Device Sampling (labels: 1.20.0)
#463 opened Jun 18, 2025 by quic-sanising
Updated get_available_device_id logic (labels: 1.21.0)
#445 opened Jun 11, 2025 by quic-rishinr
Addition of MIN_MASKED_ATTN_VALUE
#433 opened Jun 6, 2025 by quic-amitraj
Added Prompt length check for VLMs
#422 opened May 21, 2025 by asmigosw
Dependency package upgrade (labels: 1.21.0)
#407 opened May 15, 2025 by qcdipankar
Qwen3moe model-enablement
#406 opened May 15, 2025 by qcdipankar