Skip to content

Pull requests: ROCm/pytorch

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Triton/inductor related optimisations
#2008 opened Mar 27, 2025 by jataylo Loading…
[fix]: permute_021 copy fix
#1993 opened Mar 24, 2025 by TennyWang1223 Loading…
Schedule remainder loop chunk in threadblock 0.
#1985 opened Mar 20, 2025 by carlobertolli Loading…
Diable L2_cahce_size display on ROCm platforms
#1876 opened Jan 31, 2025 by tcgu-amd Loading…
Wrap vec size 8 with USE_ROCM
#1795 opened Dec 16, 2024 by oraluben Loading…
Improve performance of reduce sum for 3D shapes
#1785 opened Dec 12, 2024 by doru1004 Loading…
[release/2.5] AMDSMI/layernorm cherry picks
#1764 opened Dec 4, 2024 by jataylo Loading…
[ROCm] Improve softmax performance.
#1740 opened Nov 21, 2024 by doru1004 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.