Pull requests: ROCm/TransformerEngine

Pull requests list

IFU v2.14.dev0
#557 opened Apr 21, 2026 by ipanfilo Collaborator Draft
5 of 13 tasks
[ROCm] add the bias all row -inf support for jax unfused-attn (label: ci-level 3)
#556 opened Apr 21, 2026 by wangye805 Collaborator
8 of 13 tasks
ci: add workflow to build and publish CI deps Docker image
#555 opened Apr 20, 2026 by VeeraRajasekhar Contributor
1 of 13 tasks
Refactor TE's CK backend padding add/remove kernels (label: ci-level 3)
#553 opened Apr 20, 2026 by Micky774 Contributor
13 tasks
Refactor CK workspace memory management to use a unified toggle-pass helper object (label: ci-level 3)
#552 opened Apr 20, 2026 by Micky774 Contributor
13 tasks
Refactor CK FA dispatch, and collapse API (label: ci-level 3)
#551 opened Apr 20, 2026 by Micky774 Contributor
13 tasks
Integrate initial version of QoLA (label: ci-level 1)
#550 opened Apr 20, 2026 by Micky774 Contributor
13 tasks
Add Claude PR review/summary action
#548 opened Apr 17, 2026 by Micky774 Contributor
13 tasks
Enable CI lint gh action on ROCm (label: ci-level 3)
#547 opened Apr 17, 2026 by VeeraRajasekhar Contributor
13 tasks
[TE] Improve backward performance for CK Tile FP8 Group GEMM (label: ci-level 3)
#544 opened Apr 16, 2026 by aris134 Contributor
1 of 13 tasks
CI: auto-trigger AITER prebuilt upload when 3rdparty/aiter updates on dev
#543 opened Apr 15, 2026 by VeeraRajasekhar Contributor
8 of 13 tasks
Integrate AITER fused RoPE kernels with fallback to TE native
#541 opened Apr 15, 2026 by suachong
7 tasks done
NV upstream release 2.12 merge (label: ci-level 3)
#538 opened Apr 13, 2026 by Micky774 Contributor
13 tasks
Full MXFP4 Training Recipe (label: ci-level 3)
#537 opened Apr 13, 2026 by sarthak-amd Collaborator
3 of 4 tasks
Ipanfilo/wheel build action
#529 opened Apr 7, 2026 by ipanfilo Collaborator
1 of 13 tasks
CI: Refactor ROCm CI to use GPU-sized runners and build-only jobs (label: ci-level 3)
#528 opened Apr 7, 2026 by leo-automation Collaborator
Gfx1250 changes (label: ci-level 2)
#527 opened Apr 7, 2026 by ipanfilo Collaborator
1 of 13 tasks
castonly/casttranspose HIP kernel optimization in fp8 (label: ci-level 3)
#519 opened Apr 4, 2026 by alextmagro Contributor
NVFP4 recipe with GEMM via BF16 dequant (label: ci-level 1)
#518 opened Apr 2, 2026 by matthiasdiener Contributor
1 of 13 tasks
NVFP4: hadamard_transform_cast_fusion_columnwise (label: ci-level 1)
#515 opened Apr 1, 2026 by matthiasdiener Contributor Draft
1 of 13 tasks
[TE] Enable deterministic mode for fused attention (label: ci-level 1)
#508 opened Mar 27, 2026 by AllenFarcas Contributor
7 of 13 tasks
Add fsdp2 fp8 unit tests TE 2.10 (label: ci-level 3)
#492 opened Mar 17, 2026 by sudhu2k Contributor
8 of 13 tasks
Add AITER fused RoPE dispatch to FusedRoPEFunc
#489 opened Mar 17, 2026 by sarthak-amd Collaborator
ASV-format microbenchmark suite
#487 opened Mar 16, 2026 by Micky774 Contributor
1 of 13 tasks