Pull requests: ROCm/TransformerEngine
Pull requests list
[ROCm] add the bias all row -inf support for jax unfused-attn
ci-level 3
CI test level 3
#556
opened Apr 21, 2026 by
wangye805
Collaborator
8 of 13 tasks
ci: add workflow to build and publish CI deps Docker image
#555
opened Apr 20, 2026 by
VeeraRajasekhar
Contributor
1 of 13 tasks
Refactor TE's CK backend padding add/remove kernels
ci-level 3
CI test level 3
#553
opened Apr 20, 2026 by
Micky774
Contributor
13 tasks
Refactor CK workspace memory management to use a unified toggle-pass helper object
ci-level 3
CI test level 3
#552
opened Apr 20, 2026 by
Micky774
Contributor
13 tasks
Refactor CK FA dispatch, and collapse API
ci-level 3
CI test level 3
#551
opened Apr 20, 2026 by
Micky774
Contributor
13 tasks
Integrate initial version of QoLA
ci-level 1
CI test level 1
#550
opened Apr 20, 2026 by
Micky774
Contributor
13 tasks
Add Claude PR review/summary action
#548
opened Apr 17, 2026 by
Micky774
Contributor
13 tasks
Enable CI lint gh action on ROCm
ci-level 3
CI test level 3
#547
opened Apr 17, 2026 by
VeeraRajasekhar
Contributor
13 tasks
[TE] Improve backward performance for CK Tile FP8 Group GEMM
ci-level 3
CI test level 3
#544
opened Apr 16, 2026 by
aris134
Contributor
1 of 13 tasks
CI: auto-trigger AITER prebuilt upload when 3rdparty/aiter updates on dev
#543
opened Apr 15, 2026 by
VeeraRajasekhar
Contributor
8 of 13 tasks
[TE] Phase 2 of Sciforium cross-attn integration: a separate cpp backend and a new jax api
#542
opened Apr 15, 2026 by
VeeraRajasekhar
Contributor
Draft
13 tasks
Integrate AITER fused RoPE kernels with fallback to TE native
#541
opened Apr 15, 2026 by
suachong
7 tasks done
NV upstream release 2.12 merge
ci-level 3
CI test level 3
#538
opened Apr 13, 2026 by
Micky774
Contributor
13 tasks
Full MXFP4 Training Recipe
ci-level 3
CI test level 3
#537
opened Apr 13, 2026 by
sarthak-amd
Collaborator
3 of 4 tasks
CI: Refactor ROCm CI to use GPU-sized runners and build-only jobs
ci-level 3
CI test level 3
#528
opened Apr 7, 2026 by
leo-automation
Collaborator
Gfx1250 changes
ci-level 2
CI test level 2
#527
opened Apr 7, 2026 by
ipanfilo
Collaborator
1 of 13 tasks
castonly/casttranspose HIP kernel optimization in fp8
ci-level 3
CI test level 3
#519
opened Apr 4, 2026 by
alextmagro
Contributor
NVFP4 recipe with GEMM via BF16 dequant
ci-level 1
CI test level 1
#518
opened Apr 2, 2026 by
matthiasdiener
Contributor
1 of 13 tasks
NVFP4: hadamard_transform_cast_fusion_columnwise
ci-level 1
CI test level 1
#515
opened Apr 1, 2026 by
matthiasdiener
Contributor
Draft
1 of 13 tasks
[TE] Enable deterministic mode for fused attention
ci-level 1
CI test level 1
#508
opened Mar 27, 2026 by
AllenFarcas
Contributor
7 of 13 tasks
Add fsdp2 fp8 unit tests TE 2.10
ci-level 3
CI test level 3
#492
opened Mar 17, 2026 by
sudhu2k
Contributor
8 of 13 tasks
Add AITER fused RoPE dispatch to FusedRoPEFunc
#489
opened Mar 17, 2026 by
sarthak-amd
Collaborator
ASV-format microbenchmark suite
#487
opened Mar 16, 2026 by
Micky774
Contributor
1 of 13 tasks