Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[PyTorch] Relax dimension constraints for using fused grouped MLP
#2856 opened Apr 8, 2026 by ksivaman Loading…
5 of 13 tasks
Fix zero input shape for bgrad_group_quantize
#2854 opened Apr 8, 2026 by vthumbe1503 Loading…
13 tasks
Bump transformers from 4.55.0 to 5.0.0rc3 in /docs/examples/te_gemma dependencies Pull requests that update a dependency file python Pull requests that update python code
#2851 opened Apr 8, 2026 by dependabot bot Loading…
Bump transformers from 4.57.0 to 5.0.0rc3 in /docs/examples/te_llama dependencies Pull requests that update a dependency file python Pull requests that update python code
#2850 opened Apr 8, 2026 by dependabot bot Loading…
Simplify FA3 discovery
#2849 opened Apr 8, 2026 by vcherepanov-nv Loading…
4 of 13 tasks
Skip activation kernels when tensor size is zero bug Something isn't working
#2848 opened Apr 8, 2026 by timmoon10 Loading…
8 of 13 tasks
[Common] Multicast Fixes
#2847 opened Apr 8, 2026 by phu0ngng Draft
13 tasks
Add Megatron-FSDP E2E integration test to TE CI/CD (L1).
#2845 opened Apr 7, 2026 by cspades Loading…
3 of 13 tasks
[Core] Report CUDA versions when NVRTC compilation fails enhancement New feature or request
#2842 opened Apr 7, 2026 by timmoon10 Loading…
8 of 13 tasks
comm_gemm_test fixes
#2839 opened Apr 6, 2026 by almogsegal Loading…
13 tasks
Add grouped unswizzle functionality for MXFP8 scaling factors
#2837 opened Apr 5, 2026 by int-smart Loading…
8 of 13 tasks
Fix JAX extension build with NVTE_UB_WITH_MPI=1
#2835 opened Apr 4, 2026 by GaetanLepage Loading…
2 of 13 tasks
fix CUDA architectures cmake logic community-contribution PRs from external contributor outside the core maintainers, representing community-driven work.
#2832 opened Apr 3, 2026 by GaetanLepage Loading…
2 of 13 tasks
Port softmax ops to libtorch stable ABI
#2830 opened Apr 3, 2026 by pstjohn Loading…
Cp thd swa with ag
#2829 opened Apr 3, 2026 by sudhakarsingh27 Draft
13 tasks
[Common] Reduced padding kernel compilation time
#2827 opened Apr 2, 2026 by Oleg-Goncharov Loading…
5 of 13 tasks
fix(CP, MLA): CP works fine with MLA in a2a cp_comm_type community-contribution PRs from external contributor outside the core maintainers, representing community-driven work.
#2826 opened Apr 2, 2026 by zhujian19891203 Loading…
5 of 13 tasks
ProTip! Adding no:label will show everything without a label.