Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Parallel Cross Entropy using online softmax enhancement New feature or request
#1456 opened Feb 4, 2025 by sanandaraj5597 Loading…
[JAX] THD ring attention
#1454 opened Feb 4, 2025 by zlsh80826 Draft
2 of 13 tasks
Add NVTX ranges to categorize execution
#1447 opened Jan 31, 2025 by minitu Loading…
13 tasks
[PyTorch] Rename and clean up MXFP8 recipe class
#1445 opened Jan 31, 2025 by timmoon10 Draft
7 of 13 tasks
Support store_param_remainders feature from Apex in TE Fused Adam enhancement New feature or request
#1443 opened Jan 30, 2025 by timmoon10 Loading…
6 of 13 tasks
[Pytorch] Nvidia-DLFramework-Inspect support
#1441 opened Jan 30, 2025 by pggPL Draft
8 of 13 tasks
Add test for Lightning Thunder integration testing Improvements to tests or testing infrastructure
#1433 opened Jan 28, 2025 by timmoon10 Draft
6 of 14 tasks
Introduce NVSHMEM based communication API for pytorch
#1430 opened Jan 28, 2025 by gdengk Loading…
13 tasks
Adding remove_caches API to Float8Tensor class
#1425 opened Jan 27, 2025 by youngeunkwon0405 Loading…
13 tasks
Initial Support Blackwell Build
#1418 opened Jan 21, 2025 by johnnynunez Loading…
9 of 13 tasks
[PyTorch] cuBLAS workspace size fix for TP overlap unit test bug Something isn't working
#1415 opened Jan 17, 2025 by denera Loading…
8 of 13 tasks
Fix Linear Weight Initialization in the PaddlePaddle Implementation
#1413 opened Jan 17, 2025 by GuoxiaWang Loading…
4 of 13 tasks
Better cuBLAS handle management
#1389 opened Jan 2, 2025 by ptrendx Loading…
8 of 13 tasks
Update README.rst
#1385 opened Dec 23, 2024 by sbhavani Loading…
1 of 6 tasks
Don't touch nor send messages to the root logger.
#1380 opened Dec 19, 2024 by sagostinho-nvidia Loading…
4 of 13 tasks
Add paged attention support
#1355 opened Dec 4, 2024 by cyanguwa Loading…
8 of 13 tasks
[PyTorch] Bugfix for wgrad bulk overlap conflict when dgrad overlap is reduce-scatter bug Something isn't working
#1341 opened Nov 18, 2024 by denera Loading…
6 of 13 tasks
[C/JAX] Comm+GEMM Overlap API for TE/JAX enhancement New feature or request jax
#1337 opened Nov 15, 2024 by denera Draft
3 of 13 tasks
Build with uv instead of just pip
#1324 opened Nov 8, 2024 by jennifgcrl Loading…
5 of 13 tasks
TP communication overlap: enable the overlap between GEMM chunk at Ho…
#1311 opened Nov 4, 2024 by erhoo82 Loading…
1 of 13 tasks
[PyTorch] Add heuristics for intializing FP8 params enhancement New feature or request
#1300 opened Oct 30, 2024 by timmoon10 Loading…
8 of 13 tasks
ProTip! Adding no:label will show everything without a label.