Skip to content

Actions: NVIDIA/TransformerEngine

TE-CI Trigger

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
274 workflow run results
274 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[JAX][Common] Support GQA
TE-CI Trigger #1933: Issue comment #578 (comment) created by zlsh80826
January 8, 2024 07:15 3m 41s
January 8, 2024 07:15 3m 41s
Discrepancy in Test Results between Context Parallel and Flash Attention
TE-CI Trigger #1932: Issue comment #574 (comment) created by Infi-zc
January 8, 2024 07:10 5s
January 8, 2024 07:10 5s
[Paddle] Support GQA
TE-CI Trigger #1931: Issue comment #595 (comment) created by jeng1220
January 8, 2024 04:56 5s
January 8, 2024 04:56 5s
[Paddle] Add sequence parallel
TE-CI Trigger #1930: Issue comment #561 (comment) created by jeng1220
January 8, 2024 04:54 6s
January 8, 2024 04:54 6s
[Paddle] Support GQA
TE-CI Trigger #1929: Issue comment #595 (comment) created by zlsh80826
January 8, 2024 01:39 3m 57s
January 8, 2024 01:39 3m 57s
is fwd_scale_inverses needed for bwd?
TE-CI Trigger #1928: Issue comment #591 (comment) created by ronzillia
January 6, 2024 18:39 3s
January 6, 2024 18:39 3s
is fwd_scale_inverses needed for bwd?
TE-CI Trigger #1927: Issue comment #591 (comment) created by ksivaman
January 6, 2024 13:54 4s
January 6, 2024 13:54 4s
Bump FlashAttn version and add deterministic option for FAv2
TE-CI Trigger #1926: Issue comment #585 (comment) created by ksivaman
January 6, 2024 04:27 3m 38s
January 6, 2024 04:27 3m 38s
[Common/PyTorch] Fix FP8 fused attention input args
TE-CI Trigger #1925: Issue comment #592 (comment) created by cyanguwa
January 6, 2024 02:02 4m 1s
January 6, 2024 02:02 4m 1s
NVRTC kernels for cast-transpose
TE-CI Trigger #1924: Issue comment #258 (comment) created by timmoon10
January 6, 2024 01:47 3m 29s
January 6, 2024 01:47 3m 29s
[PyTorch] Reduce size of sanity tests
TE-CI Trigger #1923: Issue comment #510 (comment) created by timmoon10
January 6, 2024 01:43 3s
January 6, 2024 01:43 3s
[PyTorch] Refactor parameter splitting in Linear and LayerNormLinear
TE-CI Trigger #1922: Issue comment #590 (comment) created by timmoon10
January 6, 2024 01:42 3s
January 6, 2024 01:42 3s
Implement fused kernel for FP8 scale update
TE-CI Trigger #1921: Issue comment #593 (comment) created by timmoon10
January 6, 2024 01:40 3m 43s
January 6, 2024 01:40 3m 43s
[PyTorch] upgrade context parallelism implementations
TE-CI Trigger #1920: Issue comment #572 (comment) created by cyanguwa
January 6, 2024 01:20 3m 41s
January 6, 2024 01:20 3m 41s
[PyTorch] upgrade context parallelism implementations
TE-CI Trigger #1919: Issue comment #572 (comment) created by cyanguwa
January 5, 2024 22:51 3m 52s
January 5, 2024 22:51 3m 52s
Use unoptimized layernorm kernel if pointers are not aligned
TE-CI Trigger #1918: Issue comment #490 (comment) created by timmoon10
January 5, 2024 21:27 5s
January 5, 2024 21:27 5s
[PyTorch] Reduce size of sanity tests
TE-CI Trigger #1917: Issue comment #510 (comment) created by timmoon10
January 5, 2024 19:33 3m 26s
January 5, 2024 19:33 3m 26s
[PyTorch] Refactor parameter splitting in Linear and LayerNormLinear
TE-CI Trigger #1916: Issue comment #590 (comment) created by timmoon10
January 5, 2024 19:29 3m 46s
January 5, 2024 19:29 3m 46s
Use jit_fuser for bias-dropout-add fusion
TE-CI Trigger #1915: Issue comment #589 (comment) created by ptrendx
January 5, 2024 17:31 5s
January 5, 2024 17:31 5s
[bug] FP8+PP+Recompute+GA>1, loss = nan
TE-CI Trigger #1914: Issue comment #539 (comment) created by codecaution
January 5, 2024 13:35 6s
January 5, 2024 13:35 6s
multi-gpu example with >1 GPU crashes without fuse_qkv=True
TE-CI Trigger #1913: Issue comment #533 (comment) created by timmoon10
January 5, 2024 08:28 5s
January 5, 2024 08:28 5s
[PyTorch] Refactor parameter splitting in Linear and LayerNormLinear
TE-CI Trigger #1912: Issue comment #590 (comment) created by timmoon10
January 5, 2024 08:02 3m 43s
January 5, 2024 08:02 3m 43s
[PyTorch] Reduce size of sanity tests
TE-CI Trigger #1911: Issue comment #510 (comment) created by timmoon10
January 4, 2024 21:22 3m 31s
January 4, 2024 21:22 3m 31s
Use unoptimized layernorm kernel if pointers are not aligned
TE-CI Trigger #1910: Issue comment #490 (comment) created by timmoon10
January 4, 2024 21:22 3m 45s
January 4, 2024 21:22 3m 45s
NVRTC kernels for cast-transpose
TE-CI Trigger #1909: Issue comment #258 (comment) created by timmoon10
January 4, 2024 21:21 3m 30s
January 4, 2024 21:21 3m 30s