Skip to content

Actions: NVIDIA/TransformerEngine

Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
297 workflow run results
297 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Paddle] Support GQA
Documentation #2289: Pull request #595 synchronize by Wong4j
January 16, 2024 04:02 1m 13s Wong4j:jaywan/add_gqa
January 16, 2024 04:02 1m 13s
[JAX] Support SP + RoPE + GeLU
Documentation #2287: Pull request #602 opened by mingxu1067
January 15, 2024 05:08 1m 3s mingxu1067:mingh/sp_rope_gelu
January 15, 2024 05:08 1m 3s
[JAX][Common] Support GQA
Documentation #2286: Pull request #578 synchronize by zlsh80826
January 14, 2024 15:00 53s zlsh80826:rewang/gqa-clean
January 14, 2024 15:00 53s
[Paddle] Add RMSNorm, RoPE and SwiGLU
Documentation #2285: Pull request #599 synchronize by Wong4j
January 14, 2024 10:57 58s Wong4j:jaywan/add_llama_op
January 14, 2024 10:57 58s
[Paddle] Support GQA
Documentation #2284: Pull request #595 synchronize by Wong4j
January 14, 2024 10:34 49s Wong4j:jaywan/add_gqa
January 14, 2024 10:34 49s
[JAX][Common] Support GQA
Documentation #2283: Pull request #578 synchronize by zlsh80826
January 14, 2024 07:36 1m 2s zlsh80826:rewang/gqa-clean
January 14, 2024 07:36 1m 2s
Activation offloading to CPU's for the Linear, Layernorm Linear and the Layernorm MLP modules
Documentation #2277: Pull request #571 synchronize by ptrendx
January 12, 2024 17:05 1m 15s main
January 12, 2024 17:05 1m 15s
Activation offloading to CPU's for the Linear, Layernorm Linear and the Layernorm MLP modules
Documentation #2276: Pull request #571 synchronize by ptrendx
January 12, 2024 17:00 50s main
January 12, 2024 17:00 50s
Activation offloading to CPU's for the Linear, Layernorm Linear and the Layernorm MLP modules
Documentation #2275: Pull request #571 synchronize by ptrendx
January 12, 2024 16:39 38s main
January 12, 2024 16:39 38s
[PyTorch] cuda graph support
Documentation #2273: Pull request #575 synchronize by ksivaman
January 12, 2024 07:49 1m 1s ksivaman:fp8_cuda_graphs
January 12, 2024 07:49 1m 1s
[Paddle] Support GQA
Documentation #2272: Pull request #595 synchronize by Wong4j
January 12, 2024 04:10 57s Wong4j:jaywan/add_gqa
January 12, 2024 04:10 57s
Activation offloading to CPU's for the Linear, Layernorm Linear and the Layernorm MLP modules
Documentation #2270: Pull request #571 synchronize by sanandaraj5597
January 12, 2024 01:04 39s main
January 12, 2024 01:04 39s
Activation offloading to CPU's for the Linear, Layernorm Linear and the Layernorm MLP modules
Documentation #2268: Pull request #571 synchronize by ptrendx
January 11, 2024 22:11 48s main
January 11, 2024 22:11 48s
Support building using the manylinux docker image.
Documentation #2267: Pull request #586 synchronize by lpetre
January 11, 2024 12:54 1m 9s fix_manylinux
January 11, 2024 12:54 1m 9s
[JAX][Common] Support GQA
Documentation #2266: Pull request #578 synchronize by zlsh80826
January 11, 2024 07:41 52s zlsh80826:rewang/gqa-clean
January 11, 2024 07:41 52s
[JAX][Common] Support GQA
Documentation #2263: Pull request #578 synchronize by zlsh80826
January 11, 2024 03:24 46s zlsh80826:rewang/gqa-clean
January 11, 2024 03:24 46s
[JAX][Common] Support GQA
Documentation #2260: Pull request #578 synchronize by zlsh80826
January 10, 2024 13:48 52s zlsh80826:rewang/gqa-clean
January 10, 2024 13:48 52s