Skip to content

Pull requests: NVIDIA/Fuser

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

disable PreBroadcastMmaBiasNeg on blackwell
#3820 opened Feb 4, 2025 by liqiangxl Loading…
Implement persistent matmul scheduling
#3812 opened Feb 3, 2025 by jacobhinkle Loading…
Deallocate TMem when done using it
#3806 opened Jan 31, 2025 by zasdfgbnm Loading…
use buffer storage params
#3804 opened Jan 31, 2025 by liqiangxl Draft
[WIP] Comms through cudaIpc
#3799 opened Jan 30, 2025 by samnordmann Draft
[WIP] ExprEvalExecutor Speedup
#3796 opened Jan 29, 2025 by csarofeen Draft
Use F.rms_norm in benchmarks
#3783 opened Jan 29, 2025 by Priya2698 Loading…
[CI Testing] For Issue #3629
#3767 opened Jan 28, 2025 by Priya2698 Draft
Move RNG logic out of codegen to a lowering pass.
#3749 opened Jan 23, 2025 by csarofeen Loading…
[WIP] Loop promotion for cyclic graphs
#3747 opened Jan 23, 2025 by naoyam Draft
2 tasks done
Rng devel debug
#3744 opened Jan 22, 2025 by csarofeen Draft
Reapply #3621
#3714 opened Jan 16, 2025 by wujingyue Draft
[DO NOT REVIEW] Testing main
#3698 opened Jan 12, 2025 by csarofeen Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.