Skip to content

Actions: stanford-crfm/helm

Test

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,894 workflow runs
2,894 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Include multiple annotators for WildBench
Test #7894: Pull request #3283 opened by liamjxu
January 22, 2025 06:39 10m 18s jialiang/multiple_annotator
January 22, 2025 06:39 10m 18s
refactor fleurs asrscenario
Test #7893: Pull request #3281 opened by ImKeTT
January 22, 2025 01:25 9m 48s ImKeTT:asr_haoqin
January 22, 2025 01:25 9m 48s
Switch table_benchmark wikitq to use 1 shot instead of 5
Test #7892: Pull request #3280 opened by yifanmai
January 21, 2025 23:09 10m 32s yifanmai/unitxt-1-shot
January 21, 2025 23:09 10m 32s
Add Llama 3.1 Instruct on Vertex AI (#3278)
Test #7891: Commit c617a57 pushed by yifanmai
January 20, 2025 02:30 10m 18s main
January 20, 2025 02:30 10m 18s
Add run entries for HELM Tables with only the base variants (#3279)
Test #7890: Commit ef82a87 pushed by yifanmai
January 17, 2025 06:06 10m 48s main
January 17, 2025 06:06 10m 48s
Add GPT-4o-mini and Llama 3.3 70B on Stanford Health Care API (#3277)
Test #7887: Commit 966b50b pushed by yifanmai
January 17, 2025 04:22 9m 40s main
January 17, 2025 04:22 9m 40s
Rename schema_lite_v2.yaml to schema_capabilities.yaml (#3276)
Test #7884: Commit 1b0202b pushed by yifanmai
January 16, 2025 21:48 10m 41s main
January 16, 2025 21:48 10m 41s
Use original instance IDs in IFEval
Test #7882: Pull request #3275 opened by yifanmai
January 15, 2025 19:39 10m 3s yifanmai/fix-ifeval-natural-id
January 15, 2025 19:39 10m 3s
Misc cleanup for HELM Capabilities (#3274)
Test #7881: Commit 6d70e98 pushed by yifanmai
January 15, 2025 06:16 10m 14s main
January 15, 2025 06:16 10m 14s
Misc cleanup for HELM Capabilities
Test #7880: Pull request #3274 synchronize by yifanmai
January 15, 2025 06:06 9m 24s yifanmai/fix-capabilities-cleanup
January 15, 2025 06:06 9m 24s
Add general info metrics to Capabilities run specs (#3273)
Test #7878: Commit 716e523 pushed by yifanmai
January 15, 2025 01:03 10m 49s main
January 15, 2025 01:03 10m 49s
Allow running on all subjects for MMLU-Pro (#3272)
Test #7876: Commit b069c5a pushed by yifanmai
January 14, 2025 23:42 10m 12s main
January 14, 2025 23:42 10m 12s
Return more information in Omni-MATH annotations (#3271)
Test #7875: Commit 61a9bc0 pushed by yifanmai
January 14, 2025 23:29 10m 26s main
January 14, 2025 23:29 10m 26s
Allow running on all subjects for MMLU-Pro
Test #7874: Pull request #3272 opened by yifanmai
January 14, 2025 23:28 10m 13s yifanmai/fix-mmlu-pro-all
January 14, 2025 23:28 10m 13s
Move run specs for HELM capabilities to its module (#3270)
Test #7873: Commit 6989b81 pushed by yifanmai
January 14, 2025 23:15 9m 43s main
January 14, 2025 23:15 9m 43s