Actions: huggingface/lighteval

Build Documentation

77 workflow runs

Fix loading of vllm model from files (#533)
Build Documentation #77: Commit d4e6f59 pushed by NathanHB
February 10, 2025 09:44 1m 23s main
Fix VLLM data-parallel (#541)
Build Documentation #76: Commit 86f6225 pushed by hynky1999
February 6, 2025 13:33 1m 34s main
Bug fix extractive match (#540)
Build Documentation #75: Commit 3c9b0c9 pushed by clefourrier
February 6, 2025 12:35 1m 23s main
Update README.md (#539)
Build Documentation #74: Commit f8405ee pushed by clefourrier
February 6, 2025 12:02 1m 31s main
Pass@k (#519)
Build Documentation #73: Commit 441d7a4 pushed by clefourrier
February 6, 2025 07:57 1m 56s main
Make BLEURT lazy (#536)
Build Documentation #72: Commit 15bdbb8 pushed by clefourrier
February 6, 2025 07:57 2m 11s main
Add GPQA for instruct models (#534)
Build Documentation #71: Commit 1ce7331 pushed by lewtun
February 5, 2025 15:39 1m 27s main
Sync Math-verify (#535)
Build Documentation #70: Commit cb35bea pushed by hynky1999
February 5, 2025 11:34 1m 32s main
Add custom task (bac-fr) for evaluation of models in french (#518)
Build Documentation #69: Commit d7a1f11 pushed by clefourrier
February 3, 2025 16:08 1m 37s main
Update french_evals.py
Build Documentation #68: Commit be7da17 pushed by clefourrier
February 3, 2025 12:13 1m 40s main
adds olympiad bench (#521)
Build Documentation #67: Commit d332207 pushed by NathanHB
January 31, 2025 14:20 1m 29s main
Improve readability of the quick tour. (#501)
Build Documentation #66: Commit 515bd01 pushed by clefourrier
January 30, 2025 13:11 1m 36s main
Implemented the possibility to load predictions from details files an…
Build Documentation #65: Commit 94fc5a2 pushed by NathanHB
January 29, 2025 14:59 1m 36s main
add missing inits (#524)
Build Documentation #64: Commit 48d0c28 pushed by clefourrier
January 29, 2025 07:16 1m 27s main
Math extraction - allow only trying the first match, more customizabl…
Build Documentation #63: Commit 0e46269 pushed by hynky1999
January 28, 2025 12:57 1m 29s main
Fixing commonsense qa: generative metrics, -1 gen length (#517)
Build Documentation #62: Commit cb075a5 pushed by clefourrier
January 26, 2025 17:18 1m 27s main
Fix Ukrainian indices and confirmation word (#516)
Build Documentation #61: Commit 499cc82 pushed by clefourrier
January 26, 2025 11:04 1m 24s main
Fixed bug of import url_to_fs from fsspec (#507) (#512)
Build Documentation #60: Commit 4f381b3 pushed by clefourrier
January 24, 2025 10:37 1m 33s main
Bump up the latex2sympy2_extended version + more tests (#510)
Build Documentation #59: Commit 0ab63d0 pushed by hynky1999
January 23, 2025 13:02 1m 38s main
Support custom results/details push to hub (#457)
Build Documentation #58: Commit c82143a pushed by clefourrier
January 23, 2025 10:48 1m 26s main
Add custom tasks for evaluation of french models (#505)
Build Documentation #57: Commit 7028af3 pushed by clefourrier
January 23, 2025 08:24 1m 29s main
llm_as_a_judge_for_oallv2_arabic (#498)
Build Documentation #56: Commit 620873b pushed by clefourrier
January 23, 2025 07:24 1m 24s main
Relax upper bound on torch (#508)
Build Documentation #55: Commit 5f5bed5 pushed by clefourrier
January 23, 2025 07:20 1m 23s main
Translate task template to Catalan and Galician and fix typos (#506)
Build Documentation #54: Commit 1ae2fa2 pushed by clefourrier
January 22, 2025 10:01 1m 24s main
Made judge response processing more robust. (#491)
Build Documentation #53: Commit 0140578 pushed by clefourrier
January 20, 2025 14:41 1m 31s main