Skip to content

Actions: mlfoundations/evalchemy

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
250 workflow runs
250 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

add livebench/livecodebench to reasoning configs
Lint #227: Commit 391456a pushed by sedrick-keh-tri
January 30, 2025 04:40 26s configs
January 30, 2025 04:40 26s
reasoning configs
Lint #226: Pull request #69 opened by sedrick-keh-tri
January 30, 2025 04:38 26s configs
January 30, 2025 04:38 26s
reasoning configs
Lint #225: Commit 6f11540 pushed by sedrick-keh-tri
January 30, 2025 04:38 27s configs
January 30, 2025 04:38 27s
Merge pull request #56 from mlfoundations/jean/livebench
Lint #224: Commit fc1ac80 pushed by EtashGuha
January 29, 2025 22:15 32s main
January 29, 2025 22:15 32s
Adding LiveBench
Lint #223: Pull request #56 synchronize by neginraoof
January 29, 2025 22:05 33s jean/livebench
January 29, 2025 22:05 33s
lint
Lint #222: Commit bf76a59 pushed by neginraoof
January 29, 2025 22:05 28s jean/livebench
January 29, 2025 22:05 28s
Adding LiveBench
Lint #221: Pull request #56 synchronize by neginraoof
January 29, 2025 22:02 27s jean/livebench
January 29, 2025 22:02 27s
fix max tokens
Lint #220: Commit 6fe03da pushed by neginraoof
January 29, 2025 22:02 27s jean/livebench
January 29, 2025 22:02 27s
Adding LiveBench
Lint #219: Pull request #56 synchronize by neginraoof
January 29, 2025 20:56 29s jean/livebench
January 29, 2025 20:56 29s
fix for max length
Lint #218: Commit 232448e pushed by neginraoof
January 29, 2025 20:56 27s jean/livebench
January 29, 2025 20:56 27s
Merge pull request #68 from mlfoundations/negin/livecodebench
Lint #217: Commit a7b71d8 pushed by neginraoof
January 29, 2025 18:05 25s main
January 29, 2025 18:05 25s
Adding LiveBench
Lint #216: Pull request #56 synchronize by neginraoof
January 29, 2025 08:55 24s jean/livebench
January 29, 2025 08:55 24s
update readme
Lint #215: Commit df07121 pushed by neginraoof
January 29, 2025 08:55 24s jean/livebench
January 29, 2025 08:55 24s
Adding LiveBench
Lint #214: Pull request #56 synchronize by neginraoof
January 29, 2025 08:52 24s jean/livebench
January 29, 2025 08:52 24s
Adding LiveBench
Lint #212: Pull request #56 synchronize by neginraoof
January 29, 2025 08:48 28s jean/livebench
January 29, 2025 08:48 28s
Merge branch 'main' into jean/livebench
Lint #211: Commit eaae152 pushed by neginraoof
January 29, 2025 08:48 33s jean/livebench
January 29, 2025 08:48 33s
Update reproduced_benchmarks.md
Lint #210: Commit 94c8728 pushed by neginraoof
January 29, 2025 08:23 26s jean/livebench
January 29, 2025 08:23 26s
clean
Lint #209: Commit dfa3b88 pushed by neginraoof
January 29, 2025 08:21 32s jean/livebench
January 29, 2025 08:21 32s
clean
Lint #208: Commit c56dd25 pushed by neginraoof
January 29, 2025 08:20 33s jean/livebench
January 29, 2025 08:20 33s
Update eval_instruct.py
Lint #207: Commit 844b79f pushed by neginraoof
January 29, 2025 07:36 25s main
January 29, 2025 07:36 25s
Fixing system prompts and openai models
Lint #206: Pull request #68 synchronize by neginraoof
January 29, 2025 07:24 27s negin/livecodebench
January 29, 2025 07:24 27s
merge
Lint #205: Commit 8b1bfac pushed by neginraoof
January 29, 2025 07:24 23s negin/livecodebench
January 29, 2025 07:24 23s
lint
Lint #204: Commit ca32f69 pushed by neginraoof
January 29, 2025 07:19 25s negin/livecodebench
January 29, 2025 07:19 25s
fixing system prompt
Lint #203: Commit f24e067 pushed by neginraoof
January 29, 2025 05:56 28s negin/livecodebench
January 29, 2025 05:56 28s