Skip to content

Commit

Permalink
Fix LLM as judge metric
Browse files Browse the repository at this point in the history
  • Loading branch information
yifanmai committed Feb 4, 2025
1 parent 472078b commit 22052ad
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions src/helm/benchmark/static/schema_tables.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -179,10 +179,10 @@ metrics:
lower_is_better: false

# SciGen Accuracy
- name: llama_3_8b_chat_hf_together_ai_template_table2text_single_turn_with_reference
- name: llama_3_1_70b_instruct_cross_provider_template_table2text_single_turn_with_reference
display_name: Rating
short_display_name: Rating
description: Rating by Llama 3 (8B) LLM as judge
description: Rating by Llama 3.1 (70B) LLM as judge
lower_is_better: false

perturbations: []
Expand Down

0 comments on commit 22052ad

Please sign in to comment.