Validate Translation (voras)
A regression benchmark for the voras agent's validate_all_translations_for_word() function. It tests whether the LLM correctly flags translations that are semantically incorrect or not in lemma form, across multiple target languages.
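As a rough illustration of what a benchmark of this kind measures, the sketch below scores a validator against hand-labeled cases. Everything in it is an assumption made for illustration: the `Case` type, the `score` function, the `(word, translations) -> flagged languages` signature, and the whitespace-based `stub_validator` are hypothetical and are not taken from the voras codebase or from validate_all_translations_for_word() itself.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set


@dataclass
class Case:
    """One hypothetical benchmark question."""
    word: str                     # source word
    translations: Dict[str, str]  # language code -> candidate translation
    bad_languages: Set[str]       # languages labeled as incorrect or non-lemma


# Assumed validator shape: returns the language codes it considers wrong.
Validator = Callable[[str, Dict[str, str]], Set[str]]


def score(cases: List[Case], validator: Validator) -> float:
    """Fraction of cases where the validator flags exactly the labeled languages."""
    if not cases:
        return 0.0
    hits = sum(
        1 for c in cases
        if set(validator(c.word, c.translations)) == c.bad_languages
    )
    return hits / len(cases)


def stub_validator(word: str, translations: Dict[str, str]) -> Set[str]:
    """Stand-in for the LLM call: flags any translation containing whitespace."""
    return {lang for lang, text in translations.items() if " " in text.strip()}


cases = [
    # "les chiens" is an inflected phrase, not a lemma; the stub catches it.
    Case("dog", {"de": "Hund", "fr": "les chiens"}, {"fr"}),
    # "comió" is a conjugated form; the whitespace stub misses it.
    Case("eat", {"de": "essen", "es": "comió"}, {"es"}),
]

print(score(cases, stub_validator))  # 0.5
```

Exact-match scoring per word is only one possible design choice; a per-language precision and recall breakdown would give partial credit when a validator catches some but not all of the bad translations.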
Questions: 0
Leaderboard Entries: 0
Best Score: -
Leaderboard
No runs yet for this benchmark.
Questions
No questions generated for this benchmark yet.