Validate Translation (voras)

A regression benchmark for the voras agent's validate_all_translations_for_word() function. Tests whether the LLM correctly identifies semantically incorrect or non-lemma translations across multiple target languages.

Questions

0

Leaderboard Entries

0

Best Score

-

Leaderboard
No runs yet for this benchmark. Run it now!
Questions
No questions generated for this benchmark yet.