Run Benchmark

Syllogism Validity

This benchmark is in Tier 2 (Core).

Select Model
Models that cannot run this benchmark tier are shown as disabled based on capability level.
Benchmark execution is allowed only from direct local/private network IPs.
Cancel
Benchmark Info

A benchmark to evaluate whether a model can determine if short categorical syllogisms are logically valid.