Run Benchmark

Syllogism Validity

This benchmark is in Tier 2 (Core).

Model

Models whose capability level is too low for this benchmark tier are shown as disabled.

Benchmark execution is allowed only for requests that come directly from local or private network IP addresses.
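As a rough illustration of this kind of restriction, the sketch below checks whether a client address is loopback or private using Python's standard `ipaddress` module. The function name `is_local_or_private` is hypothetical, and reading "direct" as the connecting peer's address (rather than a forwarded header) is an assumption; the tool's actual check is not shown here.

```python
import ipaddress


def is_local_or_private(ip: str) -> bool:
    """Return True if `ip` is a loopback or private-range address.

    Hypothetical helper for illustration only; it is not the tool's
    actual access check.
    """
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        # Anything that does not parse as an IP address is rejected.
        return False
    return addr.is_loopback or addr.is_private
```

Under this sketch, a request from `192.168.1.10` or `127.0.0.1` would be accepted, while one from a public address such as `8.8.8.8` would be refused.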


This benchmark evaluates whether a model can determine if short categorical syllogisms are logically valid.
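For readers unfamiliar with the task, the following is a minimal sketch of how the ground-truth validity of a categorical syllogism can be decided mechanically. It is not the benchmark's own scoring code, and the tuple encoding `(form, subject, predicate)` is an assumption made for illustration. Statements use the four classical forms A ("All x are y"), E ("No x are y"), I ("Some x are y") and O ("Some x are not y") over the terms S, M and P. The checker enumerates every way the eight Venn-diagram regions can be inhabited and looks for a counterexample, under the modern reading in which universal statements do not imply that anything exists.

```python
from itertools import product

# The 8 Venn regions for three terms; each region records membership in (S, M, P).
REGIONS = list(product([False, True], repeat=3))
TERM_INDEX = {"S": 0, "M": 1, "P": 2}


def holds(statement, inhabited):
    """Evaluate one categorical statement against the inhabited regions."""
    form, x, y = statement
    i, j = TERM_INDEX[x], TERM_INDEX[y]
    some_x_is_y = any(r[i] and r[j] for r in inhabited)
    some_x_is_not_y = any(r[i] and not r[j] for r in inhabited)
    if form == "A":  # All x are y
        return not some_x_is_not_y
    if form == "E":  # No x are y
        return not some_x_is_y
    if form == "I":  # Some x are y
        return some_x_is_y
    if form == "O":  # Some x are not y
        return some_x_is_not_y
    raise ValueError(f"unknown form: {form!r}")


def is_valid(premise1, premise2, conclusion):
    """Valid iff no model makes both premises true and the conclusion false."""
    for mask in product([False, True], repeat=len(REGIONS)):
        inhabited = [r for r, on in zip(REGIONS, mask) if on]
        if (holds(premise1, inhabited)
                and holds(premise2, inhabited)
                and not holds(conclusion, inhabited)):
            return False  # counterexample found
    return True
```

For example, "All M are P; all S are M; therefore all S are P" is encoded as `is_valid(("A", "M", "P"), ("A", "S", "M"), ("A", "S", "P"))` and is valid, whereas swapping the first premise to "All P are M" yields the classic undistributed-middle fallacy, which the checker rejects.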