Syllogism Validity
A benchmark to evaluate whether a model can determine if short categorical syllogisms are logically valid.
Questions
16
Leaderboard Entries
23
Best Score
100/100
Leaderboard
| Rank | Model | Score | Run Date | Actions |
|---|---|---|---|---|
|
Gemma 4 E4B IT (LMStudio)
3200 MB |
100
7651.0ms median
|
2026-04-02 21:20:48 | View Details | |
| 2 | GPT-5.4 nano |
100
974.0ms median
|
2026-03-17 19:26:13 | View Details |
| 3 | GPT-5.4 mini |
100
903.5ms median
|
2026-03-17 19:04:33 | View Details |
| 4 | Claude Haiku 4.5 |
100
2677.0ms median
|
2026-03-03 06:01:26 | View Details |
| 5 |
Phi-4 (LMStudio)
9100 MB |
100
18457.5ms median
|
2026-03-03 01:09:50 | View Details |
| 6 | GPT-5 nano |
100
1534.0ms median
|
2026-03-02 18:36:27 | View Details |
| 7 | GPT-5 mini |
100
2448.5ms median
|
2026-03-02 18:31:55 | View Details |
| 8 |
Gemma 3 12B (LMStudio)
8100 MB |
94
9676.5ms median
|
2026-03-02 23:33:34 | View Details |
| 9 |
Gemma 4 12B (LMStudio)
7560 MB |
88
10932.5ms median
|
2026-06-03 19:34:52 | View Details |
| 10 |
Gemma 2 9B (LMStudio)
5800 MB |
88
6058.5ms median
|
2026-03-02 23:05:51 | View Details |
| 11 |
Ministral 8B (LMStudio)
4900 MB |
75
3686.0ms median
|
2026-03-03 00:19:25 | View Details |
| 12 |
Qwen3.5 4B (LMStudio)
3400 MB |
69
15473.0ms median
3 latency outliers
|
2026-03-03 21:37:20 | View Details |
| 13 |
Qwen3.5 2B (LMStudio)
2700 MB |
69
8663.0ms median
|
2026-03-03 19:27:49 | View Details |
| 14 |
Llama 3.1 8B (LMStudio)
4900 MB |
69
5841.0ms median
|
2026-03-02 20:38:08 | View Details |
| 15 |
Phi-3.5 Mini (LMStudio)
2500 MB |
62
3783.0ms median
|
2026-03-03 00:30:04 | View Details |
| 16 |
Llama 3 8B (LMStudio)
4900 MB |
62
4346.0ms median
|
2026-03-03 00:08:56 | View Details |
| 17 |
OLMo 3 7B (LMStudio)
4300 MB |
62
7497.5ms median
|
2026-03-02 20:24:48 | View Details |
| 18 |
Granite 3.2 8B (LMStudio)
4900 MB |
50
8270.5ms median
|
2026-03-02 23:47:52 | View Details |
| 19 |
Llama 3.2 1B (LMStudio)
1300 MB |
31
925.5ms median
|
2026-03-03 00:14:51 | View Details |
| 20 |
Gemma 2 2B (LMStudio)
1500 MB |
19
3209.0ms median
|
2026-03-02 22:59:13 | View Details |
| 21 |
SmolLM2 1.7B (LMStudio)
1100 MB |
12
1063.0ms median
|
2026-03-03 05:51:32 | View Details |
| 22 |
Llama 2 7B (LMStudio)
4900 MB |
6
2164.0ms median
|
2026-03-03 00:00:16 | View Details |
| 23 |
Gemma 2B (LMStudio)
1500 MB |
0
934.5ms median
|
2026-03-02 23:12:49 | View Details |
Questions
Question
Evaluate this syllogism:
Premise 1: All mammals are warm-blooded.
Premise 2: All whales are mammals.
Conclusion: Therefore, all whales are warm-blooded.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All mammals are warm-blooded.\nPremise 2: All whales are mammals.\nConclusion: Therefore, all whales are warm-blooded.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: No prime numbers greater than 2 are even.
Premise 2: 11 is a prime number greater than 2.
Conclusion: Therefore, 11 is not even.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: No prime numbers greater than 2 are even.\nPremise 2: 11 is a prime number greater than 2.\nConclusion: Therefore, 11 is not even.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: Some engineers are musicians.
Premise 2: All musicians are creative.
Conclusion: Therefore, some engineers are creative.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: Some engineers are musicians.\nPremise 2: All musicians are creative.\nConclusion: Therefore, some engineers are creative.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All squares are rectangles.
Premise 2: No rectangles are circles.
Conclusion: Therefore, no squares are circles.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All squares are rectangles.\nPremise 2: No rectangles are circles.\nConclusion: Therefore, no squares are circles.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All philosophers are thinkers.
Premise 2: Some thinkers are scientists.
Conclusion: Therefore, some philosophers are scientists.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All philosophers are thinkers.\nPremise 2: Some thinkers are scientists.\nConclusion: Therefore, some philosophers are scientists.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Invalid",
"category": "logic",
"difficulty": "hard",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All planets orbit stars.
Premise 2: Earth is a planet.
Conclusion: Therefore, Earth orbits a star.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All planets orbit stars.\nPremise 2: Earth is a planet.\nConclusion: Therefore, Earth orbits a star.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: No insects are mammals.
Premise 2: All bees are insects.
Conclusion: Therefore, no bees are mammals.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: No insects are mammals.\nPremise 2: All bees are insects.\nConclusion: Therefore, no bees are mammals.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: No reptiles are warm-blooded.
Premise 2: All snakes are reptiles.
Conclusion: Therefore, no snakes are warm-blooded.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: No reptiles are warm-blooded.\nPremise 2: All snakes are reptiles.\nConclusion: Therefore, no snakes are warm-blooded.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All doctors have medical degrees.
Premise 2: Sarah has a medical degree.
Conclusion: Therefore, Sarah is a doctor.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All doctors have medical degrees.\nPremise 2: Sarah has a medical degree.\nConclusion: Therefore, Sarah is a doctor.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Invalid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: Some students are athletes.
Premise 2: All athletes are disciplined.
Conclusion: Therefore, some students are disciplined.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: Some students are athletes.\nPremise 2: All athletes are disciplined.\nConclusion: Therefore, some students are disciplined.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "hard",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All poets are writers.
Premise 2: Some writers are teachers.
Conclusion: Therefore, some poets are teachers.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All poets are writers.\nPremise 2: Some writers are teachers.\nConclusion: Therefore, some poets are teachers.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Invalid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: Some birds are flightless.
Premise 2: All penguins are birds.
Conclusion: Therefore, some penguins are flightless.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: Some birds are flightless.\nPremise 2: All penguins are birds.\nConclusion: Therefore, some penguins are flightless.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Invalid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: No amphibians are mammals.
Premise 2: All frogs are amphibians.
Conclusion: Therefore, no frogs are mammals.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: No amphibians are mammals.\nPremise 2: All frogs are amphibians.\nConclusion: Therefore, no frogs are mammals.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "easy",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All cats are mammals.
Premise 2: Some pets are cats.
Conclusion: Therefore, some pets are mammals.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All cats are mammals.\nPremise 2: Some pets are cats.\nConclusion: Therefore, some pets are mammals.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Valid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All novels are books.
Premise 2: Some books are illustrated.
Conclusion: Therefore, some novels are illustrated.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All novels are books.\nPremise 2: Some books are illustrated.\nConclusion: Therefore, some novels are illustrated.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Invalid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}
Question
Evaluate this syllogism:
Premise 1: All painters are artists.
Premise 2: Some artists are musicians.
Conclusion: Therefore, some painters are musicians.
Is the conclusion logically valid given the premises?
Question payload
{
"question_text": "Evaluate this syllogism:\nPremise 1: All painters are artists.\nPremise 2: Some artists are musicians.\nConclusion: Therefore, some painters are musicians.\n\nIs the conclusion logically valid given the premises?",
"answer_type": "multiple_choice",
"correct_answer": "Invalid",
"category": "logic",
"difficulty": "medium",
"tags": [
"knowledge",
"logic",
"syllogism"
],
"choices": [
"Valid",
"Invalid"
],
"evaluation_criteria": {
"exact_match": true,
"case_sensitive": false,
"contains": false,
"required_fields": [],
"tolerance": 0.0,
"alternatives": []
}
}