Simple Arithmetic

A benchmark to evaluate a model's ability to perform basic arithmetic: addition, subtraction, multiplication, and division.

Questions

40

Total Runs

19

Best Score

100/100

Leaderboard
Rank Model Score Run Date Actions
GPT-5 nano
100
2026-02-26 00:49:30 View Details
2 GPT-5 mini
100
2026-02-26 01:17:15 View Details
3 Gemma 2 9B (LMStudio)
5800 MB
100
2026-02-28 14:59:40 View Details
4 Gemma 3 12B (LMStudio)
8100 MB
100
2026-02-28 15:50:59 View Details
5 Granite 3.2 8B (LMStudio)
4900 MB
100
2026-02-28 16:17:33 View Details
6 Llama 3 8B (LMStudio)
4900 MB
100
2026-02-28 16:45:31 View Details
7 Llama 3.2 1B (LMStudio)
1300 MB
100
2026-02-28 17:12:39 View Details
8 Ministral 8B (LMStudio)
4900 MB
100
2026-02-28 17:22:01 View Details
9 OLMo 3 7B (LMStudio)
4300 MB
100
2026-02-28 17:36:22 View Details
10 Qwen3 1.7B (LMStudio)
1100 MB
100
2026-02-28 18:04:53 View Details
11 Qwen3 4B (LMStudio)
2800 MB
100
2026-02-28 18:12:40 View Details
12 Qwen3 VL 8B (LMStudio)
5000 MB
100
2026-02-28 18:36:38 View Details
13 Claude Haiku 4.5
100
2026-02-28 19:43:52 View Details
14 Gemma 2 2B (LMStudio)
1500 MB
97
2026-02-28 03:21:50 View Details
15 Gemma 2B (LMStudio)
1500 MB
97
2026-02-28 15:34:27 View Details
16 SmolLM2 1.7B (LMStudio)
1100 MB
95
2026-02-28 19:18:55 View Details
17 Llama 3.1 8B (LMStudio)
4900 MB
90
2026-02-28 17:00:54 View Details
18 Llama 2 7B (LMStudio)
4900 MB
57
2026-02-28 03:04:20 View Details
19 Phi-3.5 Mini (LMStudio)
2500 MB
10
2026-02-28 17:48:52 View Details
Questions

Question
What is 16 ÷ 8?
Question payload
{
  "question_text": "What is 16 ÷ 8?",
  "answer_type": "numeric",
  "correct_answer": 2.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 47 - 7?
Question payload
{
  "question_text": "What is 47 - 7?",
  "answer_type": "numeric",
  "correct_answer": 40.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "subtraction"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 12 ÷ 12?
Question payload
{
  "question_text": "What is 12 ÷ 12?",
  "answer_type": "numeric",
  "correct_answer": 1.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 3 × 7?
Question payload
{
  "question_text": "What is 3 × 7?",
  "answer_type": "numeric",
  "correct_answer": 21.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 46 + 13?
Question payload
{
  "question_text": "What is 46 + 13?",
  "answer_type": "numeric",
  "correct_answer": 59.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 11 × 11?
Question payload
{
  "question_text": "What is 11 × 11?",
  "answer_type": "numeric",
  "correct_answer": 121.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 10 + 50?
Question payload
{
  "question_text": "What is 10 + 50?",
  "answer_type": "numeric",
  "correct_answer": 60.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 12 × 4?
Question payload
{
  "question_text": "What is 12 × 4?",
  "answer_type": "numeric",
  "correct_answer": 48.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 64 ÷ 4?
Question payload
{
  "question_text": "What is 64 ÷ 4?",
  "answer_type": "numeric",
  "correct_answer": 16.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 2 × 8?
Question payload
{
  "question_text": "What is 2 × 8?",
  "answer_type": "numeric",
  "correct_answer": 16.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 6 × 2?
Question payload
{
  "question_text": "What is 6 × 2?",
  "answer_type": "numeric",
  "correct_answer": 12.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 4 × 4?
Question payload
{
  "question_text": "What is 4 × 4?",
  "answer_type": "numeric",
  "correct_answer": 16.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 39 - 23?
Question payload
{
  "question_text": "What is 39 - 23?",
  "answer_type": "numeric",
  "correct_answer": 16.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "subtraction"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 10 × 6?
Question payload
{
  "question_text": "What is 10 × 6?",
  "answer_type": "numeric",
  "correct_answer": 60.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 41 - 38?
Question payload
{
  "question_text": "What is 41 - 38?",
  "answer_type": "numeric",
  "correct_answer": 3.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "subtraction"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 12 ÷ 12?
Question payload
{
  "question_text": "What is 12 ÷ 12?",
  "answer_type": "numeric",
  "correct_answer": 1.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 23 - 8?
Question payload
{
  "question_text": "What is 23 - 8?",
  "answer_type": "numeric",
  "correct_answer": 15.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "subtraction"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 23 + 18?
Question payload
{
  "question_text": "What is 23 + 18?",
  "answer_type": "numeric",
  "correct_answer": 41.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 11 - 11?
Question payload
{
  "question_text": "What is 11 - 11?",
  "answer_type": "numeric",
  "correct_answer": 0.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "subtraction"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 7 × 7?
Question payload
{
  "question_text": "What is 7 × 7?",
  "answer_type": "numeric",
  "correct_answer": 49.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 6 ÷ 2?
Question payload
{
  "question_text": "What is 6 ÷ 2?",
  "answer_type": "numeric",
  "correct_answer": 3.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 90 + 9?
Question payload
{
  "question_text": "What is 90 + 9?",
  "answer_type": "numeric",
  "correct_answer": 99.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 4 × 10?
Question payload
{
  "question_text": "What is 4 × 10?",
  "answer_type": "numeric",
  "correct_answer": 40.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 6 × 6?
Question payload
{
  "question_text": "What is 6 × 6?",
  "answer_type": "numeric",
  "correct_answer": 36.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 49 + 6?
Question payload
{
  "question_text": "What is 49 + 6?",
  "answer_type": "numeric",
  "correct_answer": 55.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 9 ÷ 9?
Question payload
{
  "question_text": "What is 9 ÷ 9?",
  "answer_type": "numeric",
  "correct_answer": 1.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 12 × 4?
Question payload
{
  "question_text": "What is 12 × 4?",
  "answer_type": "numeric",
  "correct_answer": 48.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 95 ÷ 5?
Question payload
{
  "question_text": "What is 95 ÷ 5?",
  "answer_type": "numeric",
  "correct_answer": 19.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 3 × 4?
Question payload
{
  "question_text": "What is 3 × 4?",
  "answer_type": "numeric",
  "correct_answer": 12.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 73 + 76?
Question payload
{
  "question_text": "What is 73 + 76?",
  "answer_type": "numeric",
  "correct_answer": 149.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 95 ÷ 5?
Question payload
{
  "question_text": "What is 95 ÷ 5?",
  "answer_type": "numeric",
  "correct_answer": 19.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 60 ÷ 4?
Question payload
{
  "question_text": "What is 60 ÷ 4?",
  "answer_type": "numeric",
  "correct_answer": 15.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 63 - 48?
Question payload
{
  "question_text": "What is 63 - 48?",
  "answer_type": "numeric",
  "correct_answer": 15.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "subtraction"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 32 ÷ 8?
Question payload
{
  "question_text": "What is 32 ÷ 8?",
  "answer_type": "numeric",
  "correct_answer": 4.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 8 × 6?
Question payload
{
  "question_text": "What is 8 × 6?",
  "answer_type": "numeric",
  "correct_answer": 48.0,
  "category": "simple_arithmetic",
  "difficulty": "easy",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 12 × 6?
Question payload
{
  "question_text": "What is 12 × 6?",
  "answer_type": "numeric",
  "correct_answer": 72.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "multiplication"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 121 ÷ 11?
Question payload
{
  "question_text": "What is 121 ÷ 11?",
  "answer_type": "numeric",
  "correct_answer": 11.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 168 ÷ 12?
Question payload
{
  "question_text": "What is 168 ÷ 12?",
  "answer_type": "numeric",
  "correct_answer": 14.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 61 + 32?
Question payload
{
  "question_text": "What is 61 + 32?",
  "answer_type": "numeric",
  "correct_answer": 93.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "addition"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
What is 28 ÷ 2?
Question payload
{
  "question_text": "What is 28 ÷ 2?",
  "answer_type": "numeric",
  "correct_answer": 14.0,
  "category": "simple_arithmetic",
  "difficulty": "medium",
  "tags": [
    "arithmetic",
    "division"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}