Food Category Classification

A benchmark to evaluate classification of food items by category.

Questions

20

Leaderboard Entries

24

Best Score

100/100

Leaderboard
Rank Model Score Run Date Actions
Gemma 4 E4B IT (LMStudio)
3200 MB
100
2197.5ms median
2026-04-02 21:22:28 View Details
2 GPT-5.4 nano
100
758.0ms median
2026-03-17 19:26:50 View Details
3 GPT-5.4 mini
100
867.0ms median
2026-03-17 19:05:12 View Details
4 Claude Haiku 4.5
100
1688.0ms median
2026-03-03 06:02:44 View Details
5 Qwen3 VL 8B (LMStudio)
5000 MB
100
6767.0ms median
2026-03-03 05:44:34 View Details
6 Phi-4 (LMStudio)
9100 MB
100
12760.0ms median
2026-03-03 01:20:02 View Details
7 Ministral 8B (LMStudio)
4900 MB
100
2539.0ms median
2026-03-03 00:21:26 View Details
8 Gemma 3 12B (LMStudio)
8100 MB
100
6117.0ms median
2026-03-02 23:39:08 View Details
9 Gemma 2 9B (LMStudio)
5800 MB
100
2297.0ms median
2026-03-02 23:08:25 View Details
10 GPT-5 nano
100
1215.0ms median
2026-03-02 18:37:28 View Details
11 GPT-5 mini
100
2151.5ms median
2026-03-02 18:33:23 View Details
12 Qwen3.5 2B (LMStudio)
2700 MB
95
2765.0ms median
2026-03-03 19:30:47 View Details
13 Llama 3 8B (LMStudio)
4900 MB
95
2687.0ms median
2026-03-03 00:11:22 View Details
14 Granite 3.2 8B (LMStudio)
4900 MB
95
4370.5ms median
2026-03-02 23:52:09 View Details
15 Gemma 4 12B (LMStudio)
7560 MB
90
3670.5ms median
2026-06-03 19:37:35 View Details
16 Llama 3.2 1B (LMStudio)
1300 MB
75
558.5ms median
2026-03-03 00:15:34 View Details
17 Llama 3.1 8B (LMStudio)
4900 MB
75
2668.0ms median
2026-03-02 20:41:14 View Details
18 OLMo 3 7B (LMStudio)
4300 MB
75
3506.5ms median
2026-03-02 20:28:14 View Details
19 Qwen3.5 4B (LMStudio)
3400 MB
55
4767.0ms median
1 latency outlier
2026-03-03 21:42:18 View Details
20 Gemma 2 2B (LMStudio)
1500 MB
25
1420.0ms median
2026-03-02 23:00:46 View Details
21 SmolLM2 1.7B (LMStudio)
1100 MB
20
647.0ms median
2026-03-03 05:52:05 View Details
22 Phi-3.5 Mini (LMStudio)
2500 MB
20
623.0ms median
1 latency outlier
2026-03-03 00:34:13 View Details
23 Llama 2 7B (LMStudio)
4900 MB
20
1639.5ms median
2026-03-03 00:01:52 View Details
24 Gemma 2B (LMStudio)
1500 MB
20
753.0ms median
2026-03-02 23:13:20 View Details
Questions

Question
Which category best fits 'farfalle'?
Question payload
{
  "question_text": "Which category best fits 'farfalle'?",
  "answer_type": "multiple_choice",
  "correct_answer": "pasta",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "protein",
    "poultry",
    "pasta",
    "nut"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'cheddar'?
Question payload
{
  "question_text": "Which category best fits 'cheddar'?",
  "answer_type": "multiple_choice",
  "correct_answer": "dairy",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "dairy",
    "pasta",
    "nut",
    "vegetable"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'barley'?
Question payload
{
  "question_text": "Which category best fits 'barley'?",
  "answer_type": "multiple_choice",
  "correct_answer": "grain",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "dairy",
    "grain",
    "poultry",
    "vegetable"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'almond'?
Question payload
{
  "question_text": "Which category best fits 'almond'?",
  "answer_type": "multiple_choice",
  "correct_answer": "nut",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "fruit",
    "nut",
    "protein",
    "seafood"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'shrimp'?
Question payload
{
  "question_text": "Which category best fits 'shrimp'?",
  "answer_type": "multiple_choice",
  "correct_answer": "seafood",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "vegetable",
    "poultry",
    "dairy",
    "seafood"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'spaghetti'?
Question payload
{
  "question_text": "Which category best fits 'spaghetti'?",
  "answer_type": "multiple_choice",
  "correct_answer": "pasta",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "dairy",
    "seafood",
    "pasta",
    "legume"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'salmon'?
Question payload
{
  "question_text": "Which category best fits 'salmon'?",
  "answer_type": "multiple_choice",
  "correct_answer": "seafood",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "protein",
    "nut",
    "pasta",
    "seafood"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'spinach'?
Question payload
{
  "question_text": "Which category best fits 'spinach'?",
  "answer_type": "multiple_choice",
  "correct_answer": "vegetable",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "grain",
    "vegetable",
    "legume",
    "seafood"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'blueberry'?
Question payload
{
  "question_text": "Which category best fits 'blueberry'?",
  "answer_type": "multiple_choice",
  "correct_answer": "fruit",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "nut",
    "grain",
    "fruit",
    "poultry"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'pistachio'?
Question payload
{
  "question_text": "Which category best fits 'pistachio'?",
  "answer_type": "multiple_choice",
  "correct_answer": "nut",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "pasta",
    "nut",
    "dairy",
    "vegetable"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'kale'?
Question payload
{
  "question_text": "Which category best fits 'kale'?",
  "answer_type": "multiple_choice",
  "correct_answer": "vegetable",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "vegetable",
    "fruit",
    "poultry",
    "dairy"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'black beans'?
Question payload
{
  "question_text": "Which category best fits 'black beans'?",
  "answer_type": "multiple_choice",
  "correct_answer": "legume",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "seafood",
    "legume",
    "nut",
    "protein"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'quinoa'?
Question payload
{
  "question_text": "Which category best fits 'quinoa'?",
  "answer_type": "multiple_choice",
  "correct_answer": "grain",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "legume",
    "protein",
    "grain",
    "dairy"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'ricotta'?
Question payload
{
  "question_text": "Which category best fits 'ricotta'?",
  "answer_type": "multiple_choice",
  "correct_answer": "dairy",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "grain",
    "fruit",
    "legume",
    "dairy"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'tofu'?
Question payload
{
  "question_text": "Which category best fits 'tofu'?",
  "answer_type": "multiple_choice",
  "correct_answer": "protein",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "pasta",
    "dairy",
    "protein",
    "vegetable"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'duck'?
Question payload
{
  "question_text": "Which category best fits 'duck'?",
  "answer_type": "multiple_choice",
  "correct_answer": "poultry",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "dairy",
    "fruit",
    "grain",
    "poultry"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'tempeh'?
Question payload
{
  "question_text": "Which category best fits 'tempeh'?",
  "answer_type": "multiple_choice",
  "correct_answer": "protein",
  "category": "food",
  "difficulty": "hard",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "dairy",
    "nut",
    "grain",
    "protein"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'chickpeas'?
Question payload
{
  "question_text": "Which category best fits 'chickpeas'?",
  "answer_type": "multiple_choice",
  "correct_answer": "legume",
  "category": "food",
  "difficulty": "medium",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "dairy",
    "legume",
    "poultry",
    "fruit"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'mango'?
Question payload
{
  "question_text": "Which category best fits 'mango'?",
  "answer_type": "multiple_choice",
  "correct_answer": "fruit",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "legume",
    "poultry",
    "protein",
    "fruit"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}

Question
Which category best fits 'turkey'?
Question payload
{
  "question_text": "Which category best fits 'turkey'?",
  "answer_type": "multiple_choice",
  "correct_answer": "poultry",
  "category": "food",
  "difficulty": "easy",
  "tags": [
    "knowledge",
    "food",
    "classification"
  ],
  "choices": [
    "protein",
    "poultry",
    "vegetable",
    "seafood"
  ],
  "evaluation_criteria": {
    "exact_match": true,
    "case_sensitive": false,
    "contains": false,
    "required_fields": [],
    "tolerance": 0.0,
    "alternatives": []
  }
}