Run Details #23
Model
GPT-5 mini
Score
97
Correct (≥100)
39/40
Incorrect (<100)
1
Avg Time
1118.8ms
Tokens Used
0
Cost
$0.0031
Benchmark:
0023_word_problems
Run Date: 2026-02-26 01:18:54
Questions (40)
Question: There are 7 rows of stickers with 8 stickers in each row. How many stickers are there in total?
Expected
56.0
Provided
{
"result": 56
}
Question: There are 7 rows of cookies with 12 cookies in each row. How many cookies are there in total?
Expected
84.0
Provided
{
"result": 84
}
Question: Carlos had 79 dollars. After spending 59 dollars, how much money does Carlos have left?
Expected
20.0
Provided
{
"result": 20
}
Question: Julia had 40 dollars. After spending 19 dollars, how much money does Julia have left?
Expected
21.0
Provided
{
"result": 21
}
Question: There are 6 rows of books with 12 books in each row. How many books are there in total?
Expected
72.0
Provided
{
"result": 72
}
Question: Hannah had 30 dollars. After spending 21 dollars, how much money does Hannah have left?
Expected
9.0
Provided
{
"result": 9
}
Question: There are 5 rows of apples with 4 apples in each row. How many apples are there in total?
Expected
20.0
Provided
{
"result": 20
}
Question: A car travels at 79 km/h. How far does it travel in 8 hours?
Expected
632.0
Provided
{
"result": 632
}
Question: Fatima had 65 dollars. After spending 19 dollars, how much money does Fatima have left?
Expected
46.0
Provided
{
"result": 46
}
Question: David has 18 apples and Ivan has 7 apples. How many apples do they have together?
Expected
25.0
Provided
{
"result": 25
}
Question: Emma has 13 cards and David has 12 cards. How many cards do they have together?
Expected
25.0
Provided
{
"result": 25
}
Question: George had 41 dollars. After spending 23 dollars, how much money does George have left?
Expected
18.0
Provided
{
"result": 18
}
Question: A car travels at 55 km/h. How far does it travel in 8 hours?
Expected
440.0
Provided
{
"result": 440
}
Question: A car travels at 60 km/h. How far does it travel in 3 hours?
Expected
180.0
Provided
{
"result": 180
}
Question: Carlos had 51 dollars. After spending 50 dollars, how much money does Carlos have left?
Expected
1.0
Provided
{
"result": 1
}
Question: Carlos baked 21 cookies and wants to give an equal number to each of 7 friends. How many cookies does each friend get?
Expected
3.0
Provided
{
"result": 3
}
Question: Fatima baked 72 cookies and wants to give an equal number to each of 8 friends. How many cookies does each friend get?
Expected
9.0
Provided
{
"result": 9
}
Question: A store had 61 apples. They sold 14 apples. How many apples are left?
Expected
47.0
Provided
{
"result": 47
}
Question: Hannah had 44 dollars. After spending 21 dollars, how much money does Hannah have left?
Expected
23.0
Provided
{
"result": 23
}
Question: There are 4 rows of apples with 7 apples in each row. How many apples are there in total?
Expected
28.0
Provided
{
"result": 28
}
Question: A store has 28 apples, 28 oranges, and 21 bananas. A customer buys 20 oranges. How many oranges does the store have left?
Expected
8.0
Provided
{
"result": 8
}
Question: A store has 27 apples, 44 oranges, and 27 bananas. A customer buys 5 oranges. How many oranges does the store have left?
Expected
39.0
Provided
{
"result": 39
}
Question: A store had 74 books. They sold 32 books. How many books are left?
Expected
42.0
Provided
{
"result": 42
}
Question: A store has 14 apples, 24 oranges, and 30 bananas. A customer buys 16 oranges. How many oranges does the store have left?
Expected
8.0
Provided
{
"result": 8
}
Question: A train travels 40 km in the first hour, 85 km in the second hour, and 59 km in the third hour. How far does it travel in the first two hours?
Expected
125.0
Provided
{
"result": 125
}
Question: A train travels 63 km in the first hour, 62 km in the second hour, and 110 km in the third hour. How far does it travel in the first two hours?
Expected
125.0
Provided
{
"result": 125
}
Question: There are 5 rows of stickers with 3 stickers in each row. How many stickers are there in total?
Expected
15.0
Provided
{
"result": 15
}
Question: Bob has 22 marbles and George has 22 marbles. How many marbles do they have together?
Expected
44.0
Provided
{
"result": 44
}
Question: A store has 15 apples, 22 oranges, and 27 bananas. A customer buys 4 oranges. How many oranges does the store have left?
Expected
18.0
Provided
{
"result": 18
}
Question: A car travels at 43 km/h. How far does it travel in 2 hours?
Expected
86.0
Provided
{
"result": 86
}
Question: There are 10 rows of cookies with 2 cookies in each row. How many cookies are there in total?
Expected
20.0
Provided
{
"result": 20
}
Question: Bob baked 24 cookies and wants to give an equal number to each of 2 friends. How many cookies does each friend get?
Expected
12.0
Provided
{
"result": 12
}
Question: Bob had 80 dollars. After spending 58 dollars, how much money does Bob have left?
Expected
22.0
Provided
{
"result": 22
}
Question: A train travels 81 km in the first hour, 90 km in the second hour, and 72 km in the third hour. How far does it travel in the first two hours?
Expected
171.0
Provided
{
"result": 171
}
Question: A train travels 101 km in the first hour, 68 km in the second hour, and 42 km in the third hour. How far does it travel in the first two hours?
Expected
169.0
Provided
{
"result": 169
}
Question: A store had 77 cards. They sold 44 cards. How many cards are left?
Expected
33.0
Provided
{
"result": 33
}
Question: George has 9 marbles, Alice has 14 marbles, and Julia has 28 marbles. How many marbles do Alice and Julia have together?
Expected
42.0
Provided
{
"result": 42
}
Question: Julia had 44 dollars. After spending 13 dollars, how much money does Julia have left?
Expected
31.0
Provided
-
Question: A store had 20 cookies. They sold 7 cookies. How many cookies are left?
Expected
13.0
Provided
{
"result": 13
}
Question: A store has 44 apples, 35 oranges, and 10 bananas. A customer buys 28 oranges. How many oranges does the store have left?
Expected
7.0
Provided
{
"result": 7
}