Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

68
openai/gpt-4.1-mini
Average duration
11s
Average tokens
266
Average cost
$0.00
100
6s
263
opper_memes_meme_01
0
5s
266
opper_memes_meme_02
100
5s
300
opper_memes_meme_03
100
6s
271
opper_memes_meme_04
100
5s
257
opper_memes_meme_05
100
6s
253
opper_memes_meme_06
100
5s
252
opper_memes_meme_07
100
17s
264
opper_memes_meme_08
100
1m 35s
269
opper_memes_meme_09
100
6s
265
opper_memes_meme_10
100
5s
263
opper_memes_meme_11
0
5s
281
opper_memes_meme_12
100
5s
251
opper_memes_meme_13
0
18s
252
opper_memes_meme_14
100
6s
273
opper_memes_meme_15
100
5s
283
opper_memes_meme_16
100
26s
278
opper_memes_meme_17
0
5s
282
opper_memes_meme_18
100
6s
311
opper_memes_meme_19
0
6s
301
opper_memes_meme_20
0
5s
244
opper_memes_meme_21
100
5s
244
opper_memes_meme_22
100
5s
249
opper_memes_meme_23
0
34s
249
opper_memes_meme_24
0
5s
280
opper_memes_meme_25
100
6s
252
opper_memes_meme_26
0
5s
252
opper_memes_meme_27
100
6s
252
opper_memes_meme_28
0
5s
277
opper_memes_meme_29
100
5s
264
opper_memes_meme_30
100
2s
262
opper_memes_meme_31