Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Score: 77
Model: openai/gpt-4o
Average duration: 9s
Average tokens: 269
Average cost: $0.00

Test                  Score  Duration  Tokens
opper_memes_meme_01   100    9s        267
opper_memes_meme_02   100    7s        267
opper_memes_meme_03   100    9s        300
opper_memes_meme_04   0      10s       274
opper_memes_meme_05   100    7s        257
opper_memes_meme_06   100    10s       253
opper_memes_meme_07   100    26s       252
opper_memes_meme_08   100    10s       268
opper_memes_meme_09   100    7s        269
opper_memes_meme_10   100    9s        265
opper_memes_meme_11   100    7s        267
opper_memes_meme_12   0      11s       281
opper_memes_meme_13   100    10s       251
opper_memes_meme_14   100    7s        256
opper_memes_meme_15   100    7s        276
opper_memes_meme_16   100    10s       283
opper_memes_meme_17   100    7s        282
opper_memes_meme_18   0      10s       282
opper_memes_meme_19   100    10s       315
opper_memes_meme_20   0      9s        308
opper_memes_meme_21   100    10s       248
opper_memes_meme_22   100    7s        248
opper_memes_meme_23   100    7s        249
opper_memes_meme_24   100    7s        249
opper_memes_meme_25   0      10s       284
opper_memes_meme_26   100    10s       256
opper_memes_meme_27   0      8s        257
opper_memes_meme_28   100    7s        256
opper_memes_meme_29   0      9s        277
opper_memes_meme_30   100    7s        264
opper_memes_meme_31   100    2s        266
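
The headline figures appear to be plain averages of the per-test values listed above. The page does not state how it aggregates, so the sketch below assumes an unweighted mean rounded to the nearest integer. The variable names and the rounded_mean helper are illustrative and are not part of the benchmark's own tooling.

```python
from statistics import mean

# Per-test values for opper_memes_meme_01 through opper_memes_meme_31, in order.
scores = [100, 100, 100, 0, 100, 100, 100, 100, 100, 100, 100, 0, 100, 100, 100, 100,
          100, 0, 100, 0, 100, 100, 100, 100, 0, 100, 0, 100, 0, 100, 100]
durations_s = [9, 7, 9, 10, 7, 10, 26, 10, 7, 9, 7, 11, 10, 7, 7, 10,
               7, 10, 10, 9, 10, 7, 7, 7, 10, 10, 8, 7, 9, 7, 2]
tokens = [267, 267, 300, 274, 257, 253, 252, 268, 269, 265, 267, 281, 251, 256, 276, 283,
          282, 282, 315, 308, 248, 248, 249, 249, 284, 256, 257, 256, 277, 264, 266]


def rounded_mean(values):
    """Unweighted mean rounded to the nearest integer (assumed aggregation)."""
    return round(mean(values))


passed = sum(1 for s in scores if s == 100)
print(f"passed {passed} of {len(scores)} tests")
print("score:", rounded_mean(scores))
print("average duration (s):", rounded_mean(durations_s))
print("average tokens:", rounded_mean(tokens))
```

Under that assumption, 24 of 31 tests pass, and the three means round to 77, 9s, and 269, which matches the summary figures on the page.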