Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Model: gcp/gemini-2.5-flash
Score: 90
Average duration: 4s
Average tokens: 488
Average cost: $0.00
Test                  Score  Duration  Tokens
opper_memes_meme_01   100    4s        297
opper_memes_meme_02   100    3s        309
opper_memes_meme_03   100    4s        397
opper_memes_meme_04   100    4s        354
opper_memes_meme_05   100    3s        326
opper_memes_meme_06   100    3s        301
opper_memes_meme_07   100    4s        338
opper_memes_meme_08   100    3s        364
opper_memes_meme_09   100    3s        311
opper_memes_meme_10   100    3s        419
opper_memes_meme_11   100    3s        403
opper_memes_meme_12   100    3s        473
opper_memes_meme_13   100    3s        322
opper_memes_meme_14   100    3s        297
opper_memes_meme_15   100    2s        348
opper_memes_meme_16   100    4s        487
opper_memes_meme_17   100    2s        387
opper_memes_meme_18   0      10s       1862
opper_memes_meme_19   100    3s        589
opper_memes_meme_20   0      3s        599
opper_memes_meme_21   0      2s        336
opper_memes_meme_22   100    3s        393
opper_memes_meme_23   100    3s        346
opper_memes_meme_24   100    3s        426
opper_memes_meme_25   100    11s       2089
opper_memes_meme_26   100    3s        332
opper_memes_meme_27   100    3s        328
opper_memes_meme_28   100    3s        300
opper_memes_meme_29   100    3s        482
opper_memes_meme_30   100    3s        456
opper_memes_meme_31   100    3s        447
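
The summary figures appear to follow from the per-test results. The sketch below recomputes them under two assumptions that the page itself does not state: that the overall score is the plain mean of the per-test scores, and that each average is rounded to the nearest integer. The names `ROWS` and `summarize` are hypothetical and used only for illustration. The average cost is not recomputed because no per-test cost is listed.

```python
# Per-test results copied from the listing, in order meme_01 .. meme_31.
# Each tuple is (score, duration in seconds, tokens).
ROWS = [
    (100, 4, 297), (100, 3, 309), (100, 4, 397), (100, 4, 354),
    (100, 3, 326), (100, 3, 301), (100, 4, 338), (100, 3, 364),
    (100, 3, 311), (100, 3, 419), (100, 3, 403), (100, 3, 473),
    (100, 3, 322), (100, 3, 297), (100, 2, 348), (100, 4, 487),
    (100, 2, 387), (0, 10, 1862), (100, 3, 589), (0, 3, 599),
    (0, 2, 336), (100, 3, 393), (100, 3, 346), (100, 3, 426),
    (100, 11, 2089), (100, 3, 332), (100, 3, 328), (100, 3, 300),
    (100, 3, 482), (100, 3, 456), (100, 3, 447),
]


def summarize(rows):
    """Return (score, avg duration, avg tokens), each rounded to an integer.

    Assumption: plain arithmetic means with nearest-integer rounding.
    """
    n = len(rows)
    score = round(sum(r[0] for r in rows) / n)
    duration = round(sum(r[1] for r in rows) / n)
    tokens = round(sum(r[2] for r in rows) / n)
    return score, duration, tokens


print(summarize(ROWS))  # (90, 4, 488)
```

Under these assumptions the result matches the reported summary: 28 of 31 tests score 100, giving a mean of about 90.3. The three tests that score 0 are opper_memes_meme_18, opper_memes_meme_20, and opper_memes_meme_21.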