Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

94
gcp/gemini-2.5-pro
Average duration
6s
Average tokens
625
Average cost
$0.00
100
5s
542
opper_memes_meme_01
0
7s
542
opper_memes_meme_02
100
7s
729
opper_memes_meme_03
100
7s
587
opper_memes_meme_04
100
3s
371
opper_memes_meme_05
100
6s
611
opper_memes_meme_06
100
4s
368
opper_memes_meme_07
100
5s
437
opper_memes_meme_08
100
5s
525
opper_memes_meme_09
100
5s
542
opper_memes_meme_10
100
8s
869
opper_memes_meme_11
0
19s
2139
opper_memes_meme_12
100
4s
417
opper_memes_meme_13
100
3s
413
opper_memes_meme_14
100
4s
487
opper_memes_meme_15
100
6s
574
opper_memes_meme_16
100
5s
552
opper_memes_meme_17
100
7s
681
opper_memes_meme_18
100
8s
903
opper_memes_meme_19
100
7s
763
opper_memes_meme_20
100
4s
461
opper_memes_meme_21
100
6s
464
opper_memes_meme_22
100
5s
475
opper_memes_meme_23
100
5s
455
opper_memes_meme_24
100
4s
590
opper_memes_meme_25
100
5s
491
opper_memes_meme_26
100
4s
427
opper_memes_meme_27
100
4s
465
opper_memes_meme_28
100
12s
1334
opper_memes_meme_29
100
5s
543
opper_memes_meme_30
100
5s
619
opper_memes_meme_31