Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

94
gcp/gemini-flash-latest
Average duration
4s
Average tokens
658
Average cost
$0.00
100
2s
369
opper_memes_meme_01
100
5s
802
opper_memes_meme_02
100
3s
582
opper_memes_meme_03
100
4s
671
opper_memes_meme_04
100
2s
361
opper_memes_meme_05
100
2s
357
opper_memes_meme_06
100
2s
357
opper_memes_meme_07
100
2s
366
opper_memes_meme_08
100
2s
499
opper_memes_meme_09
100
3s
541
opper_memes_meme_10
100
3s
557
opper_memes_meme_11
0
6s
1091
opper_memes_meme_12
100
3s
444
opper_memes_meme_13
100
3s
575
opper_memes_meme_14
100
3s
535
opper_memes_meme_15
100
4s
737
opper_memes_meme_16
100
3s
593
opper_memes_meme_17
100
7s
1143
opper_memes_meme_18
100
4s
794
opper_memes_meme_19
100
4s
660
opper_memes_meme_20
100
3s
495
opper_memes_meme_21
100
3s
631
opper_memes_meme_22
100
3s
542
opper_memes_meme_23
100
3s
614
opper_memes_meme_24
0
13s
2079
opper_memes_meme_25
100
2s
512
opper_memes_meme_26
100
3s
570
opper_memes_meme_27
100
3s
497
opper_memes_meme_28
100
5s
1023
opper_memes_meme_29
100
3s
609
opper_memes_meme_30
100
4s
800
opper_memes_meme_31