Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

87
openai/o1-mini
Average duration
10s
Average tokens
790
Average cost
$0.00
100
6s
937
opper_memes_meme_01
100
8s
1261
opper_memes_meme_02
100
6s
654
opper_memes_meme_03
0
6s
815
opper_memes_meme_04
100
5s
546
opper_memes_meme_05
100
6s
606
opper_memes_meme_06
100
5s
733
opper_memes_meme_07
100
5s
618
opper_memes_meme_08
100
4s
494
opper_memes_meme_09
100
6s
811
opper_memes_meme_10
100
8s
1001
opper_memes_meme_11
100
6s
955
opper_memes_meme_12
100
4s
412
opper_memes_meme_13
100
5s
733
opper_memes_meme_14
100
7s
819
opper_memes_meme_15
100
4s
573
opper_memes_meme_16
100
6s
761
opper_memes_meme_17
0
8s
1468
opper_memes_meme_18
100
6s
793
opper_memes_meme_19
100
33s
657
opper_memes_meme_20
100
6s
726
opper_memes_meme_21
100
5s
598
opper_memes_meme_22
0
42s
731
opper_memes_meme_23
100
7s
795
opper_memes_meme_24
0
10s
1660
opper_memes_meme_25
100
7s
797
opper_memes_meme_26
100
5s
611
opper_memes_meme_27
100
19s
477
opper_memes_meme_28
100
7s
1142
opper_memes_meme_29
100
16s
682
opper_memes_meme_30
100
40s
616
opper_memes_meme_31