Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

97
openai/o1
Average duration
12s
Average tokens
784
Average cost
$0.00
100
7s
724
opper_memes_meme_01
100
6s
599
opper_memes_meme_02
100
7s
632
opper_memes_meme_03
0
14s
1756
opper_memes_meme_04
100
7s
655
opper_memes_meme_05
100
6s
522
opper_memes_meme_06
100
6s
648
opper_memes_meme_07
100
5s
404
opper_memes_meme_08
100
7s
665
opper_memes_meme_09
100
7s
546
opper_memes_meme_10
100
7s
659
opper_memes_meme_11
100
48s
1189
opper_memes_meme_12
100
7s
647
opper_memes_meme_13
100
5s
522
opper_memes_meme_14
100
1m 10s
602
opper_memes_meme_15
100
7s
679
opper_memes_meme_16
100
8s
802
opper_memes_meme_17
100
13s
1450
opper_memes_meme_18
100
7s
643
opper_memes_meme_19
100
8s
961
opper_memes_meme_20
100
7s
706
opper_memes_meme_21
100
8s
838
opper_memes_meme_22
100
7s
711
opper_memes_meme_23
100
8s
774
opper_memes_meme_24
100
11s
1446
opper_memes_meme_25
100
7s
648
opper_memes_meme_26
100
7s
585
opper_memes_meme_27
100
7s
712
opper_memes_meme_28
100
48s
1056
opper_memes_meme_29
100
7s
596
opper_memes_meme_30
100
7s
915
opper_memes_meme_31