Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Model: openai/gpt-5-nano
Overall score: 87
Average duration: 10s
Average tokens: 1275
Average cost: $0.00
Test                   Score  Duration  Tokens
opper_memes_meme_01    100    5s        719
opper_memes_meme_02    100    12s       1747
opper_memes_meme_03    100    8s        820
opper_memes_meme_04    0      9s        1432
opper_memes_meme_05    100    7s        642
opper_memes_meme_06    100    26s       709
opper_memes_meme_07    100    7s        1156
opper_memes_meme_08    100    5s        648
opper_memes_meme_09    100    5s        598
opper_memes_meme_10    100    6s        650
opper_memes_meme_11    100    6s        655
opper_memes_meme_12    0      8s        1313
opper_memes_meme_13    100    13s       2173
opper_memes_meme_14    100    6s        837
opper_memes_meme_15    100    7s        1040
opper_memes_meme_16    100    6s        803
opper_memes_meme_17    100    9s        928
opper_memes_meme_18    100    15s       2724
opper_memes_meme_19    100    26s       1087
opper_memes_meme_20    0      8s        1466
opper_memes_meme_21    100    9s        1013
opper_memes_meme_22    100    7s        700
opper_memes_meme_23    100    6s        701
opper_memes_meme_24    100    8s        1082
opper_memes_meme_25    0      15s       2274
opper_memes_meme_26    100    23s       4781
opper_memes_meme_27    100    12s       2226
opper_memes_meme_28    100    5s        1028
opper_memes_meme_29    100    10s       1884
opper_memes_meme_30    100    4s        528
opper_memes_meme_31    100    7s        1166
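
The summary figures appear to be plain averages over the 31 per-test results. The page does not state how they are computed, so the sketch below assumes an unweighted mean; the names `RESULTS` and `summarize` are illustrative and not part of any Opper API. Per-test durations are shown rounded to whole seconds, so the recomputed average duration is only approximate.

```python
from statistics import mean

# (test number, score, duration in seconds, tokens), copied from the per-test results.
RESULTS = [
    (1, 100, 5, 719), (2, 100, 12, 1747), (3, 100, 8, 820), (4, 0, 9, 1432),
    (5, 100, 7, 642), (6, 100, 26, 709), (7, 100, 7, 1156), (8, 100, 5, 648),
    (9, 100, 5, 598), (10, 100, 6, 650), (11, 100, 6, 655), (12, 0, 8, 1313),
    (13, 100, 13, 2173), (14, 100, 6, 837), (15, 100, 7, 1040), (16, 100, 6, 803),
    (17, 100, 9, 928), (18, 100, 15, 2724), (19, 100, 26, 1087), (20, 0, 8, 1466),
    (21, 100, 9, 1013), (22, 100, 7, 700), (23, 100, 6, 701), (24, 100, 8, 1082),
    (25, 0, 15, 2274), (26, 100, 23, 4781), (27, 100, 12, 2226), (28, 100, 5, 1028),
    (29, 100, 10, 1884), (30, 100, 4, 528), (31, 100, 7, 1166),
]


def summarize(results):
    """Return (mean score, mean duration, mean tokens), assuming unweighted averages."""
    scores = [score for _, score, _, _ in results]
    durations = [seconds for _, _, seconds, _ in results]
    tokens = [count for _, _, _, count in results]
    return mean(scores), mean(durations), mean(tokens)


avg_score, avg_duration, avg_tokens = summarize(RESULTS)

# Tests scored 0 are the ones the model got wrong.
failed = [f"opper_memes_meme_{n:02d}" for n, score, _, _ in RESULTS if score == 0]

print(f"score={avg_score:.1f} duration={avg_duration:.1f}s tokens={avg_tokens:.1f}")
print("failed:", ", ".join(failed))
```

Under that assumption, 27 of the 31 tests score 100 and four score 0, and the rounded averages agree with the summary values of 87, 10s, and 1275.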