Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Model: openai/gpt-5-mini
Score: 97
Average duration: 11s
Average tokens: 506
Average cost: $0.00
Test                   Score  Duration  Tokens
opper_memes_meme_01    100    5s        456
opper_memes_meme_02    100    8s        468
opper_memes_meme_03    100    7s        564
opper_memes_meme_04    100    17s       663
opper_memes_meme_05    100    7s        320
opper_memes_meme_06    100    4s        325
opper_memes_meme_07    100    6s        324
opper_memes_meme_08    100    7s        336
opper_memes_meme_09    100    7s        405
opper_memes_meme_10    100    8s        465
opper_memes_meme_11    100    8s        399
opper_memes_meme_12    0      12s       801
opper_memes_meme_13    100    9s        451
opper_memes_meme_14    100    10s       453
opper_memes_meme_15    100    9s        405
opper_memes_meme_16    100    5s        410
opper_memes_meme_17    100    9s        415
opper_memes_meme_18    100    1m 23s    933
opper_memes_meme_19    100    11s       703
opper_memes_meme_20    100    9s        891
opper_memes_meme_21    100    7s        371
opper_memes_meme_22    100    7s        371
opper_memes_meme_23    100    6s        449
opper_memes_meme_24    100    7s        577
opper_memes_meme_25    100    17s       995
opper_memes_meme_26    100    7s        452
opper_memes_meme_27    100    7s        389
opper_memes_meme_28    100    5s        388
opper_memes_meme_29    100    9s        732
opper_memes_meme_30    100    5s        391
opper_memes_meme_31    100    5s        398
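
The headline numbers can be cross-checked against the per-test results. The sketch below assumes the overall score and the average token count are unweighted means over the 31 tests, rounded to the nearest integer; the page does not state how they are aggregated, so treat this as a plausibility check rather than the dashboard's actual method. The names `TOKENS`, `SCORES`, and `summarize` are hypothetical and used only for illustration.

```python
from statistics import mean

# Token counts for opper_memes_meme_01 .. opper_memes_meme_31,
# copied in order from the per-test results.
TOKENS = [
    456, 468, 564, 663, 320, 325, 324, 336, 405, 465,
    399, 801, 451, 453, 405, 410, 415, 933, 703, 891,
    371, 371, 449, 577, 995, 452, 389, 388, 732, 391,
    398,
]

# Every test scored 100 except opper_memes_meme_12, which scored 0.
SCORES = [0 if test_number == 12 else 100 for test_number in range(1, 32)]


def summarize(scores, tokens):
    """Return rounded unweighted means (an assumed aggregation rule)."""
    return {
        "score": round(mean(scores)),
        "avg_tokens": round(mean(tokens)),
    }


summary = summarize(SCORES, TOKENS)
print(summary)  # {'score': 97, 'avg_tokens': 506}
```

Under that assumption, 30 passes out of 31 tests give 96.8, which rounds to the reported 97, and the token counts average to 506. The displayed durations are already rounded per test, so the reported 11s average cannot be reproduced exactly from them.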