Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

97
openai/gpt-5
Average duration
11s
Average tokens
571
Average cost
$0.00
100
7s
520
opper_memes_meme_01
100
8s
595
opper_memes_meme_02
100
6s
564
opper_memes_meme_03
100
15s
1111
opper_memes_meme_04
100
5s
393
opper_memes_meme_05
100
8s
389
opper_memes_meme_06
100
9s
443
opper_memes_meme_07
100
5s
391
opper_memes_meme_08
100
11s
341
opper_memes_meme_09
100
9s
401
opper_memes_meme_10
100
12s
463
opper_memes_meme_11
0
10s
993
opper_memes_meme_12
100
46s
707
opper_memes_meme_13
100
8s
452
opper_memes_meme_14
100
12s
469
opper_memes_meme_15
100
12s
355
opper_memes_meme_16
100
12s
416
opper_memes_meme_17
100
15s
804
opper_memes_meme_18
100
7s
703
opper_memes_meme_19
100
12s
637
opper_memes_meme_20
100
8s
444
opper_memes_meme_21
100
12s
508
opper_memes_meme_22
100
8s
513
opper_memes_meme_23
100
12s
513
opper_memes_meme_24
100
12s
793
opper_memes_meme_25
100
10s
644
opper_memes_meme_26
100
13s
837
opper_memes_meme_27
100
6s
452
opper_memes_meme_28
100
11s
924
opper_memes_meme_29
100
5s
400
opper_memes_meme_30
100
6s
526
opper_memes_meme_31