Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

87
openai/o3-mini
Average duration
9s
Average tokens
545
Average cost
$0.00
100
5s
403
opper_memes_meme_01
0
8s
406
opper_memes_meme_02
100
15s
504
opper_memes_meme_03
100
5s
411
opper_memes_meme_04
100
8s
333
opper_memes_meme_05
100
6s
393
opper_memes_meme_06
100
8s
321
opper_memes_meme_07
100
8s
404
opper_memes_meme_08
100
6s
338
opper_memes_meme_09
100
6s
469
opper_memes_meme_10
100
8s
403
opper_memes_meme_11
0
8s
677
opper_memes_meme_12
100
8s
651
opper_memes_meme_13
100
6s
456
opper_memes_meme_14
100
8s
476
opper_memes_meme_15
100
8s
423
opper_memes_meme_16
100
8s
418
opper_memes_meme_17
0
8s
806
opper_memes_meme_18
100
8s
516
opper_memes_meme_19
100
9s
1026
opper_memes_meme_20
100
8s
448
opper_memes_meme_21
100
8s
448
opper_memes_meme_22
100
6s
581
opper_memes_meme_23
100
6s
389
opper_memes_meme_24
0
14s
1636
opper_memes_meme_25
100
8s
716
opper_memes_meme_26
100
8s
649
opper_memes_meme_27
100
16s
392
opper_memes_meme_28
100
8s
672
opper_memes_meme_29
100
8s
532
opper_memes_meme_30
100
35s
595
opper_memes_meme_31