Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Score: 90
Model: groq/gpt-oss-120b
Average duration: 2s
Average tokens: 508
Average cost: $0.00

Task                  Score  Duration  Tokens
opper_memes_meme_01   100    2s        439
opper_memes_meme_02   100    2s        565
opper_memes_meme_03   100    1s        583
opper_memes_meme_04   100    2s        536
opper_memes_meme_05   100    1s        400
opper_memes_meme_06   100    1s        395
opper_memes_meme_07   100    2s        393
opper_memes_meme_08   100    1s        419
opper_memes_meme_09   100    2s        469
opper_memes_meme_10   100    1s        438
opper_memes_meme_11   100    1s        452
opper_memes_meme_12   100    1s        594
opper_memes_meme_13   100    1s        501
opper_memes_meme_14   0      2s        617
opper_memes_meme_15   100    1s        409
opper_memes_meme_16   100    1s        503
opper_memes_meme_17   100    2s        457
opper_memes_meme_18   0      1s        483
opper_memes_meme_19   100    2s        610
opper_memes_meme_20   100    2s        649
opper_memes_meme_21   100    1s        440
opper_memes_meme_22   100    1s        439
opper_memes_meme_23   100    2s        516
opper_memes_meme_24   100    2s        560
opper_memes_meme_25   0      2s        580
opper_memes_meme_26   100    3s        775
opper_memes_meme_27   100    2s        539
opper_memes_meme_28   100    2s        411
opper_memes_meme_29   100    5s        705
opper_memes_meme_30   100    1s        421
opper_memes_meme_31   100    2s        451
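
The headline figures can be cross-checked against the per-task values listed above. The sketch below assumes the overall score and average tokens are unweighted means over the 31 tasks, rounded to the nearest integer. The page does not state how they are aggregated, so treat this as a plausibility check rather than the benchmark's actual method. The variable names are illustrative only.

```python
# Per-task (score, tokens) pairs transcribed from the results above,
# in order from opper_memes_meme_01 to opper_memes_meme_31.
results = [
    (100, 439), (100, 565), (100, 583), (100, 536), (100, 400),
    (100, 395), (100, 393), (100, 419), (100, 469), (100, 438),
    (100, 452), (100, 594), (100, 501), (0, 617), (100, 409),
    (100, 503), (100, 457), (0, 483), (100, 610), (100, 649),
    (100, 440), (100, 439), (100, 516), (100, 560), (0, 580),
    (100, 775), (100, 539), (100, 411), (100, 705), (100, 421),
    (100, 451),
]

scores = [score for score, _ in results]
tokens = [count for _, count in results]

# Assumption: headline numbers are plain arithmetic means over all tasks.
mean_score = sum(scores) / len(scores)
mean_tokens = sum(tokens) / len(tokens)
passed = sum(1 for score in scores if score == 100)

print(f"tasks: {len(results)}, passed: {passed}")
print(f"mean score: {mean_score:.2f}")
print(f"mean tokens: {mean_tokens:.2f}")
```

Under that assumption, 28 of 31 tasks score 100, which gives a mean of about 90.3 and a mean token count of about 508.0. Both agree with the reported 90 and 508.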