Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Model: groq/gpt-oss-20b
Score: 84
Average duration: 1s
Average tokens: 549
Average cost: $0.00

Test                 Score  Duration  Tokens
opper_memes_meme_01  100    2s        441
opper_memes_meme_02  100    1s        542
opper_memes_meme_03  100    2s        665
opper_memes_meme_04  0      2s        662
opper_memes_meme_05  100    1s        376
opper_memes_meme_06  100    2s        401
opper_memes_meme_07  100    1s        379
opper_memes_meme_08  100    1s        490
opper_memes_meme_09  100    2s        436
opper_memes_meme_10  100    1s        475
opper_memes_meme_11  100    1s        484
opper_memes_meme_12  100    1s        550
opper_memes_meme_13  100    1s        715
opper_memes_meme_14  0      1s        428
opper_memes_meme_15  100    1s        425
opper_memes_meme_16  100    1s        489
opper_memes_meme_17  100    1s        517
opper_memes_meme_18  0      1s        482
opper_memes_meme_19  0      1s        467
opper_memes_meme_20  100    1s        645
opper_memes_meme_21  100    1s        402
opper_memes_meme_22  100    1s        452
opper_memes_meme_23  100    1s        456
opper_memes_meme_24  100    1s        495
opper_memes_meme_25  0      1s        530
opper_memes_meme_26  100    2s        630
opper_memes_meme_27  100    3s        2075
opper_memes_meme_28  100    1s        419
opper_memes_meme_29  100    1s        642
opper_memes_meme_30  100    1s        398
opper_memes_meme_31  100    1s        455