Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

74
anthropic/claude-sonnet-4.5
Average duration
4s
Average tokens
392
Average cost
$0.00
0
8s
419
opper_memes_meme_01
0
4s
406
opper_memes_meme_02
100
4s
422
opper_memes_meme_03
0
4s
427
opper_memes_meme_04
100
6s
386
opper_memes_meme_05
0
3s
393
opper_memes_meme_06
0
4s
391
opper_memes_meme_07
100
3s
381
opper_memes_meme_08
100
3s
386
opper_memes_meme_09
100
3s
389
opper_memes_meme_10
100
3s
381
opper_memes_meme_11
0
4s
404
opper_memes_meme_12
100
4s
376
opper_memes_meme_13
100
4s
370
opper_memes_meme_14
100
3s
399
opper_memes_meme_15
100
3s
403
opper_memes_meme_16
100
5s
402
opper_memes_meme_17
100
3s
400
opper_memes_meme_18
100
4s
431
opper_memes_meme_19
0
5s
436
opper_memes_meme_20
100
3s
361
opper_memes_meme_21
100
3s
362
opper_memes_meme_22
100
3s
366
opper_memes_meme_23
100
3s
366
opper_memes_meme_24
100
5s
402
opper_memes_meme_25
100
4s
370
opper_memes_meme_26
100
3s
375
opper_memes_meme_27
100
3s
374
opper_memes_meme_28
0
3s
399
opper_memes_meme_29
100
3s
388
opper_memes_meme_30
100
3s
391
opper_memes_meme_31