Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

84
anthropic/claude-opus-4.1
Average duration
5s
Average tokens
314
Average cost
$0.00
100
4s
312
opper_memes_meme_01
100
4s
315
opper_memes_meme_02
100
5s
350
opper_memes_meme_03
100
6s
321
opper_memes_meme_04
100
6s
308
opper_memes_meme_05
100
6s
300
opper_memes_meme_06
100
5s
298
opper_memes_meme_07
100
5s
309
opper_memes_meme_08
100
5s
313
opper_memes_meme_09
100
5s
312
opper_memes_meme_10
100
5s
309
opper_memes_meme_11
0
6s
334
opper_memes_meme_12
100
5s
298
opper_memes_meme_13
100
5s
296
opper_memes_meme_14
100
6s
316
opper_memes_meme_15
100
5s
330
opper_memes_meme_16
100
5s
329
opper_memes_meme_17
100
4s
328
opper_memes_meme_18
100
4s
358
opper_memes_meme_19
100
5s
363
opper_memes_meme_20
100
5s
288
opper_memes_meme_21
0
5s
289
opper_memes_meme_22
100
5s
293
opper_memes_meme_23
0
5s
293
opper_memes_meme_24
100
5s
329
opper_memes_meme_25
100
6s
297
opper_memes_meme_26
0
6s
297
opper_memes_meme_27
100
6s
296
opper_memes_meme_28
0
6s
327
opper_memes_meme_29
100
6s
315
opper_memes_meme_30
100
4s
313
opper_memes_meme_31