Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

65
anthropic/claude-3.5-haiku
Average duration
3s
Average tokens
427
Average cost
$0.00
100
2s
311
opper_memes_meme_01
100
2s
314
opper_memes_meme_02
100
2s
349
opper_memes_meme_03
100
2s
320
opper_memes_meme_04
100
2s
307
opper_memes_meme_05
100
2s
299
opper_memes_meme_06
100
2s
297
opper_memes_meme_07
100
4s
648
opper_memes_meme_08
100
4s
625
opper_memes_meme_09
100
4s
635
opper_memes_meme_10
100
4s
671
opper_memes_meme_11
0
2s
331
opper_memes_meme_12
100
2s
297
opper_memes_meme_13
0
2s
296
opper_memes_meme_14
100
2s
327
opper_memes_meme_15
100
5s
671
opper_memes_meme_16
100
6s
690
opper_memes_meme_17
0
5s
666
opper_memes_meme_18
100
5s
735
opper_memes_meme_19
100
3s
360
opper_memes_meme_20
0
2s
287
opper_memes_meme_21
0
2s
288
opper_memes_meme_22
0
4s
599
opper_memes_meme_23
0
2s
292
opper_memes_meme_24
0
2s
326
opper_memes_meme_25
100
2s
296
opper_memes_meme_26
0
2s
297
opper_memes_meme_27
100
2s
295
opper_memes_meme_28
0
5s
731
opper_memes_meme_29
100
2s
314
opper_memes_meme_30
0
3s
376
opper_memes_meme_31