Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Model: mistral/mistral-large-eu
Score: 81
Average duration: 2s
Average tokens: 272
Average cost: $0.00
| Test | Score | Duration | Tokens |
|---|---|---|---|
| opper_memes_meme_01 | 100 | 2s | 272 |
| opper_memes_meme_02 | 100 | 2s | 278 |
| opper_memes_meme_03 | 100 | 2s | 309 |
| opper_memes_meme_04 | 0 | 2s | 282 |
| opper_memes_meme_05 | 100 | 2s | 261 |
| opper_memes_meme_06 | 100 | 2s | 259 |
| opper_memes_meme_07 | 100 | 2s | 257 |
| opper_memes_meme_08 | 100 | 2s | 265 |
| opper_memes_meme_09 | 100 | 1s | 270 |
| opper_memes_meme_10 | 100 | 1s | 267 |
| opper_memes_meme_11 | 100 | 1s | 265 |
| opper_memes_meme_12 | 0 | 2s | 293 |
| opper_memes_meme_13 | 100 | 2s | 256 |
| opper_memes_meme_14 | 0 | 2s | 255 |
| opper_memes_meme_15 | 100 | 2s | 274 |
| opper_memes_meme_16 | 100 | 1s | 286 |
| opper_memes_meme_17 | 100 | 2s | 283 |
| opper_memes_meme_18 | 0 | 3s | 287 |
| opper_memes_meme_19 | 100 | 2s | 316 |
| opper_memes_meme_20 | 100 | 3s | 314 |
| opper_memes_meme_21 | 100 | 2s | 250 |
| opper_memes_meme_22 | 100 | 2s | 250 |
| opper_memes_meme_23 | 100 | 1s | 254 |
| opper_memes_meme_24 | 100 | 2s | 255 |
| opper_memes_meme_25 | 100 | 2s | 287 |
| opper_memes_meme_26 | 100 | 2s | 257 |
| opper_memes_meme_27 | 0 | 1s | 256 |
| opper_memes_meme_28 | 100 | 2s | 255 |
| opper_memes_meme_29 | 0 | 4s | 284 |
| opper_memes_meme_30 | 100 | 1s | 267 |
| opper_memes_meme_31 | 100 | 2s | 265 |
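
The page does not say how the headline numbers are aggregated. The sketch below assumes the overall score and the average token count are plain unweighted means over the 31 per-test results, rounded to the nearest integer. The variable names are illustrative and not part of any benchmark tooling.

```python
from statistics import mean

# (score, tokens) per test, copied from the results above,
# in order opper_memes_meme_01 .. opper_memes_meme_31.
results = [
    (100, 272), (100, 278), (100, 309), (0, 282), (100, 261),
    (100, 259), (100, 257), (100, 265), (100, 270), (100, 267),
    (100, 265), (0, 293), (100, 256), (0, 255), (100, 274),
    (100, 286), (100, 283), (0, 287), (100, 316), (100, 314),
    (100, 250), (100, 250), (100, 254), (100, 255), (100, 287),
    (100, 257), (0, 256), (100, 255), (0, 284), (100, 267),
    (100, 265),
]

scores = [score for score, _ in results]
tokens = [tok for _, tok in results]

# Assumption: headline figures are unweighted means, rounded to integers.
overall_score = round(mean(scores))
avg_tokens = round(mean(tokens))

# Test IDs that scored 0, rebuilt from their 1-based position in the list.
failed = [
    f"opper_memes_meme_{i:02d}"
    for i, (score, _) in enumerate(results, start=1)
    if score == 0
]

print(f"overall score: {overall_score}")
print(f"average tokens: {avg_tokens}")
print(f"failed ({len(failed)}/{len(results)}): {', '.join(failed)}")
```

Under that assumption, 25 of 31 tests pass, and the computed means round to the reported score of 81 and the reported average of 272 tokens.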