Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

77
gcp/gemini-2.0-flash-lite
Average duration
3s
Average tokens
272
Average cost
$0.00
100
3s
269
opper_memes_meme_01
100
4s
271
opper_memes_meme_02
100
4s
306
opper_memes_meme_03
100
3s
276
opper_memes_meme_04
100
4s
261
opper_memes_meme_05
100
4s
257
opper_memes_meme_06
100
3s
256
opper_memes_meme_07
100
3s
266
opper_memes_meme_08
100
3s
270
opper_memes_meme_09
100
3s
269
opper_memes_meme_10
100
4s
267
opper_memes_meme_11
0
3s
294
opper_memes_meme_12
100
3s
255
opper_memes_meme_13
100
3s
257
opper_memes_meme_14
100
3s
269
opper_memes_meme_15
100
2s
288
opper_memes_meme_16
100
3s
282
opper_memes_meme_17
0
3s
287
opper_memes_meme_18
100
3s
317
opper_memes_meme_19
0
3s
329
opper_memes_meme_20
0
3s
249
opper_memes_meme_21
100
3s
249
opper_memes_meme_22
0
2s
254
opper_memes_meme_23
100
3s
253
opper_memes_meme_24
100
3s
285
opper_memes_meme_25
100
3s
257
opper_memes_meme_26
0
3s
256
opper_memes_meme_27
100
3s
257
opper_memes_meme_28
0
3s
287
opper_memes_meme_29
100
3s
269
opper_memes_meme_30
100
3s
269
opper_memes_meme_31