Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

68
gcp/gemini-flash-lite-latest
Average duration
2s
Average tokens
372
Average cost
$0.00
100
3s
369
opper_memes_meme_01
100
3s
372
opper_memes_meme_02
100
2s
406
opper_memes_meme_03
0
2s
375
opper_memes_meme_04
100
2s
361
opper_memes_meme_05
100
2s
357
opper_memes_meme_06
100
2s
357
opper_memes_meme_07
100
2s
366
opper_memes_meme_08
100
2s
370
opper_memes_meme_09
100
2s
369
opper_memes_meme_10
100
2s
367
opper_memes_meme_11
0
2s
393
opper_memes_meme_12
100
2s
355
opper_memes_meme_13
100
2s
356
opper_memes_meme_14
100
2s
376
opper_memes_meme_15
100
2s
389
opper_memes_meme_16
100
2s
384
opper_memes_meme_17
0
2s
388
opper_memes_meme_18
100
2s
416
opper_memes_meme_19
0
2s
417
opper_memes_meme_20
0
2s
349
opper_memes_meme_21
100
3s
349
opper_memes_meme_22
0
2s
354
opper_memes_meme_23
0
2s
353
opper_memes_meme_24
0
2s
386
opper_memes_meme_25
100
2s
357
opper_memes_meme_26
0
2s
357
opper_memes_meme_27
100
2s
356
opper_memes_meme_28
0
2s
388
opper_memes_meme_29
100
2s
369
opper_memes_meme_30
100
2s
367
opper_memes_meme_31