Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

Model: xai/grok-4
Score: 90
Average duration: 22s
Average tokens: 258
Average cost: $0.00
Task                  Score  Duration  Tokens
opper_memes_meme_01   100    5s        253
opper_memes_meme_02   0      1m 30s    254
opper_memes_meme_03   100    7s        296
opper_memes_meme_04   100    5s        259
opper_memes_meme_05   100    6s        250
opper_memes_meme_06   100    7s        253
opper_memes_meme_07   100    4s        243
opper_memes_meme_08   100    5s        251
opper_memes_meme_09   100    5s        256
opper_memes_meme_10   100    12s       261
opper_memes_meme_11   100    10s       250
opper_memes_meme_12   100    46s       272
opper_memes_meme_13   100    7s        242
opper_memes_meme_14   100    6s        243
opper_memes_meme_15   100    8s        264
opper_memes_meme_16   100    5s        283
opper_memes_meme_17   100    10s       269
opper_memes_meme_18   0      1m 37s    273
opper_memes_meme_19   100    26s       303
opper_memes_meme_20   100    12s       287
opper_memes_meme_21   100    8s        234
opper_memes_meme_22   100    6s        234
opper_memes_meme_23   100    7s        238
opper_memes_meme_24   100    7s        239
opper_memes_meme_25   0      2m 15s    273
opper_memes_meme_26   100    8s        253
opper_memes_meme_27   100    8s        244
opper_memes_meme_28   100    6s        252
opper_memes_meme_29   100    2m 2s     269
opper_memes_meme_30   100    6s        262
opper_memes_meme_31   100    5s        253
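
The summary figures at the top can be cross-checked against the per-task results. The following is a minimal sketch, assuming the reported score and averages are plain arithmetic means over the 31 tasks, rounded to the nearest integer; the page does not state how they are aggregated, and the names `ROWS` and `summarize` are hypothetical.

```python
from statistics import mean

# (task number, score, duration in seconds, tokens), transcribed from the
# per-task results above. Durations such as "1m 30s" are converted to seconds.
ROWS = [
    (1, 100, 5, 253), (2, 0, 90, 254), (3, 100, 7, 296), (4, 100, 5, 259),
    (5, 100, 6, 250), (6, 100, 7, 253), (7, 100, 4, 243), (8, 100, 5, 251),
    (9, 100, 5, 256), (10, 100, 12, 261), (11, 100, 10, 250), (12, 100, 46, 272),
    (13, 100, 7, 242), (14, 100, 6, 243), (15, 100, 8, 264), (16, 100, 5, 283),
    (17, 100, 10, 269), (18, 0, 97, 273), (19, 100, 26, 303), (20, 100, 12, 287),
    (21, 100, 8, 234), (22, 100, 6, 234), (23, 100, 7, 238), (24, 100, 7, 239),
    (25, 0, 135, 273), (26, 100, 8, 253), (27, 100, 8, 244), (28, 100, 6, 252),
    (29, 100, 122, 269), (30, 100, 6, 262), (31, 100, 5, 253),
]


def summarize(rows):
    """Aggregate per-task results, assuming unweighted means (an assumption)."""
    return {
        "score": round(mean(score for _, score, _, _ in rows)),
        "avg_duration_s": round(mean(dur for _, _, dur, _ in rows)),
        "avg_tokens": round(mean(tok for _, _, _, tok in rows)),
        # Tasks that scored 0, formatted with the task IDs used on the page.
        "failed": [f"opper_memes_meme_{n:02d}" for n, score, _, _ in rows if score == 0],
    }


summary = summarize(ROWS)
print(summary)
```

Under that assumption the sketch reproduces the reported summary: 28 of 31 tasks scored 100, which rounds to 90, with a mean duration of 22s and a mean of 258 tokens. The three failed tasks are `opper_memes_meme_02`, `opper_memes_meme_18`, and `opper_memes_meme_25`.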