Meme Understanding

Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.

100
Duration
6s
Input Tokens
296
Output Tokens
15
Cost
$0.00
Context
Input
Its 50 degrees Celsius outside and sunny. I go to the freezer and take out an ice cube. I put it in my glass of water. Then I walk to my terraze and put it on a table there. After 1 hour I drink all the water. The ice cube remains in the glass once I have drunk all the water, True or False?
Expected output
{
  "answer": false
}
Model output
{
  "answer": false
}