Duration
3s
Input Tokens
241
Output Tokens
25
Cost
$0.00
Context
Input
Answer with YES only if the sentence contains no negation; otherwise NO. Sentence: This is not a cat.
Expected output
{
"answer": "NO"
}
Model output
{
"answer": "NO"
}
Evaluates a model’s ability to interpret culture-dependent, tricky, and humor-driven content that feels obvious to humans but is hard for AI.
Answer with YES only if the sentence contains no negation; otherwise NO. Sentence: This is not a cat.
{
"answer": "NO"
}
{
"answer": "NO"
}