Context Reasoning

Context understanding and reasoning tasks test accurate answers grounded in provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are evaluated on their ability to provide accurate answers that are properly grounded in the given context rather than hallucinating information.

Duration

42s

Input Tokens

3334

Output Tokens

Cost

$0.00

Context

Input

Who owned a machete?

Expected output

Joaquín

Model output

We cannot definitively answer who owned a machete.