Input Tokens
0
Output Tokens
0
Duration
0
Cost
0
Context
Input
Who owned a machete?
Expected output
Joaquín
Model output
Joaquín owned the machete.
Context understanding and reasoning tasks test accurate answers grounded in provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are evaluated on their ability to provide accurate answers that are properly grounded in the given context rather than hallucinating information.
Who owned a machete?
Joaquín
Joaquín owned the machete.