Duration: 24s
Input Tokens: 3113
Output Tokens: 217
Cost: $0.00
Context
Input: Which specific family member had a history of asthma?
Expected output: Oscar's mother
Model output: His mother had a history of asthma.
Context understanding and reasoning tasks test whether a model gives accurate answers grounded in the provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are evaluated on whether their answers are supported by the given context rather than hallucinated.
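The page does not say how a model output is scored against the expected output. As a rough illustration only, the sketch below compares the two with a token-recall check, which tolerates paraphrases such as "His mother" for "Oscar's mother" where an exact string match would fail. The function names `tokens` and `token_recall` are hypothetical and are not part of any benchmark tooling.

```python
import re


def tokens(text: str) -> set:
    """Lowercase, drop possessive 's, and return the set of word tokens."""
    text = re.sub(r"'s\b", "", text.lower())
    return set(re.findall(r"[a-z0-9]+", text))


def token_recall(expected: str, output: str) -> float:
    """Fraction of expected-answer tokens that also appear in the model output.

    Assumption: this is an illustrative metric, not the benchmark's grader.
    """
    expected_tokens = tokens(expected)
    if not expected_tokens:
        return 0.0
    return len(expected_tokens & tokens(output)) / len(expected_tokens)


expected = "Oscar's mother"
output = "His mother had a history of asthma."
print(token_recall(expected, output))  # 0.5: "mother" matches, "oscar" does not
```

A score of 0.5 here shows the limit of lexical matching: the output resolves "Oscar" to a pronoun, so deciding that the answer is correct needs either the source context or a semantic judge.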