Context Reasoning

Context understanding and reasoning tasks test accurate answers grounded in provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are evaluated on their ability to provide accurate answers that are properly grounded in the given context rather than hallucinating information.

0
Duration
23s
Input Tokens
4782
Output Tokens
141
Cost
$0.00
Context
Input
What was Oscars response to the bronchodilator during testing?
Expected output
15% increase in FEV1, which was a positive response
Model output
Oscar Jansson was diagnosed with Mild Persistent Asthma.