Context Reasoning

Context understanding and reasoning tasks test whether a model gives accurate answers grounded in the provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are scored on whether each answer is supported by the given context rather than hallucinated.

0
Duration: 6s
Input Tokens: 8269
Output Tokens: 170
Cost: $0.00
Context
Input
Which patient had a more dangerous medical timeline?
Expected output
Melisa Farrow
Model output
Based on the provided context, Oscar Jansson's medical timeline suggests a mild persistent asthma diagnosis, likely aggravated by an upper respiratory viral infection or allergic response. Melisa Farrow was diagnosed with acute community-acquired pneumonia (CAP) in the left lower lobe and pleuritic chest pain secondary to infection.
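
The grading method used for this run is not shown here. As a rough illustration only, the sketch below compares the expected output with the model output in two simple ways. The helper names (normalize, exact_match, mentions_expected) are hypothetical and are not part of any evaluation framework named in this page.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so comparisons ignore formatting."""
    return re.sub(r"\s+", " ", text.strip().lower())


def exact_match(expected: str, output: str) -> bool:
    """True only when the output is the expected answer and nothing else."""
    return normalize(output) == normalize(expected)


def mentions_expected(expected: str, output: str) -> bool:
    """True when the expected answer appears anywhere in the output."""
    return normalize(expected) in normalize(output)


# Values copied from the run shown above.
expected = "Melisa Farrow"
model_output = (
    "Based on the provided context, Oscar Jansson's medical timeline suggests "
    "a mild persistent asthma diagnosis, likely aggravated by an upper "
    "respiratory viral infection or allergic response. Melisa Farrow was "
    "diagnosed with acute community-acquired pneumonia (CAP) in the left lower "
    "lobe and pleuritic chest pain secondary to infection."
)

print("exact match:", exact_match(expected, model_output))
print("mentions expected:", mentions_expected(expected, model_output))
```

On this run the exact match fails while the substring check succeeds: the output names Melisa Farrow but never states which patient had the more dangerous timeline. Under these assumptions, a substring check alone would credit an answer that does not commit to the expected patient, so a grader for this kind of task likely needs to check which answer the model actually asserts.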