Context Reasoning

Context understanding and reasoning tasks test accurate answers grounded in provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are evaluated on their ability to provide accurate answers that are properly grounded in the given context rather than hallucinating information.

0
Duration
4s
Input Tokens
3517
Output Tokens
470
Cost
$0.00
Context
Input
What was the final diagnosis given to Oscar?
Expected output
Mild persistent asthma, with a secondary diagnosis of suspected viral upper respiratory infection
Model output
Oscar's final diagnosis was Mild Persistent Asthma (J45.30, ICD-10) with a secondary diagnosis of suspected Viral Upper Respiratory Infection (J06.9, ICD-10).