Context Reasoning

Context understanding and reasoning tasks test whether a model gives accurate answers that are grounded in the provided context rather than hallucinated. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications.

0
Duration: 8s
Input Tokens: 78947
Output Tokens: 129
Cost: $0.00
Context
Input
Who is the Author of 'Building a Simple GitHub PR Review Agent with ReAct' and what date was it published on?
Expected output
By Johnny Chadda - 4/8/2025
Model output
The author of 'Building a Simple GitHub PR Review Agent with ReAct' is Göran Sandahl, and it was published on April 8, 2025.
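
The comparison between the expected and model output can be illustrated with a small sketch. The benchmark's actual scoring method is not stated here, so everything below is an assumption: the function names `extract_dates` and `score_answer` are hypothetical, the expected date 4/8/2025 is read as month/day/year, and an answer is counted as correct only when both the author and the date match.

```python
import re
from datetime import date

# Month names used to parse dates written out in words.
MONTHS = {
    name: number
    for number, name in enumerate(
        ["january", "february", "march", "april", "may", "june", "july",
         "august", "september", "october", "november", "december"],
        start=1,
    )
}


def extract_dates(text: str) -> set[date]:
    """Collect dates written as M/D/YYYY or 'Month D, YYYY' (assumed formats)."""
    candidates = []
    for m, d, y in re.findall(r"\b(\d{1,2})/(\d{1,2})/(\d{4})\b", text):
        candidates.append((int(y), int(m), int(d)))
    for name, d, y in re.findall(r"\b([A-Za-z]+) (\d{1,2}), (\d{4})\b", text):
        month = MONTHS.get(name.lower())
        if month is not None:
            candidates.append((int(y), month, int(d)))

    found = set()
    for y, m, d in candidates:
        try:
            found.add(date(y, m, d))
        except ValueError:
            pass  # skip impossible dates such as 13/40/2025
    return found


def score_answer(expected_author: str, expected_date: date, model_output: str) -> dict:
    """Hypothetical scorer: 1 only if both the author and the date are present."""
    author_ok = expected_author.lower() in model_output.lower()
    date_ok = expected_date in extract_dates(model_output)
    return {"author_ok": author_ok, "date_ok": date_ok, "score": int(author_ok and date_ok)}


model_output = (
    "The author of 'Building a Simple GitHub PR Review Agent with ReAct' "
    "is Göran Sandahl, and it was published on April 8, 2025."
)
result = score_answer("Johnny Chadda", date(2025, 4, 8), model_output)
print(result)  # {'author_ok': False, 'date_ok': True, 'score': 0}
```

Under these assumptions the date matches but the author does not, which is consistent with the result of 0 shown above. A production grader would likely need fuzzier matching (name variants, more date formats) than this substring check.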