Context Reasoning

Context understanding and reasoning tasks test accurate answers grounded in provided context. This capability is essential for knowledge-base support bots, policy lookup systems, and internal knowledge Q&A applications. Models are evaluated on their ability to provide accurate answers that are properly grounded in the given context rather than hallucinating information.

87
openai/gpt-5
Average duration
0
Average tokens
0
Average cost
$0
0
null
0
opper_context_sample_01
50
null
0
opper_context_sample_02
100
null
0
opper_context_sample_03
50
null
0
opper_context_sample_04
100
null
0
opper_context_sample_05
100
null
0
opper_context_sample_06
50
null
0
opper_context_sample_07
100
null
0
opper_context_sample_08
100
null
0
opper_context_sample_09
100
null
0
opper_context_sample_10
100
null
0
opper_context_sample_11
100
null
0
opper_context_sample_12
100
null
0
opper_context_sample_13
100
null
0
opper_context_sample_14
100
null
0
opper_context_sample_15
100
null
0
opper_context_sample_16
100
null
0
opper_context_sample_17
100
null
0
opper_context_sample_18
100
null
0
opper_context_sample_19
100
null
0
opper_context_sample_20
100
null
0
opper_context_sample_21
50
null
0
opper_context_sample_22
50
null
0
opper_context_sample_23
50
null
0
opper_context_sample_24
100
null
0
opper_context_sample_25
100
null
0
opper_context_sample_26
100
null
0
opper_context_sample_27
100
null
0
opper_context_sample_28
100
null
0
opper_context_sample_29
100
null
0
opper_context_sample_30