SQL

Natural language to SQL query generation evaluates text-to-query fidelity and schema reasoning. This task is particularly relevant for analytics chat assistants and simplified database interfaces where users need to query data using natural language. Models must understand both the intent behind the question and the structure of the underlying database schema.

93
openai/gpt-5-nano
Average duration
40s
Average tokens
2411
Average cost
$0.00
100
58s
1389
opper_sql_sample_01
100
58s
1275
opper_sql_sample_02
100
39s
1917
opper_sql_sample_03
100
30s
1588
opper_sql_sample_04
75
30s
1893
opper_sql_sample_05
100
58s
1716
opper_sql_sample_06
100
58s
1191
opper_sql_sample_07
100
39s
1459
opper_sql_sample_08
100
30s
1548
opper_sql_sample_09
100
39s
2139
opper_sql_sample_10
100
58s
1476
opper_sql_sample_11
100
30s
2223
opper_sql_sample_12
100
34s
2749
opper_sql_sample_13
100
30s
1779
opper_sql_sample_14
100
58s
2314
opper_sql_sample_15
100
39s
2797
opper_sql_sample_16
100
39s
2500
opper_sql_sample_17
100
39s
2985
opper_sql_sample_18
50
39s
3378
opper_sql_sample_19
100
39s
2720
opper_sql_sample_20
100
58s
2328
opper_sql_sample_21
75
39s
4306
opper_sql_sample_22
75
58s
2751
opper_sql_sample_23
50
30s
3942
opper_sql_sample_24
100
30s
3171
opper_sql_sample_25
100
20s
3057
opper_sql_sample_26
100
20s
3066
opper_sql_sample_27
75
20s
2376
opper_sql_sample_28
100
47s
3251
opper_sql_sample_29
100
22s
3045
opper_sql_sample_30