SQL

The natural-language-to-SQL task evaluates how faithfully a model translates a question into a query (text-to-query fidelity) and how well it reasons about the database schema. It is particularly relevant for analytics chat assistants and simplified database interfaces, where users query data in natural language. To succeed, a model must understand both the intent behind the question and the structure of the underlying schema.
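As a rough illustration of what checking text-to-query fidelity can look like, the sketch below compares a generated query against a reference query by executing both on a small in-memory SQLite database. This is an assumption for illustration only: the page does not describe how its scores are computed, and the `orders` table, the example queries, and the `rows_match` helper are all hypothetical.

```python
import sqlite3


def rows_match(conn, generated_sql, reference_sql):
    """Return True if both queries produce the same rows, ignoring row order.

    Hypothetical helper: execution-based comparison is one common way to
    judge text-to-SQL output, not necessarily the method used on this page.
    """
    try:
        got = conn.execute(generated_sql).fetchall()
    except sqlite3.Error:
        # Invalid SQL or references to missing tables/columns count as a miss.
        return False
    expected = conn.execute(reference_sql).fetchall()
    return sorted(got, key=repr) == sorted(expected, key=repr)


# Hypothetical schema and data standing in for a real benchmark database.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, total REAL);
    INSERT INTO orders (customer, total)
    VALUES ('ada', 40.0), ('ada', 60.0), ('bob', 25.0);
    """
)

# Question: "What is the total order value per customer?"
reference = "SELECT customer, SUM(total) FROM orders GROUP BY customer"
candidate = (
    "SELECT customer, SUM(total) AS revenue FROM orders "
    "GROUP BY customer ORDER BY customer"
)
wrong_intent = "SELECT customer, COUNT(*) FROM orders GROUP BY customer"
wrong_schema = "SELECT client, SUM(total) FROM orders GROUP BY client"

print(rows_match(conn, candidate, reference))
print(rows_match(conn, wrong_intent, reference))
print(rows_match(conn, wrong_schema, reference))
```

The first candidate differs textually from the reference (alias, ordering) but returns the same rows, so it matches. The second misreads the intent and the third misreads the schema, which are the two failure modes the task description highlights.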

Model: openai/gpt-4o-mini
Score: 85
Average duration: 25s
Average tokens: 868
Average cost: $0.00
| Sample | Score | Duration | Tokens |
|---|---|---|---|
| opper_sql_sample_01 | 100 | 23s | 602 |
| opper_sql_sample_02 | 100 | 17s | 624 |
| opper_sql_sample_03 | 100 | 23s | 620 |
| opper_sql_sample_04 | 100 | 17s | 614 |
| opper_sql_sample_05 | 100 | 46s | 651 |
| opper_sql_sample_06 | 75 | 17s | 690 |
| opper_sql_sample_07 | 100 | 23s | 606 |
| opper_sql_sample_08 | 100 | 17s | 623 |
| opper_sql_sample_09 | 100 | 23s | 614 |
| opper_sql_sample_10 | 100 | 17s | 672 |
| opper_sql_sample_11 | 75 | 16s | 817 |
| opper_sql_sample_12 | 100 | 23s | 848 |
| opper_sql_sample_13 | 100 | 17s | 870 |
| opper_sql_sample_14 | 100 | 17s | 891 |
| opper_sql_sample_15 | 75 | 22s | 883 |
| opper_sql_sample_16 | 75 | 22s | 925 |
| opper_sql_sample_17 | 100 | 16s | 912 |
| opper_sql_sample_18 | 75 | 16s | 935 |
| opper_sql_sample_19 | 50 | 16s | 881 |
| opper_sql_sample_20 | 75 | 17s | 943 |
| opper_sql_sample_21 | 100 | 22s | 1020 |
| opper_sql_sample_22 | 75 | 17s | 1073 |
| opper_sql_sample_23 | 100 | 22s | 1130 |
| opper_sql_sample_24 | 50 | 22s | 1087 |
| opper_sql_sample_25 | 75 | 16s | 1117 |
| opper_sql_sample_26 | 75 | 1m 16s | 1077 |
| opper_sql_sample_27 | 50 | 17s | 1058 |
| opper_sql_sample_28 | 75 | 1m 50s | 1163 |
| opper_sql_sample_29 | 50 | 26s | 1077 |
| opper_sql_sample_30 | 100 | 16s | 1022 |
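The summary figures for openai/gpt-4o-mini appear to be plain averages of the per-sample results. The sketch below recomputes them from the 30 rows listed above; the tuple layout and the `parse_duration` helper are assumptions made for this illustration, and rounding to whole numbers is assumed to mirror how the page displays its values.

```python
from statistics import mean


def parse_duration(text):
    """Convert a duration such as '23s' or '1m 16s' into seconds."""
    seconds = 0
    for part in text.split():
        if part.endswith("m"):
            seconds += 60 * int(part[:-1])
        elif part.endswith("s"):
            seconds += int(part[:-1])
        else:
            raise ValueError(f"unrecognised duration part: {part!r}")
    return seconds


# (score, duration, tokens) for opper_sql_sample_01 through _30, in order.
RESULTS = [
    (100, "23s", 602), (100, "17s", 624), (100, "23s", 620),
    (100, "17s", 614), (100, "46s", 651), (75, "17s", 690),
    (100, "23s", 606), (100, "17s", 623), (100, "23s", 614),
    (100, "17s", 672), (75, "16s", 817), (100, "23s", 848),
    (100, "17s", 870), (100, "17s", 891), (75, "22s", 883),
    (75, "22s", 925), (100, "16s", 912), (75, "16s", 935),
    (50, "16s", 881), (75, "17s", 943), (100, "22s", 1020),
    (75, "17s", 1073), (100, "22s", 1130), (50, "22s", 1087),
    (75, "16s", 1117), (75, "1m 16s", 1077), (50, "17s", 1058),
    (75, "1m 50s", 1163), (50, "26s", 1077), (100, "16s", 1022),
]

avg_score = round(mean(score for score, _, _ in RESULTS))
avg_duration = round(mean(parse_duration(d) for _, d, _ in RESULTS))
avg_tokens = round(mean(tokens for _, _, tokens in RESULTS))

print(f"score={avg_score} duration={avg_duration}s tokens={avg_tokens}")
```

Rounded to whole numbers, the recomputed averages agree with the reported score of 85, average duration of 25s, and average of 868 tokens.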