SQL

Natural language to SQL query generation evaluates text-to-query fidelity and schema reasoning. This task is particularly relevant for analytics chat assistants and simplified database interfaces where users need to query data using natural language. Models must understand both the intent behind the question and the structure of the underlying database schema.

96
gcp/gemini-flash-latest
Average duration
5s
Average tokens
1575
Average cost
$0.00
100
3s
872
opper_sql_sample_01
100
3s
1011
opper_sql_sample_02
100
2s
827
opper_sql_sample_03
100
3s
969
opper_sql_sample_04
100
4s
1203
opper_sql_sample_05
100
4s
1200
opper_sql_sample_06
100
2s
867
opper_sql_sample_07
100
3s
918
opper_sql_sample_08
100
4s
1144
opper_sql_sample_09
100
4s
1176
opper_sql_sample_10
100
3s
1134
opper_sql_sample_11
100
4s
1454
opper_sql_sample_12
100
4s
1461
opper_sql_sample_13
100
4s
1409
opper_sql_sample_14
100
4s
1529
opper_sql_sample_15
100
7s
2013
opper_sql_sample_16
100
4s
1419
opper_sql_sample_17
75
8s
2268
opper_sql_sample_18
50
4s
1553
opper_sql_sample_19
100
7s
2163
opper_sql_sample_20
100
3s
1414
opper_sql_sample_21
100
4s
1668
opper_sql_sample_22
100
6s
2047
opper_sql_sample_23
75
5s
1813
opper_sql_sample_24
100
6s
1959
opper_sql_sample_25
100
8s
2292
opper_sql_sample_26
100
8s
2307
opper_sql_sample_27
75
6s
1957
opper_sql_sample_28
100
11s
2874
opper_sql_sample_29
100
8s
2339
opper_sql_sample_30