SQL

Natural language to SQL query generation evaluates text-to-query fidelity and schema reasoning. This task is particularly relevant for analytics chat assistants and simplified database interfaces where users need to query data using natural language. Models must understand both the intent behind the question and the structure of the underlying database schema.

88
gcp/gemini-2.5-flash-lite
Average duration
17s
Average tokens
1119
Average cost
$0.00
100
19s
970
opper_sql_sample_01
100
19s
689
opper_sql_sample_02
100
19s
668
opper_sql_sample_03
100
17s
643
opper_sql_sample_04
100
17s
755
opper_sql_sample_05
100
19s
758
opper_sql_sample_06
100
19s
661
opper_sql_sample_07
100
19s
649
opper_sql_sample_08
100
19s
662
opper_sql_sample_09
100
17s
742
opper_sql_sample_10
100
19s
891
opper_sql_sample_11
100
17s
970
opper_sql_sample_12
100
19s
974
opper_sql_sample_13
100
17s
1000
opper_sql_sample_14
75
19s
1013
opper_sql_sample_15
0
15s
1068
opper_sql_sample_16
100
15s
992
opper_sql_sample_17
75
15s
1101
opper_sql_sample_18
50
15s
1002
opper_sql_sample_19
100
15s
1083
opper_sql_sample_20
100
14s
1113
opper_sql_sample_21
100
14s
1154
opper_sql_sample_22
100
17s
1369
opper_sql_sample_23
50
17s
1198
opper_sql_sample_24
100
17s
1910
opper_sql_sample_25
100
17s
1188
opper_sql_sample_26
50
17s
1168
opper_sql_sample_27
75
16s
1300
opper_sql_sample_28
50
22s
4768
opper_sql_sample_29
100
13s
1096
opper_sql_sample_30