SQL

Natural language to SQL query generation evaluates text-to-query fidelity and schema reasoning. This task is particularly relevant for analytics chat assistants and simplified database interfaces where users need to query data using natural language. Models must understand both the intent behind the question and the structure of the underlying database schema.

94
groq/gpt-oss-20b
Average duration
4s
Average tokens
1216
Average cost
$0.00
100
9s
1387
opper_sql_sample_01
100
3s
846
opper_sql_sample_02
100
2s
873
opper_sql_sample_03
100
6s
857
opper_sql_sample_04
100
3s
752
opper_sql_sample_05
100
3s
927
opper_sql_sample_06
100
3s
831
opper_sql_sample_07
100
4s
785
opper_sql_sample_08
100
2s
804
opper_sql_sample_09
100
3s
905
opper_sql_sample_10
100
3s
939
opper_sql_sample_11
100
5s
1084
opper_sql_sample_12
100
3s
1118
opper_sql_sample_13
100
5s
1158
opper_sql_sample_14
100
7s
1321
opper_sql_sample_15
100
4s
1576
opper_sql_sample_16
100
3s
1101
opper_sql_sample_17
100
6s
1780
opper_sql_sample_18
50
4s
1400
opper_sql_sample_19
100
4s
1639
opper_sql_sample_20
100
4s
1176
opper_sql_sample_21
100
3s
1407
opper_sql_sample_22
100
4s
1422
opper_sql_sample_23
75
6s
1503
opper_sql_sample_24
75
4s
1552
opper_sql_sample_25
100
4s
1573
opper_sql_sample_26
50
4s
1565
opper_sql_sample_27
75
4s
1384
opper_sql_sample_28
100
4s
1615
opper_sql_sample_29
100
5s
1201
opper_sql_sample_30