SQL

Natural language to SQL query generation evaluates text-to-query fidelity and schema reasoning. This task is particularly relevant for analytics chat assistants and simplified database interfaces where users need to query data using natural language. Models must understand both the intent behind the question and the structure of the underlying database schema.

94
groq/gpt-oss-120b
Average duration
4s
Average tokens
1206
Average cost
$0.00
100
4s
804
opper_sql_sample_01
100
3s
870
opper_sql_sample_02
100
3s
803
opper_sql_sample_03
100
3s
804
opper_sql_sample_04
100
4s
970
opper_sql_sample_05
100
4s
1012
opper_sql_sample_06
100
4s
762
opper_sql_sample_07
100
3s
795
opper_sql_sample_08
100
4s
832
opper_sql_sample_09
100
4s
1046
opper_sql_sample_10
100
3s
1074
opper_sql_sample_11
100
4s
1102
opper_sql_sample_12
100
4s
1145
opper_sql_sample_13
100
3s
1086
opper_sql_sample_14
100
3s
1240
opper_sql_sample_15
100
3s
1248
opper_sql_sample_16
100
4s
1242
opper_sql_sample_17
100
4s
1484
opper_sql_sample_18
50
4s
1401
opper_sql_sample_19
100
5s
1430
opper_sql_sample_20
100
3s
1286
opper_sql_sample_21
100
4s
1517
opper_sql_sample_22
100
4s
1600
opper_sql_sample_23
75
4s
1598
opper_sql_sample_24
75
4s
1483
opper_sql_sample_25
100
4s
1413
opper_sql_sample_26
50
4s
1592
opper_sql_sample_27
75
4s
1551
opper_sql_sample_28
100
4s
1688
opper_sql_sample_29
100
3s
1291
opper_sql_sample_30