Skip to content

Commit d9b0d16

Browse files
authored
Update index.html
1 parent 0284116 commit d9b0d16

1 file changed

Lines changed: 12 additions & 0 deletions

File tree

index.html

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -242,6 +242,18 @@ <h4>News</h4>
242242
>
243243
<div class="card-body" style="background-color: #f1f6f9">
244244
<ul style="padding-left: 0">
245+
<li style="list-style-type: none">
246+
<span
247+
style="
248+
background-color: #B03052;
249+
color: white;
250+
padding: 2px 4px;
251+
border-radius: 5px;
252+
"
253+
><strong style="color: #ffe7ce">Sep. 10, 2025:</strong></span> We're excited to announce the release of the <a href="https://huggingface.co/datasets/birdsql/livesqlbench-base-full-v1" target="_blank"><strong>LiveSQLBench-Base-Full-V1 (600)</strong></a>! The first text-to-SQL benchmark covering all SQL spectrum with Hierarchical Knowlegde Base (HKB) and test cases.
254+
We provide two types of queries: normal query and colloquial queries for people to test according to their own needs. The flag model Gemini-2.5-pro can only achieve <strong>28.67</strong> in colloquial queries, and <strong>35.67</strong> in normal queries. The <a href="https://huggingface.co/datasets/birdsql/livesqlbench-base-lite" target="_blank">base-lite</a> and <a href="https://huggingface.co/datasets/birdsql/livesqlbench-base-full-v1" target="_blank">base-full-v1</a> would be locked version for development of research methods.
255+
The detailed performance is in our <a href="https://livesqlbench.ai/" target="_blank">website</a>.
256+
</li>
245257
<li style="list-style-type: none">
246258
<span
247259
style="

0 commit comments

Comments
 (0)