Agentar-Scale-SQL is a novel framework that leverages scalable computation to significantly improve Text-to-SQL performance on challenging benchmarks. By implementing an Orchestrated Test-Time Scaling strategy, our framework synergistically combines three distinct perspectives to bridge the gap between state-of-the-art models and human expert performance.
Figure 1: The proposed Agentar-Scale-SQL framework.| Methods | EX (Dev) | EX (Test) | R-VES (%) |
|---|---|---|---|
| Agentar-Scale-SQL (Ours) | 74.90 | 81.67 | 77.00 |
| AskData + GPT-4o | 76.14 | 80.88 | 76.24 |
| LongData-SQL | 74.32 | 77.53 | 71.89 |
| CHASE-SQL + Gemini | 74.90 | 76.02 | 69.94 |
| JoyDataAgent-SQL | 74.25 | 75.85 | 70.16 |
| TCDataAgent-SQL | 74.12 | 75.74 | - |
| Contextual-SQL | 73.50 | 75.63 | 70.02 |
| XiYan-SQL | 73.34 | 75.63 | 71.41 |
- π
2025.11.27: We are excited to release Agentar-Scale-SQL-Generation-32B on Hugging Face and ModelScope! Simultaneously, we have open-sourced the code for the Light Schema Engine and the Offline Data Preprocessing Pipeline! - π
2025.09.30: Our paper is available on arXiv. - π
2025.09.25: We are proud to announce that we have achieved #1 Rank on the official BIRD leaderboard with 81.67% execution accuracy!
We are committed to continuously improving Agentar-Scale-SQL. Here is our plan for upcoming features and releases.
- Paper
- Publish the paper on arXiv.
- Model Releases
- Release Agentar-Scale-SQL-Generation-32B on Hugging Face and ModelScope.
- Release Agentar-Scale-SQL-Selection-32B on Hugging Face and ModelScope.
- Code Releases
- Release the code for the light schema engine.
- Release the code for the offline data preprocessing pipeline.
- Release the code for task understanding and generating SQL candidates with closed-source models.
- Release the code for generating SQL candidates with the fine-tuned model.
- Release the code for the SQL selection module.
Agentar-Scale-SQL/
βββ ScaleSQL/ # Core source code directory
β βββ workflows/ # Main workflow scripts
β βββ config/ # Configuration files
βββ ddl_schema.sh
βββ requirements.txt # Dependency list
βββ .env # Environment variable
βββ .env.example # Environment variable template
βββ .gitignore
βββ README.md # Current document
βββ nltk_data.zip # For ddl schema generationconda create -n ScaleSQL python=3.10
conda activate ScaleSQL# Install PyTorch (CUDA 12.1)
pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu121pip install -r requirements.txtpip install https://github.com/vllm-project/vllm/releases/download/v0.8.5.post1/vllm-0.8.5.post1+cu121-cp38-abi3-manylinux1_x86_64.whlmodelscope download --model sentence-transformers/all-MiniLM-L6-v2 --local_dir ./ScaleSQL/model/all-MiniLM-L6-v2Modify the configuration file: .ScaleSQL/workflows/config/pipeline_config.yaml.
Note that, we need column meaning file in the evaluation.
dataset_folder: /temp/bird_test # Change to the actual folder
column_meaning_path: /your_path/column_meaning.json # Change to the actual pathpython -m ScaleSQL.workflows.schema_generation --evaluation_type testOutput example:
.ScaleSQL/dataset/bird_test_light_schema.json
ANONYMIZED_TELEMETRY=False python -m ScaleSQL.workflows.train_skeleton_processOutput path:
/tmp/ScaleSQL/chroma/bird_train_skeleton
ANONYMIZED_TELEMETRY=False python -m ScaleSQL.workflows.database_cell_process --evaluation_type testOutput path:
/tmp/ScaleSQL/chroma/bird_test
bash ddl_schema.shOutput example:
.ScaleSQL/dataset/bird_test_ddl_schema.json
We are developing a ChatBI product that transforms complex business data into conversational insights. If you are interested in trying our ChatBI product, please contact us.
Figure 2: The contact information.@misc{wang2025agentarscalesqladvancingtexttosqlorchestrated,
title={Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling},
author={Pengfei Wang and Baolin Sun and Xuemei Dong and Yaxun Dai and Hongwei Yuan and Mengdie Chu and Yingqi Gao and Xiang Qi and Peng Zhang and Ying Yan},
year={2025},
eprint={2509.24403},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2509.24403},
}
