Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling

📝 Introduction

Agentar-Scale-SQL is a novel framework that leverages scalable computation to significantly improve Text-to-SQL performance on challenging benchmarks. By implementing an Orchestrated Test-Time Scaling strategy, our framework synergistically combines three distinct perspectives to bridge the gap between state-of-the-art models and human expert performance.

Figure 1: The proposed Agentar-Scale-SQL framework.

⚡️ Performance

Methods	EX (Dev)	EX (Test)	R-VES (%)
Agentar-Scale-SQL (Ours)	74.90	81.67	77.00
AskData + GPT-4o	76.14	80.88	76.24
LongData-SQL	74.32	77.53	71.89
CHASE-SQL + Gemini	74.90	76.02	69.94
JoyDataAgent-SQL	74.25	75.85	70.16
TCDataAgent-SQL	74.12	75.74	-
Contextual-SQL	73.50	75.63	70.02
XiYan-SQL	73.34	75.63	71.41

🎉 News

🚀 2025.11.27: We are excited to release Agentar-Scale-SQL-Generation-32B on Hugging Face and ModelScope! Simultaneously, we have open-sourced the code for the Light Schema Engine and the Offline Data Preprocessing Pipeline!
🎁 2025.09.30: Our paper is available on arXiv.
🏆 2025.09.25: We are proud to announce that we have achieved #1 Rank on the official BIRD leaderboard with 81.67% execution accuracy!

🗺️ Release Roadmap

We are committed to continuously improving Agentar-Scale-SQL. Here is our plan for upcoming features and releases.

Paper
- Publish the paper on arXiv.
Model Releases
- Release Agentar-Scale-SQL-Generation-32B on Hugging Face and ModelScope.
- Release Agentar-Scale-SQL-Selection-32B on Hugging Face and ModelScope.
Code Releases
- Release the code for the light schema engine.
- Release the code for the offline data preprocessing pipeline.
- Release the code for task understanding and generating SQL candidates with closed-source models.
- Release the code for generating SQL candidates with the fine-tuned model.
- Release the code for the SQL selection module.

📂 Directory Structure

Agentar-Scale-SQL/
├── ScaleSQL/                     # Core source code directory
│   └── workflows/                # Main workflow scripts
│       └── config/               # Configuration files
├── ddl_schema.sh
├── requirements.txt              # Dependency list
├── .env                          # Environment variable
├── .env.example                  # Environment variable template
├── .gitignore
├── README.md                     # Current document
├── nltk_data.zip                 # For ddl schema generation

📚 Usage

1. Installation and Environment Settings

1.1 Create Virtual Environment and Install Python Dependencies

conda create -n ScaleSQL python=3.10
conda activate ScaleSQL

1.2 Install PyTorch and Core Dependencies

# Install PyTorch (CUDA 12.1)
pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu121

1.3 Install Project Dependencies

pip install -r requirements.txt

1.4 Install vLLM (for Inference Acceleration)

pip install https://github.com/vllm-project/vllm/releases/download/v0.8.5.post1/vllm-0.8.5.post1+cu121-cp38-abi3-manylinux1_x86_64.whl

1.5 Download Embedding Model

modelscope download --model sentence-transformers/all-MiniLM-L6-v2 --local_dir ./ScaleSQL/model/all-MiniLM-L6-v2

2. Data Preparation

2.1 Configure Paths

Modify the configuration file: .ScaleSQL/workflows/config/pipeline_config.yaml. Note that, we need column meaning file in the evaluation.

dataset_folder: /temp/bird_test  # Change to the actual folder
column_meaning_path: /your_path/column_meaning.json # Change to the actual path

3. Preprocessing Pipeline

3.1 Generate Light Schema

python -m ScaleSQL.workflows.schema_generation --evaluation_type test

Output example: .ScaleSQL/dataset/bird_test_light_schema.json

3.2 Process Training Set Examples and Write to Vector Database

ANONYMIZED_TELEMETRY=False python -m ScaleSQL.workflows.train_skeleton_process

Output path: /tmp/ScaleSQL/chroma/bird_train_skeleton

3.3 Process Database Cell Values and Write to Vector Database

ANONYMIZED_TELEMETRY=False python -m ScaleSQL.workflows.database_cell_process --evaluation_type test

Output path: /tmp/ScaleSQL/chroma/bird_test

3.4 Build BM25 Index (Content-Based) and Generate DDL Schema (Requires Java Environment)

bash ddl_schema.sh

Output example: .ScaleSQL/dataset/bird_test_ddl_schema.json

📦 Try Our Product

We are developing a ChatBI product that transforms complex business data into conversational insights. If you are interested in trying our ChatBI product, please contact us.

Figure 2: The contact information.

📎 Citation

@misc{wang2025agentarscalesqladvancingtexttosqlorchestrated,
      title={Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling}, 
      author={Pengfei Wang and Baolin Sun and Xuemei Dong and Yaxun Dai and Hongwei Yuan and Mengdie Chu and Yingqi Gao and Xiang Qi and Peng Zhang and Ying Yan},
      year={2025},
      eprint={2509.24403},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2509.24403}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
ScaleSQL		ScaleSQL
resources		resources
LEGAL.md		LEGAL.md
LICENSE		LICENSE
README.md		README.md
ddl_schema.sh		ddl_schema.sh
nltk_data.zip		nltk_data.zip
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling

📝 Introduction

⚡️ Performance

🎉 News

🗺️ Release Roadmap

📂 Directory Structure

📚 Usage

1. Installation and Environment Settings

1.1 Create Virtual Environment and Install Python Dependencies

1.2 Install PyTorch and Core Dependencies

1.3 Install Project Dependencies

1.4 Install vLLM (for Inference Acceleration)

1.5 Download Embedding Model

2. Data Preparation

2.1 Configure Paths

3. Preprocessing Pipeline

3.1 Generate Light Schema

3.2 Process Training Set Examples and Write to Vector Database

3.3 Process Database Cell Values and Write to Vector Database

3.4 Build BM25 Index (Content-Based) and Generate DDL Schema (Requires Java Environment)

📦 Try Our Product

📎 Citation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

antgroup/Agentar-Scale-SQL

Folders and files

Latest commit

History

Repository files navigation

Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling

📝 Introduction

⚡️ Performance

🎉 News

🗺️ Release Roadmap

📂 Directory Structure

📚 Usage

1. Installation and Environment Settings

1.1 Create Virtual Environment and Install Python Dependencies

1.2 Install PyTorch and Core Dependencies

1.3 Install Project Dependencies

1.4 Install vLLM (for Inference Acceleration)

1.5 Download Embedding Model

2. Data Preparation

2.1 Configure Paths

3. Preprocessing Pipeline

3.1 Generate Light Schema

3.2 Process Training Set Examples and Write to Vector Database

3.3 Process Database Cell Values and Write to Vector Database

3.4 Build BM25 Index (Content-Based) and Generate DDL Schema (Requires Java Environment)

📦 Try Our Product

📎 Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages