🚑 Emergency Vehicle Priority System

Intelligent Traffic Signal Control using Reinforcement Learning

📋 Overview

This project implements an AI-powered traffic signal control system designed to prioritize emergency vehicles (ambulances) while maintaining efficient overall traffic flow. Using Proximal Policy Optimization (PPO) and SUMO (Simulation of Urban MObility), the system learns optimal signal timing policies that significantly reduce emergency vehicle response times.

🎯 Problem Statement

Traditional fixed-time traffic signals cannot adapt to dynamic traffic conditions or prioritize emergency vehicles. This results in:

Delayed emergency response times
Increased risk to patients requiring urgent care
Inefficient traffic flow during emergency situations

Our RL-based solution dynamically adjusts traffic signals to clear paths for ambulances while minimizing disruption to civilian traffic.

🛠️ Technical Stack

RL Framework: Stable-Baselines3 (PPO)
Traffic Simulator: SUMO 1.25.0
Environment Wrapper: sumo-rl
Python: 3.12
Key Libraries: Gymnasium, PyTorch, NumPy

📁 Project Structure

rl-traffic-control/
├── train_optimized.py          # Optimized training script with VecNormalize
├── test_optimized.py            # Evaluation script with GUI visualization
├── train2.py                    # Basic training script
├── test2.py                     # Basic testing script
├── run_baseline_pure_traci.py  # Baseline fixed-time signals comparison
├── draft02.net.xml              # SUMO road network
├── draft02.rou.xml              # Civilian vehicle routes
├── ambulance.rou.xml            # Emergency vehicle configuration
├── vtypes.rou.xml               # Vehicle type definitions
└── draft02.sumocfg              # SUMO configuration file

🚀 Quick Start

Installation

# Install dependencies
pip install -r requirements.txt

# Set SUMO_HOME environment variable
export SUMO_HOME="/usr/share/sumo"  # Adjust path as needed
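
SUMO's Python tools (including TraCI) ship under $SUMO_HOME/tools, so scripts usually check the variable before importing them. A minimal sketch of that check follows; the helper name sumo_tools_path is hypothetical and is not taken from this repository.

```python
import os
import sys


def sumo_tools_path(environ=os.environ):
    """Return the SUMO tools directory, or raise if SUMO_HOME is unset."""
    sumo_home = environ.get("SUMO_HOME")
    if not sumo_home:
        raise RuntimeError("Please set the SUMO_HOME environment variable")
    return os.path.join(sumo_home, "tools")


# Only touch sys.path when the variable is actually set.
if "SUMO_HOME" in os.environ:
    tools = sumo_tools_path()
    if tools not in sys.path:
        sys.path.append(tools)  # makes `import traci` resolvable
```

Failing early with a clear message tends to be easier to debug than an ImportError raised later inside the training script.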

Running the Project

1. Train the Optimized RL Agent

python train_optimized.py

Training runs for 100,000 timesteps
Checkpoints saved every 10,000 steps to ./models/
Final model: optimized_traffic_agent.zip
Normalization stats: vec_normalize.pkl
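
Stable-Baselines3's PPO collects experience in whole rollouts of n_steps, so the step budget is rounded up to the next full rollout. The arithmetic below is a sketch that assumes a single environment, n_steps=2048 (the value listed under Optimization Techniques), and checkpointing that fires once per environment step count of 10,000; the function name is hypothetical.

```python
import math

TOTAL_TIMESTEPS = 100_000  # from this README
CHECKPOINT_FREQ = 10_000   # from this README
N_STEPS = 2048             # from this README
N_ENVS = 1                 # assumption: one environment, not confirmed by the source


def rollout_summary(total, n_steps, n_envs, save_freq):
    """Return (rollouts, steps actually collected, checkpoints written)."""
    steps_per_rollout = n_steps * n_envs
    rollouts = math.ceil(total / steps_per_rollout)
    collected = rollouts * steps_per_rollout
    # A step-based callback is called once per vectorized environment step.
    checkpoints = (collected // n_envs) // save_freq
    return rollouts, collected, checkpoints


print(rollout_summary(TOTAL_TIMESTEPS, N_STEPS, N_ENVS, CHECKPOINT_FREQ))
# (49, 100352, 10)
```

Under these assumptions the run performs 49 policy updates and slightly overshoots the nominal 100,000 steps.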

2. Run Baseline Comparison

python run_baseline_pure_traci.py

Simulates fixed-time traffic signals
Records ambulance travel time for comparison
Visualizes simulation in SUMO GUI
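
For intuition, a fixed-time controller is just a lookup into a repeating cycle. The sketch below is illustrative only: the phase names and the 30-second greens are hypothetical and are not read from draft02.net.xml; only the 4-second yellow matches the value stated in this README.

```python
# Hypothetical cycle: (phase name, duration in seconds).
CYCLE = [
    ("NS_green", 30),
    ("NS_yellow", 4),
    ("EW_green", 30),
    ("EW_yellow", 4),
]


def phase_at(t, cycle=CYCLE):
    """Return the phase name active at simulation time t (seconds)."""
    period = sum(duration for _, duration in cycle)
    offset = t % period
    for name, duration in cycle:
        if offset < duration:
            return name
        offset -= duration
    raise AssertionError("unreachable: offset always falls inside the cycle")


print(phase_at(0), phase_at(31), phase_at(68))
# NS_green NS_yellow NS_green
```

Because the schedule depends only on the clock, an ambulance arriving during a red phase has to wait it out, which is the behavior the RL agent is meant to improve on.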

3. Test the Trained Agent

python test_optimized.py

Loads trained model and normalization stats
Visualizes agent performance with SUMO GUI
Tracks and reports ambulance travel time
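
One way to measure ambulance travel time is to note the first and last simulation step in which the vehicle is present. The class below is a sketch of that idea, not the code in test_optimized.py; the class name and the vehicle ID "ambulance_0" are hypothetical.

```python
class TravelTimeTracker:
    """Record when a vehicle first appears and when it leaves the simulation."""

    def __init__(self, vehicle_id):
        self.vehicle_id = vehicle_id
        self.depart_time = None
        self.arrival_time = None

    def update(self, sim_time, active_vehicle_ids):
        """Call once per simulation step with the IDs currently in the network."""
        present = self.vehicle_id in active_vehicle_ids
        if present and self.depart_time is None:
            self.depart_time = sim_time
        elif not present and self.depart_time is not None and self.arrival_time is None:
            self.arrival_time = sim_time

    @property
    def travel_time(self):
        """Seconds from spawn to arrival, or None if the trip is incomplete."""
        if self.depart_time is None or self.arrival_time is None:
            return None
        return self.arrival_time - self.depart_time


tracker = TravelTimeTracker("ambulance_0")  # hypothetical vehicle ID
for t, ids in [(119, []), (120, ["ambulance_0"]), (125, ["ambulance_0"]), (137, [])]:
    tracker.update(t, ids)
print(tracker.travel_time)  # 17
```

In a TraCI loop, the two arguments could come from traci.simulation.getTime() and traci.vehicle.getIDList().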

🧠 Key Features

Custom Reward Function

reward = -1 * ((civilian_penalty × 0.1) + (ambulance_penalty × 5000))

Civilian Penalty: Sum of accumulated waiting time across all lanes
Ambulance Penalty: Large penalty (weight 5000) incurred while the ambulance's speed is below 1 m/s
Weighted Design: The ambulance weight (5000) is 50,000× the civilian weight (0.1), so a stalled ambulance dominates the reward
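
The formula above can be sketched as a plain function. This is an interpretation rather than the repository's custom_ambulance_reward(): it assumes ambulance_penalty is a 0/1 flag that is set while the ambulance moves slower than 1 m/s, and the function and argument names are hypothetical.

```python
CIVILIAN_WEIGHT = 0.1    # from this README
AMBULANCE_WEIGHT = 5000  # from this README
STALL_SPEED = 1.0        # m/s, from this README


def ambulance_reward(lane_waiting_times, ambulance_speed):
    """Negative weighted sum of civilian waiting time and an ambulance stall flag.

    ambulance_speed is None when no ambulance is in the network.
    """
    civilian_penalty = sum(lane_waiting_times)
    stalled = ambulance_speed is not None and ambulance_speed < STALL_SPEED
    ambulance_penalty = 1 if stalled else 0  # assumption: 0/1 indicator
    return -1 * (civilian_penalty * CIVILIAN_WEIGHT
                 + ambulance_penalty * AMBULANCE_WEIGHT)


print(ambulance_reward([10, 20], ambulance_speed=0.5))   # stalled ambulance
print(ambulance_reward([10, 20], ambulance_speed=13.9))  # ambulance moving freely
```

With 30 seconds of accumulated civilian waiting, the stalled case costs about 5003 while the free-flowing case costs about 3, which shows how strongly the ambulance term dominates.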

Optimization Techniques

VecNormalize: Normalizes observations and rewards with running statistics, keeping their scale small and training stable
Larger Network: Two hidden layers of 256 units each (vs. the default 64×64)
Fine-tuned Hyperparameters:
- Learning rate: 3e-4
- Gamma: 0.995 (long-term planning)
- Entropy coefficient: 0.01 (exploration)
- n_steps: 2048 (experience collection)
- GAE Lambda: 0.95 (bias-variance trade-off in advantage estimates)
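
The settings listed above map onto keyword arguments of Stable-Baselines3's PPO constructor. The dictionary below is a sketch of how they could be grouped; the name PPO_KWARGS is hypothetical and the actual layout in train_optimized.py may differ.

```python
# Intended use (assumption): stable_baselines3.PPO("MlpPolicy", env, **PPO_KWARGS)
PPO_KWARGS = {
    "learning_rate": 3e-4,
    "gamma": 0.995,        # discount factor
    "ent_coef": 0.01,      # entropy bonus weight
    "n_steps": 2048,       # rollout length per update
    "gae_lambda": 0.95,
    "policy_kwargs": {"net_arch": [256, 256]},  # two hidden layers of 256 units
}

# Effective planning horizon implied by gamma: about 1 / (1 - gamma) decisions.
horizon_steps = 1.0 / (1.0 - PPO_KWARGS["gamma"])
print(round(horizon_steps))  # 200
```

A gamma of 0.995 therefore weights rewards over roughly 200 agent decisions, compared with about 100 for the library default of 0.99.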

Environment Configuration

Simulation Time: 1000 seconds per episode
Ambulance Spawn: 120 seconds into simulation
Signal Constraints: 5-60 seconds green time, 4 seconds yellow
Control Mode: Single-agent (centralized control)
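
The signal constraints act as a filter on the agent's decisions. The function below is a generic illustration of such a filter and is not sumo-rl's internal logic; its name and return values are hypothetical. In sumo-rl these limits are typically passed to SumoEnvironment through the min_green, max_green and yellow_time arguments.

```python
MIN_GREEN = 5   # seconds, from this README
MAX_GREEN = 60  # seconds, from this README
YELLOW = 4      # seconds, inserted between two green phases


def resolve_action(wants_switch, green_elapsed):
    """Return 'hold' or 'switch' after applying the green-time limits."""
    if green_elapsed < MIN_GREEN:
        return "hold"    # too early: the current green must last at least MIN_GREEN
    if green_elapsed >= MAX_GREEN:
        return "switch"  # forced: the current green has reached MAX_GREEN
    return "switch" if wants_switch else "hold"


print(resolve_action(True, 3), resolve_action(False, 60), resolve_action(True, 20))
# hold switch switch
```

The minimum green also bounds how quickly the agent can react: in the worst case an ambulance waits out the remaining minimum green plus the 4-second yellow before its approach turns green.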

📊 Performance Metrics

The system tracks:

Ambulance Travel Time: Primary metric (seconds from spawn to destination)
System Mean Waiting Time: Average civilian vehicle delay
Episode Reward Mean: Normalized reward (should converge to stable value)
Explained Variance: How well the value function predicts returns
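
Explained variance is computed as 1 - Var(returns - predictions) / Var(returns). The sketch below reproduces that formula with the standard library so the reported number can be sanity-checked by hand; the function name is chosen here for illustration.

```python
from statistics import pvariance


def explained_variance(predicted, actual):
    """1.0 = perfect value predictions, 0.0 = no better than predicting the mean."""
    var_actual = pvariance(actual)
    if var_actual == 0:
        return float("nan")  # undefined when the returns never vary
    residuals = [a - p for a, p in zip(actual, predicted)]
    return 1.0 - pvariance(residuals) / var_actual


print(explained_variance([1, 2, 3], [1, 2, 3]))  # 1.0
print(explained_variance([2, 2, 2], [1, 2, 3]))  # 0.0
```

Negative values are possible and indicate a value function that predicts worse than a constant.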

Achieved Results

Baseline (Fixed-time): ~17 seconds ambulance travel time
RL Agent (Optimized): ~5 seconds ambulance travel time
Improvement: ~70% reduction in ambulance travel time (17 s → 5 s)
Impact: The RL agent successfully prioritizes ambulance passage with minimal civilian traffic disruption
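
The improvement figure follows directly from the two travel times reported above. A one-line check (the function name is illustrative):

```python
def percent_reduction(baseline, improved):
    """Relative reduction of `improved` versus `baseline`, in percent."""
    return 100.0 * (baseline - improved) / baseline


print(f"{percent_reduction(17, 5):.1f}%")  # 70.6%
```

Since both inputs are approximate, the result is best read as "about 70%".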

🔧 Configuration

Training Parameters

Edit train_optimized.py:

total_timesteps=100000      # Training duration
num_seconds=1000            # Episode length
learning_rate=3e-4          # Learning rate
checkpoint_freq=10000       # Save frequency
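
These values also determine roughly how many episodes the agent sees. The estimate below assumes sumo-rl's default of 5 simulated seconds between agent actions (delta_time=5); the source does not state which value train_optimized.py actually uses.

```python
TOTAL_TIMESTEPS = 100_000  # from this README
NUM_SECONDS = 1000         # from this README
DELTA_TIME = 5             # assumption: sumo-rl default, seconds per agent action

steps_per_episode = NUM_SECONDS // DELTA_TIME
episodes = TOTAL_TIMESTEPS // steps_per_episode
print(steps_per_episode, episodes)  # 200 500
```

If delta_time differs, the episode count scales proportionally, so doubling num_seconds without raising total_timesteps would halve the number of episodes.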

Reward Tuning

Adjust weights in custom_ambulance_reward():

civilian_penalty * 0.1      # Civilian weight (default: 0.1)
ambulance_penalty = 5000    # Ambulance penalty (default: 5000)
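
When tuning, it can help to express the two weights as a break-even point: how much accumulated civilian waiting time costs as much as one step with a stalled ambulance. A small sketch, assuming the ambulance term contributes a flat 5000 per step while stalled; the function name is hypothetical.

```python
def breakeven_waiting_seconds(civilian_weight=0.1, ambulance_penalty=5000):
    """Civilian waiting time (s) whose weighted cost equals one ambulance penalty."""
    return ambulance_penalty / civilian_weight


print(round(breakeven_waiting_seconds()))          # 50000
print(round(breakeven_waiting_seconds(1.0, 5000)))  # 5000
```

Raising the civilian weight lowers the break-even point, which makes the agent less willing to hold civilian traffic for the ambulance.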

📈 Monitoring Training

During training, watch for:

ep_rew_mean: Should rise and then stabilize (not remain at -228k)
explained_variance: Should increase from ~0 to 0.3-0.5
entropy_loss: Logged by Stable-Baselines3 as the negative policy entropy, so its magnitude should gradually shrink toward 0 (exploration → exploitation)
policy_gradient_loss: Should remain stable, not explode
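
Whether ep_rew_mean has "stabilized" can be judged by eye, but a simple rule makes it repeatable. The heuristic below is a hypothetical helper, not part of this project: it reports success only when the last two windows agree within a tolerance and the recent mean beats the first window, so a curve stuck at its starting value does not count.

```python
def has_stabilized(values, window=10, tolerance=0.05):
    """True if the reward curve has flattened out above where it started."""
    if len(values) < 3 * window:
        return False  # not enough history to compare three windows
    first = sum(values[:window]) / window
    previous = sum(values[-2 * window:-window]) / window
    recent = sum(values[-window:]) / window
    scale = max(abs(previous), 1e-8)
    plateaued = abs(recent - previous) / scale < tolerance
    return plateaued and recent > first


stuck = [-228_000] * 30
learned = [-228_000] * 5 + [-60_000] * 25
print(has_stabilized(stuck), has_stabilized(learned))  # False True
```

The window size and tolerance are arbitrary starting points and would need adjusting to the logging frequency.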

🎮 Simulation Controls

When running with GUI (use_gui=True):

Space: Pause/Resume
Mouse Wheel: Zoom in/out
Left-click + Drag: Pan view (right-click + drag zooms)
Vehicle Click: View individual vehicle details

📝 Requirements

sumo-rl
stable-baselines3
gymnasium
matplotlib
seaborn
shimmy
torch

🤝 Contributing

This is an academic/research project. Key areas for improvement:

Multi-agent scenarios (multiple intersections)
Real-world traffic pattern integration
Transfer learning from simulation to reality
Additional emergency vehicle types

📄 License

Educational/Research Use

👨‍💻 Author

MAYANK SAHU

🙏 Acknowledgments

SUMO Development Team
Stable-Baselines3 Contributors
sumo-rl Library Maintainers

Note: This system is designed for simulation and research purposes. Real-world deployment would require extensive validation, safety testing, and regulatory approval.
