Traffic Accident Analysis

Project Overview

This project analyzes a dataset on traffic accidents and evaluates different machine learning models to classify accident types. The dataset undergoes exploratory data analysis (EDA), preprocessing, and training using various classification models. The goal is to determine which model performs best in predicting accident types based on available features.

Dataset Details

The dataset traffic_accidents.csv contains information on traffic accidents, including categorical and numerical features. The crash_type column serves as the target variable.

Data Preprocessing

Categorical columns are encoded using LabelEncoder.
The crash_date column is dropped as it is not required for model training.
The dataset is split into training (80%) and testing (20%) sets.

Machine Learning Models Used

The following models were trained and evaluated:

Random Forest Classifier
Logistic Regression
Decision Tree Classifier
K-Nearest Neighbors (KNN)

Model Performance

Model	Accuracy
Random Forest	83.66%
Logistic Regression	83.16%
Decision Tree	79.23%
KNN	74.52%

Random Forest achieved the highest accuracy among all models.

Results and Insights

Random Forest performed the best due to its ability to handle both categorical and numerical features while reducing overfitting.
Logistic Regression performed slightly worse but is still a strong baseline model.
Decision Tree had lower accuracy, likely due to overfitting on training data.
KNN had the lowest accuracy as it is sensitive to feature scaling and data distribution.

How to Run the Code

Clone this repository:

git clone https://github.com/arpanpramanik2003/traffic-accident-analysis.git

Navigate to the project folder:
```
cd traffic-accident-analysis
```
Install required dependencies:
```
pip install -r requirements.txt
```

Future Improvements

Adding more advanced models such as Gradient Boosting and Neural Networks.
Feature engineering to enhance model performance.
Hyperparameter tuning for better optimization.
Deployment using Flask or Streamlit for interactive visualization.

Author

Arpan Pramanik

For any queries, feel free to connect with me on GitHub!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
traffic-accidents-analyses.ipynb		traffic-accidents-analyses.ipynb
traffic.jpg		traffic.jpg
traffic_accidents.csv		traffic_accidents.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Traffic Accident Analysis

Project Overview

Dataset Details

Data Preprocessing

Machine Learning Models Used

Model Performance

Results and Insights

How to Run the Code

Future Improvements

Author

About

Releases

Packages

Languages

License

arpanpramanik2003/traffic-accident-analysis

Folders and files

Latest commit

History

Repository files navigation

Traffic Accident Analysis

Project Overview

Dataset Details

Data Preprocessing

Machine Learning Models Used

Model Performance

Results and Insights

How to Run the Code

Future Improvements

Author

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages