This machine learning project's goal is to analyze and predict flight delays based on weather, precipitation and past aviation data. Given a collection of input data, the goal is to create a predictive model that can accurately forcast the possibility of flight delays for future flights.
- Python 3.x
- XGBoost
- pandas
- scikit-learn
- pickle
- seaborn
- numpy
The project uses a flight dataset containing historical flight information, including various features such as airline, origin, destination, scheduled departure time, and others. Make sure to obtain the dataset and place it in the appropriate directory.
Delay status for each day of the week
Aviation_delay.ipynb
: jupyter notebook containing EDA, feature engineering, model and hyperparameter tuning.xgbmodel.pkl
: trained model saved as pkl filerequirements.txt
: required dependencies and modulesREADME.md
: detailed description about the project.data/
: directory containg datasets.
- Install the required dependencies by running the following command:
pip install -r requirements.txt
-
Place the flight dataset file in the
data/
directory. -
Preprocess the dataset and train the XGBoost model, The trained model will be saved in the
models/
directory as a pickle file. -
Evaluate the trained model's performance.
-
Predict flight delays for new data using the trained model by loading the pickle file.
This project provides a framework for analyzing and predicting flight delays using XGBoost. The trained model can be used to forecast flight delays and potentially help airlines and passengers make informed decisions.