Skip to content

This is a machine learning project. Logistic regression, decision tree and random forests are used to predict if a hotel booking will be canceled or not.

Notifications You must be signed in to change notification settings

tylerzhang77/hotel_booking_prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

This data set contains about 120k lines of data about hotel booking information.

Research question

  • Predict if a hotel booking will be canceled using existing variables.

Files

data_dictionary.txt

This is a data dictionary, containing notes for each variable.

hotel_booking_prediction.ipynb

This is the jupyter notebook that contains all the code and analysis.

hotel_bookings.csv

This is the main data set used in this project.

Conclusions

Random forest does a better job at predicting cancellation, compared to logistic regression and decision tree, with 0.86 precision, 0.77 recall and 0.87 accuracy.

About

This is a machine learning project. Logistic regression, decision tree and random forests are used to predict if a hotel booking will be canceled or not.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published