This is a classic dataset used in many data mining tutorials and demos. It is well suited to getting started with exploratory analysis and to building binary classification models that predict survival. The data covers passengers only, not crew. The goal here is to predict the survival of Titanic passengers.
Data Description
- Survived (0 = No, 1 = Yes)
- Pclass = Passenger Class (1 = 1st, 2 = 2nd, 3 = 3rd)
- Name
- Sex
- Age
- SibSp = Number of Siblings/Spouses Aboard
- Parch = Number of Parents/Children Aboard
- Ticket = Ticket Number
- Fare = Passenger Fare (British pounds)
- Cabin = Cabin Number
- Embarked = Port of Embarkation (C = Cherbourg, France; Q = Queenstown (now Cobh), Ireland; S = Southampton, England)
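As a small illustration of the coded columns listed above, the sketch below spells out the Survived, Pclass and Embarked codes for a single row. It assumes rows arrive as dictionaries of strings (for example from `csv.DictReader`); the helper name `decode_row` and the example row are hypothetical and not part of the dataset.

```python
# Code-to-label mappings taken from the data description above.
SURVIVED = {"0": "No", "1": "Yes"}
PCLASS = {"1": "1st", "2": "2nd", "3": "3rd"}
EMBARKED = {"C": "Cherbourg", "Q": "Queenstown", "S": "Southampton"}


def decode_row(row):
    """Return a copy of a raw row (dict of strings) with coded fields spelled out.

    Missing or unrecognised codes are left unchanged.
    """
    decoded = dict(row)
    for column, mapping in (
        ("Survived", SURVIVED),
        ("Pclass", PCLASS),
        ("Embarked", EMBARKED),
    ):
        value = decoded.get(column)
        if value in mapping:
            decoded[column] = mapping[value]
    return decoded


# Hypothetical row for illustration only; not a real passenger record.
example = {"Survived": "1", "Pclass": "3", "Sex": "female", "Age": "30",
           "SibSp": "0", "Parch": "0", "Fare": "8.05", "Embarked": "S"}
print(decode_row(example))
```

Leaving unknown codes untouched is a deliberate choice here, so that blank or unexpected values in columns such as Embarked remain visible during exploratory analysis.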
We analyzed this data using logistic regression. The model's accuracy is nearly 75%.
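The description does not say which features, library or training procedure were used, so the following is only a minimal sketch of what a logistic regression survival classifier could look like. It uses the Python standard library alone, fits by batch gradient descent, and runs on a few made-up rows. The feature choice (Pclass, a female indicator, scaled Age), the function names and the hyperparameters are all assumptions. It does not reproduce the accuracy reported above, and in practice a library implementation would probably be used instead.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the mean log loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            # Prediction error for this row: estimated probability minus label.
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(X, w, b):
    """Predict 1 (survived) when the estimated probability is at least 0.5."""
    return [
        1 if sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) >= 0.5 else 0
        for xi in X
    ]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up toy rows (NOT real passenger data): [Pclass, is_female, Age / 100].
X = [[1, 1, 0.29], [3, 0, 0.22], [2, 1, 0.35], [3, 0, 0.40],
     [1, 0, 0.54], [3, 1, 0.18], [2, 0, 0.30], [1, 1, 0.45]]
y = [1, 0, 1, 0, 0, 1, 0, 1]

w, b = fit_logistic(X, y)
print("training accuracy on toy rows:", accuracy(y, predict(X, w, b)))
```

A figure computed this way is a training accuracy on invented rows. A meaningful estimate would need the real data and a held-out test split.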