Skip to content

Latest commit

 

History

History
19 lines (17 loc) · 842 Bytes

File metadata and controls

19 lines (17 loc) · 842 Bytes

Analysis-of-Titanic-Data-Set-using-Logistic-Regression-

This is a classic dataset used in many data mining tutorials and demos -- perfect for getting started with exploratory analysis and building binary classification models to predict survival. Data covers passengers only, not crew. We are Predicting the Survival of Titanic Passengers in this dataset.

Data Description

  1. Survived(0=No and 1=Yes)
  2. Pclass = Passenger Class(1=1st,2=2nd,3=3rd)
  3. Name
  4. Sex
  5. Age
  6. SibSp(Number of siblings/Spouses Aboard)
  7. Parch( Number of parents/Children Aboard)
  8. Ticket= Ticket No
  9. Fare=Passenger Fare(British Pound)
  10. Cabin=Cabin
  11. Embarked=Port of Embarkation(C=Cherbourg,France; Q=Queenstown,UK; S=Southampton-Cobh, Ireland)

We have analyzed this data using Logistic Regression. The accuracy of this model is nearly 75%.