Titanic-Logistic-Regression

For this project, we have worked with the Titanic Data Set from Kaggle.

The goal is to predict whether each person survived or deceased in the shipwreck. (binary classification task)

The model's objective is to analyze various features or information about the individuals and make predictions about their survival outcomes based on that data.

Steps of the project:

Import all important libraries
Reading the titanic_train.csv file into pandas dataframe
View the top few rows of the dataframe
Exploratory Data Analysis to visualize the data
Check for Missing data
Data Cleaning: Impute missing values in Age based Pclass (take average of age in Pclass)
Data Cleaning: Drop the Cabin Column
Data Cleaning: Drop the row in Embarked column that is NaN
Feature Engineering
Convert categorical features (Sex, Embark) to dummy variables using get_dummies
Build a logistic regression model (by splitting the data in 70:30 ratio of train/test)
Predict and evaluate the model
Analyse Confusion Matrix and Classification Report

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Group 4_Titanic.ipynb		Group 4_Titanic.ipynb
README.md		README.md
Titanic.pptx		Titanic.pptx
titanic_train.csv		titanic_train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Titanic-Logistic-Regression

About

Uh oh!

Releases

Packages

Languages

khanhbang03/Titanic-Logistic-Regression

Folders and files

Latest commit

History

Repository files navigation

Titanic-Logistic-Regression

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages