Welcome to the Titanic Survival Prediction project! This repository contains a Jupyter notebook that demonstrates a comprehensive data analysis process and machine learning techniques to predict the survival of passengers on the Titanic. This project is intended for beginners who are looking to get started with data analysis and machine learning.
The Titanic disaster is one of the most infamous shipwrecks in history. In this project, we use data from the Titanic passengers to predict who would survive the disaster. This project covers the entire data analysis and machine learning pipeline, from data cleaning and exploratory data analysis (EDA) to model training and evaluation.
The dataset used in this project is the Titanic dataset provided by Kaggle. It contains information about the passengers on the Titanic, including whether they survived, their age, fare, class, and other attributes.
You can download the dataset from Kaggle Titanic Competition.
To run this notebook, you need to have Python installed along with the following libraries:
In the EDA section, we perform the following steps:
The results section includes: