This directory contains Jupyter notebook for the Heart Disease Prediction project. The notebook is used for exploratory data analysis, model prototyping, and various model building experiments related to predicting heart disease.
├── README.md
├── catboost_info
│ ├── catboost_training.json
│ ├── learn
│ │ └── events.out.tfevents
│ ├── learn_error.tsv
│ └── time_left.tsv
├── model.ipynb
└── requirements.txt
The notebook in this directory serve multiple purposes:
- Data Preprocessing and Exploration: Understand the dataset by visualizing the data distribution, identifying patterns, and detecting outliers.
- Data normalization: Normalize data using StandardScaler and DictVectorizer
- Model Prototyping: Experiment with different machine learning models including Logistic Regression, Random Forest, XGBoost, CatBoost to predict heart disease.
- Evaluation: Assess model performance using appropriate metrics like F1 Score and ROC AUC Score.
To get started with these notebooks:
- Install packages
pip install -r requirements.txt
- Run Jupyter Notebook
jupyter notebook
- Get the token and enter token to access notebook