Skip to content

This project aims to predict the presence of heart disease in patients based on their medical attributes, using machine learning techniques and PySpark. The goal is to develop an accurate model that can assist healthcare professionals in early diagnosis and treatment decisions.

Notifications You must be signed in to change notification settings

Rhythm-Divine/Heart-Disease-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Predicting Heart Disease

This project focuses on predicting the presence of heart disease in patients based on various medical attributes. The goal is to develop a machine learning model that can assist in early diagnosis and treatment decisions.

Dataset

The dataset used for this project is sourced from the UCI Machine Learning Repository. It contains information collected by multiple institutions, including the Hungarian Institute of Cardiology and University Hospitals in Zurich and Basel.

Project Structure

The project follows the following structure:

  • data/: Directory to store the dataset
  • README.md: This file, providing an overview of the project.

Setup and Usage

To run the project, follow these steps:

  1. Clone the repository: git clone https://github.com/Rhythm-Divine/heart-disease-prediction.git
  2. Navigate to the project directory: cd heart-disease-prediction
  3. Install the required libraries
  4. Download the dataset from Kaggle and place it in the data/ directory.
  5. Open and run the Jupyter notebooks in the notebooks/ directory for data exploration, preprocessing, modeling, and evaluation.

Results and Evaluation

The model's performance is evaluated based on accuracy, aiming to achieve at least 85% accuracy in predicting heart disease. The results, insights, and potential applications are discussed in the Jupyter notebooks and documented in the project report.

Future Steps and Contributions

Potential future steps for this project include:

  • Refining the model with additional feature engineering and hyperparameter tuning.
  • Exploring different machine learning algorithms for comparison.
  • Deploying the trained model as a web application or API for practical use.

Contributions and suggestions are welcome! Feel free to open an issue or submit a pull request.

About

This project aims to predict the presence of heart disease in patients based on their medical attributes, using machine learning techniques and PySpark. The goal is to develop an accurate model that can assist healthcare professionals in early diagnosis and treatment decisions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published