Machine Learning Model for Diabetes Diagnosis

Description

This project trains a predictive machine learning model to diagnose diabetes in female patients. The training dataset, sourced from Kaggle, contains over 800 records. The workflow covers data cleaning, exploratory data analysis, model training, evaluation, and visualization of performance metrics.

Features

Data cleaning and preprocessing
Exploratory Data Analysis (EDA) using Pandas
Implementation of multiple machine learning techniques using Scikit-Learn
Model evaluation and hyperparameter tuning
Performance visualization using Seaborn and Matplotlib
Achieved an accuracy score of up to 80%
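
As a rough illustration of comparing several Scikit-Learn techniques, the sketch below scores two candidate classifiers with 5-fold cross-validation. It is not the notebook's code: the data is a synthetic stand-in produced by `make_classification`, and the two models (logistic regression and random forest) are assumptions chosen for the example, so the printed scores say nothing about the project's actual results.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the real dataset (assumption: 800 rows, 8 numeric features).
X, y = make_classification(
    n_samples=800, n_features=8, n_informative=5, random_state=42
)

# Candidate models are assumptions; the notebook may use different ones.
candidates = {
    "logistic_regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "random_forest": RandomForestClassifier(n_estimators=200, random_state=42),
}

# Mean 5-fold cross-validated accuracy for each candidate.
cv_scores = {
    name: cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
    for name, model in candidates.items()
}

# Print candidates from best to worst.
for name, score in sorted(cv_scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {score:.3f}")
```

Cross-validation is used here instead of a single train/test split so that the comparison is less sensitive to one particular partition of a small dataset.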

Technologies Used

Python
Pandas, NumPy for data manipulation
Scikit-Learn for machine learning models
Seaborn, Matplotlib for visualization

Installation & Setup

Prerequisites

Python 3.x installed
Jupyter Notebook or a Python IDE (VS Code, PyCharm, etc.)
Virtual environment (optional but recommended)

Setup

Clone the repository:

```
git clone https://github.com/TheVinh-Ha-1710/Diabetes-Predictive-Model.git
cd Diabetes-Predictive-Model
```

Create and activate a virtual environment (optional but recommended):

```
python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`
```

Install dependencies:
```
pip install -r requirements.txt
```
Launch Jupyter Notebook and open model_training.ipynb:
```
jupyter notebook
```

Usage

Load and preprocess the dataset.
Perform exploratory data analysis to understand data insights.
Train and evaluate various machine learning models.
Optimize the best-performing model through hyperparameter tuning.
Visualize model performance with accuracy, confusion matrix, and ROC curve.
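
The steps above can be sketched end to end as follows. This is a minimal illustration under stated assumptions, not the notebook's implementation: the data is synthetic (with the repository file, the frame would instead come from `pd.read_csv("diabetes.csv")`), the column names `feature_0` ... `feature_7` and `target` are hypothetical, and median imputation, logistic regression, and the `C` grid are choices made only for the example.

```python
import numpy as np
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix, roc_auc_score, roc_curve
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# 1. Load: synthetic stand-in so the sketch runs without the real file.
X_raw, y_raw = make_classification(
    n_samples=800, n_features=8, n_informative=5, random_state=0
)
df = pd.DataFrame(X_raw, columns=[f"feature_{i}" for i in range(8)])  # hypothetical names
df["target"] = y_raw  # hypothetical label column

# Simulate missing values so the cleaning step has something to do.
rng = np.random.default_rng(0)
df.loc[rng.choice(len(df), size=40, replace=False), "feature_0"] = np.nan

# 2. Explore: summary statistics and missing-value counts.
print(df.describe().T[["mean", "std"]])
print(df.isna().sum())

# 3. Split, then preprocess inside a pipeline to avoid leaking test data.
X = df.drop(columns="target")
y = df["target"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)
pipeline = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
    ("model", LogisticRegression(max_iter=1000)),
])

# 4. Tune: grid search over the regularization strength (assumed grid).
search = GridSearchCV(
    pipeline,
    param_grid={"model__C": [0.01, 0.1, 1.0, 10.0]},
    cv=5,
    scoring="accuracy",
)
search.fit(X_train, y_train)

# 5. Evaluate on the held-out split.
y_pred = search.predict(X_test)
y_prob = search.predict_proba(X_test)[:, 1]
accuracy = accuracy_score(y_test, y_pred)
cm = confusion_matrix(y_test, y_pred)
fpr, tpr, _ = roc_curve(y_test, y_prob)
auc = roc_auc_score(y_test, y_prob)

print("best params:", search.best_params_)
print(f"accuracy: {accuracy:.3f}, ROC AUC: {auc:.3f}")
print(cm)
```

For the visualization step, `cm` can be passed to `seaborn.heatmap(cm, annot=True)` and the ROC curve drawn with `matplotlib.pyplot.plot(fpr, tpr)`; plotting is left out here so the sketch runs without a display.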

Folder Structure

📂 Diabetes-Predictive-Model
 ├── 📜 README.md               # Project documentation
 ├── 📜 diabetes.csv            # Dataset
 ├── 📜 model_training.ipynb    # Model training notebook
 ├── 📜 model_training.pdf      # PDF version of the notebook
 └── 📜 requirements.txt        # Dependencies
