Skip to content

This project involves analyzing a diabetes dataset and implementing predictive models using Random Forest, SVM, and Logistic Regression. The models were fine-tuned and their performance compared to determine the most effective classifier.

Notifications You must be signed in to change notification settings

srinikha193/Diabetes_Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Diabetes_Prediction

Diabetes Dataset Analysis and Predictive Modeling

This project involves an in-depth analysis of a diabetes dataset, followed by the implementation of predictive modeling techniques to classify and predict the likelihood of diabetes occurrence. The dataset was carefully examined and preprocessed to ensure data quality and feature relevance.

Three different machine learning classifiers were utilized for predictive modeling:

  1. Random Forest Classifier: An ensemble learning method that operates by constructing multiple decision trees and outputting the class that is the mode of the classes of the individual trees. Fine-tuning involved optimizing the number of trees, maximum depth, and other hyperparameters to achieve the best performance.

  2. Support Vector Machine (SVM): A powerful linear classifier that aims to find the optimal hyperplane that maximizes the margin between different classes. The SVM model was fine-tuned by adjusting the kernel type, regularization parameter (C), and gamma values to enhance the classification accuracy.

  3. Logistic Regression:A widely used statistical method for binary classification problems. The model was fine-tuned by selecting appropriate regularization techniques and adjusting hyperparameters such as the regularization strength.

Each model was fine-tuned using cross-validation techniques to avoid overfitting and to ensure robustness in performance. After fine-tuning, the models' performance was evaluated and compared based on accuracy, precision, recall, and F1-score metrics. The results provide insights into the most effective model for predicting diabetes in the given dataset.

About

This project involves analyzing a diabetes dataset and implementing predictive models using Random Forest, SVM, and Logistic Regression. The models were fine-tuned and their performance compared to determine the most effective classifier.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published