Skip to content

Latest commit

 

History

History
17 lines (12 loc) · 1010 Bytes

README.md

File metadata and controls

17 lines (12 loc) · 1010 Bytes

QSAR-Biodegradation

This project aimed to analyze the QSAR Biodegradation dataset using five classification models: Logistic Regression, K-Nearest Neighbour (KNN), Support Vector Classifier (SVC), Decision Tree Classifier, and Random Forest Classifier. The models were trained and evaluated using sklearn metrics.

Proposed Methodology

CLASSIFICATION

Types of Classification Models Used

  • Logistic Regression
  • K-Nearest Neighbor (KNN)
  • Support Vector Classifier
  • Decision Tree Classifier
  • Random Forest Classifier

DATASET DESCRIPTION

The QSAR biodegradation dataset was built in the Milano Chemometrics and QSAR Research Group. The data have been used to develop QSAR (Quantitative Structure Activity Relationships) models for the study of the relationships between chemical structure and biodegradation of molecules. Biodegradation experimental values of 1055 chemicals were collected from the webpage of the National Institute of Technology and Evaluation of Japan (NITE).