Wine Quality Predictions

Description

This project focuses on predicting the quality of red wine using various machine learning algorithms for regression analysis, data visualizations, and data analysis. The dataset comprises physicochemical and sensory variables related to red and white variants of the Portuguese "Vinho Verde" wine.

Context

The datasets present a classification or regression task, considering physicochemical inputs and sensory outputs. Notably, the classes are ordered and imbalanced, making it challenging to predict wine quality accurately. Privacy and logistic constraints limit available information to physicochemical and sensory variables, omitting grape types, wine brand, and selling price.

Content

For detailed information, refer to the original publication by Cortez et al., 2009. Input Variables (Physicochemical Tests):

Fixed acidity
Volatile acidity
Citric acid
Residual sugar
Chlorides
Free sulfur dioxide
Total sulfur dioxide
Density
pH
Sulphates
Alcohol

Output Variable (Sensory Data):

Quality (Score between 0 and 10)

Tips

Consider exploring classification tasks by setting a cutoff for wine quality, e.g., classifying scores of 7 or higher as 'good/1' and the rest as 'not good/0'. Experiment with hyperparameter tuning, decision tree algorithms, ROC curves, and AUC values.

Project Steps

Importing Libraries
Loading Data
Understanding Data
Missing Values
Exploring Variables (Data Analysis)
Feature Selection
Proportion of Good vs Bad Wines
Preparing Data for Modeling
Applying Different Models
Choosing the Right Model

Inspiration

Utilize machine learning to identify physicochemical properties that contribute to a wine being classified as 'good'!

Acknowledgements

The dataset is also available from the UCI machine learning repository. Please include the citation below if you plan to use this database:

Citation: P. Cortez, A. Cerdeira, F. Almeida, T. Matos, and J. Reis. Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553, 2009.

Relevant Publication

P. Cortez, A. Cerdeira, F. Almeida, T. Matos, and J. Reis. Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553, 2009.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
wine_quality_prediction.ipynb		wine_quality_prediction.ipynb
winequality-red.csv		winequality-red.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wine Quality Predictions

Description

Context

Content

Tips

Project Steps

Inspiration

Acknowledgements

Relevant Publication

About

Releases

Packages

Languages

Sreeja9428/Wine-Quality-Prediction

Folders and files

Latest commit

History

Repository files navigation

Wine Quality Predictions

Description

Context

Content

Tips

Project Steps

Inspiration

Acknowledgements

Relevant Publication

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages