- Basics of Machine Learning
- The Learning Problems
- Feasibility of Learning
- Experiments with Perceptron Learning Algorithm
- Theory of Generalization & Decision Stumps
- Linear Models
- Beyond Gradient Descent
- Experiments with Decision Stumps
- Linear Models and More
- Playing with Regularization
- Virtual Examples and Regularization
- Experiments with Linear and Nonlinear Models
- More about Regularization
- Validation
- Support Vector Machine
- Experiments with Regularized Logistic Regression
- Support Vector Machines
- Bagging and Boosting
- Experiments with Soft-Margin SVM and AdaBoost
This project explores different machine learning approaches for predicting the danceability of music tracks. The focus is on preprocessing the data, experimenting with a range of models and techniques, and comparing their performance, with the final goal of balancing model complexity and accuracy.
- Dealing with Missing Values:
- Filled missing values with the median, which is less sensitive to outliers than the mean.
- Text Features (Artist, Composer, etc.):
- Encoded each artist/composer as the mean danceability of their tracks; sparse categories (too few tracks) were treated as missing values.
- Standardization:
- Numerical features were standardized to balance their influence and enhance model performance.
- Correlation Analysis: Examined the correlation between features such as energy and liveness and the target danceability.
- Experimentation: Selected features based on performance, testing combinations of numerical features, artist, composer, and other attributes (a minimal preprocessing sketch follows this list).
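The following is a minimal sketch of this preprocessing pipeline, assuming a pandas DataFrame with hypothetical column names (`Danceability`, `Artist`, `Energy`, etc.) and an assumed sparsity cutoff; the report does not give the exact columns or thresholds:

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical column names and cutoff; the project's dataset may differ.
TARGET = "Danceability"
NUMERIC_COLS = ["Energy", "Liveness", "Loudness", "Tempo"]
MIN_TRACKS = 3  # assumed cutoff below which an artist counts as sparse

def preprocess(train: pd.DataFrame, test: pd.DataFrame):
    train, test = train.copy(), test.copy()

    # Text feature: encode each artist as the mean danceability of that
    # artist's training tracks; sparse artists stay NaN and fall through
    # to the median fill below.
    stats = train.groupby("Artist")[TARGET].agg(["mean", "count"])
    mapping = stats.loc[stats["count"] >= MIN_TRACKS, "mean"]
    for df in (train, test):
        df["ArtistDance"] = df["Artist"].map(mapping)

    features = NUMERIC_COLS + ["ArtistDance"]

    # Missing values: fill with the training median, which is less
    # sensitive to outliers than the mean.
    medians = train[features].median()
    train[features] = train[features].fillna(medians)
    test[features] = test[features].fillna(medians)

    # Standardization: put all features on a comparable scale.
    scaler = StandardScaler()
    X_train = scaler.fit_transform(train[features])
    X_test = scaler.transform(test[features])
    return X_train, train[TARGET].to_numpy(), X_test
```

The correlation analysis mentioned above is then a one-liner on the same frame, e.g. `train[NUMERIC_COLS + [TARGET]].corr()`.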
- Linear Regression:
- Simple and efficient, performed well after feature selection.
- Ridge Regression:
- Applied L2 regularization, but the improvement over plain linear regression was minimal.
- Support Vector Regression (SVR):
- Significant improvement with the RBF kernel and fine-tuned regularization parameters.
- Random Forest:
- Tended to overfit; required careful tuning of tree depth and minimum samples per leaf.
- Neural Networks:
- Best performance overall, capturing complex relationships in the data (see the model-comparison sketch after this list).
- Cross-validation and AdaBoost were applied, but with limited success in closing the gap between training and test errors.
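The report does not include the tuning code, but a minimal sketch of such a comparison with scikit-learn could look as follows. The hyperparameter values are illustrative placeholders rather than the tuned ones, and cross-validated mean absolute error is an assumed stand-in for the Eout measure:

```python
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPRegressor
from sklearn.svm import SVR

# Illustrative hyperparameters, not the project's tuned values.
MODELS = {
    "Linear Regression": LinearRegression(),
    "Ridge Regression": Ridge(alpha=1.0),
    "SVR (RBF kernel)": SVR(kernel="rbf", C=10.0, epsilon=0.1),
    "Random Forest": RandomForestRegressor(
        max_depth=8, min_samples_leaf=20, random_state=0),
    "Neural Network": MLPRegressor(
        hidden_layer_sizes=(64, 32), max_iter=1000, random_state=0),
}

def compare(X, y, cv=5):
    """Print cross-validated MAE (an assumed proxy for Eout) per model."""
    for name, model in MODELS.items():
        scores = cross_val_score(
            model, X, y, cv=cv, scoring="neg_mean_absolute_error")
        print(f"{name:20s} MAE = {-scores.mean():.3f}")
```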
- The Neural Network model outperformed the others with Eout = 1.84, demonstrating its ability to model nonlinear relationships more effectively.
- Blending: Combined predictions from multiple models, leading to the best overall result of Eout = 1.83 (sketched below).
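The report does not specify the blending scheme; a weighted average of per-model predictions is one common choice, sketched here with placeholder weights:

```python
import numpy as np

def blend(predictions, weights=None):
    # predictions: list of 1-D arrays, one per model (e.g. the SVR,
    # random forest, and neural network outputs on the test set).
    # weights: optional per-model weights (e.g. based on validation
    # error); defaults to a uniform average.
    preds = np.stack(predictions)      # shape: (n_models, n_samples)
    if weights is None:
        weights = np.ones(len(preds))
    weights = np.asarray(weights, dtype=float)
    weights /= weights.sum()           # normalize so weights sum to 1
    return weights @ preds             # weighted average per sample
```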
Preprocessing text features and blending models were critical for improving performance. While simpler models like Linear Regression provided reasonable results, the Neural Network was the best at capturing complex patterns.