Skip to content

Leryamerlennn/ML_supervised_learning_basics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

14 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€– Supervised Machine Learning – Regression, Classification & Model Evaluation

This repository contains my completed coursework for Machine Learning Assignment 1. The focus is on building and evaluating supervised learning models from scratch using real-world datasets.

From fitting regression curves to tuning classification algorithms, this project demonstrates my ability to handle data preprocessing, apply core ML algorithms, and assess model performance with industry-standard metrics β€” all implemented in Python using the scikit-learn.


What I Learned / What I Can Do

βœ… Apply linear and polynomial regression to numerical data
βœ… Use GridSearchCV for hyperparameter optimization
βœ… Preprocess datasets: handle missing values, normalize, and encode features
βœ… Train and evaluate classifiers: Logistic Regression, KNN, Naive Bayes
βœ… Compare models using metrics like Accuracy, F1-score, Recall, Precision
βœ… Use pipelines to ensure clean, modular, and reproducible ML workflows


Contents

The solution is presented in the Notebook

Task 1 – Regression Modeling

  • Load and split dataset from task1_data.csv
  • Train and evaluate linear regression model using:
    • MSE, RMSE, MAE, RΒ²
  • Perform polynomial regression and select optimal degree using GridSearchCV
  • Visualize and compare model performance

Task 2 – Classification Pipeline

  • Load pokemon_modified.csv and perform:
    • Missing value imputation
    • One-hot encoding for categorical features
    • Feature scaling
  • Train and tune:
    • Logistic Regression
    • K-Nearest Neighbors (KNN)
    • Naive Bayes
  • Evaluate models using:
    • Accuracy, Precision, Recall, F1-score
  • Select best-performing model

Technologies

  • Python (Jupyter Notebook)
  • numpy, pandas, matplotlib, scikit-learn
  • Data preprocessing pipelines
  • Cross-validation with GridSearchCV
  • Classification and regression metrics

Author

Valeria Neganova
Focus: Practical supervised learning, evaluation & model selection
πŸ“« Valerochka.neganova@mail.ru

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published