GitHub - data42lana/learning_ml_tools: The notebook shows how machine learning tools and algorithms (scikit-learn, XGBoost, LightGBM) work in practice.

Solving regression and classification tasks with ML - Abalone

Note: This repository has been archived.

A Jupyter notebook with an example of solving regression and classification tasks using ML tools and algorithms (scikit-learn, XGBoost, LightGBM).

Overview:

The objective was to learn how machine learning tools and algorithms work in practice. To do this, an example dataset was chosen whose target (the age of abalone) can be predicted with both regression and classification algorithms. The Jupyter notebook describes the following stages of work:

exploring and visualizing the available data with the pandas package and the seaborn and matplotlib libraries;
feature transformation and dimensionality reduction using the scikit-learn tools;
searching for the best regression and classification models (scikit-learn, XGBoost, LightGBM) and tuning them with hyperparameter search;
evaluating the best models found on test data.
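
The stages above can be sketched in a few lines of scikit-learn. This is a minimal illustration, not the notebook's actual code: it assumes a synthetic regression dataset in place of the abalone data, a random forest as the model, and a small hypothetical parameter grid. An XGBoost or LightGBM regressor with the scikit-learn interface could be swapped in as the `model` step.

```python
from sklearn.datasets import make_regression
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: synthetic data stands in for the abalone features and target.
X, y = make_regression(
    n_samples=300, n_features=8, n_informative=5, noise=10.0, random_state=0
)

# Hold out a test set that is only used for the final evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Feature transformation, dimensionality reduction, and the model in one pipeline.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("pca", PCA()),
    ("model", RandomForestRegressor(random_state=0)),
])

# Hypothetical grid: values chosen only to keep the example fast.
param_grid = {
    "pca__n_components": [4, 6],
    "model__n_estimators": [50, 100],
}

# Cross-validated hyperparameter search on the training data.
search = GridSearchCV(
    pipeline, param_grid, cv=3, scoring="neg_mean_absolute_error"
)
search.fit(X_train, y_train)

# Evaluate the best model found on the held-out test data.
test_mae = mean_absolute_error(y_test, search.predict(X_test))
print("best params:", search.best_params_)
print("test MAE:", round(test_mae, 2))
```

Keeping the scaler and PCA inside the pipeline means they are refitted on each cross-validation fold, so the search does not see information from the validation folds.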

Setup:

The Jupyter notebook was created in a virtual environment configured with Miniconda on a local machine running Windows 10. To run it, create a virtual environment of your own, check that the packages listed below are present, and install the matching versions where needed.

Versions of packages used:

python 3.8.5, conda 4.9.2, scikit-learn 0.24.1, xgboost 1.3.3, lightgbm 3.1.1, numpy 1.19.2, pandas 1.1.3, matplotlib 3.3.2, seaborn 0.11.0
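
One way to check an environment against this list is a short script built on the standard library's `importlib.metadata`. This is a sketch, not part of the repository: the helper name `check_versions` is hypothetical, and `python` and `conda` are left out because they are not installed as Python distributions in the same way.

```python
from importlib import metadata

# Package versions copied from the list above (python and conda omitted).
EXPECTED = {
    "scikit-learn": "0.24.1",
    "xgboost": "1.3.3",
    "lightgbm": "3.1.1",
    "numpy": "1.19.2",
    "pandas": "1.1.3",
    "matplotlib": "3.3.2",
    "seaborn": "0.11.0",
}


def check_versions(expected):
    """Return {package: (expected, installed or None)} for every mismatch."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, installed) in check_versions(EXPECTED).items():
        print(f"{name}: expected {wanted}, found {installed or 'not installed'}")
```

A different installed version does not necessarily break the notebook; the script only reports where the environment differs from the one the notebook was created in.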

Data:

The Abalone Data Set is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license. Source: Dua, D. and Graff, C. (2019). UCI Machine Learning Repository (web link). Irvine, CA: University of California, School of Information and Computer Science.

