Skip to content
#

missing-value-handling

Here are 21 public repositories matching this topic...

A machine learning project to predict loan defaults in a German bank's customer base. Using the German Credit Risk dataset, it explores key factors contributing to defaults and trains models like Random Forest, GBM, and XGBoost. Includes EDA, data processing, hyperparameter tuning, and model evaluation.

  • Updated Nov 24, 2024
  • Jupyter Notebook

SiMI imputes numerical and categorical missing values by making an educated guess based on records that are similar to the record having a missing value. Using the similarity and correlations, missing values are then imputed. To achieve a higher quality of imputation some segments are merged together using a novel approach.

  • Updated Mar 24, 2023
  • Java

FIMUS imputes numerical and categorical missing values by using a data set’s existing patterns including co-appearances of attribute values, correlations among the attributes and similarity of values belonging to an attribute.

  • Updated Mar 24, 2023
  • HTML

This project utilizes Python for data preprocessing and analysis, along with Power BI for creating an interactive dashboard, to analyze trends and insights within the movie industry. The project encompasses data collection, cleaning, exploration, visualization, and interpretation to provide valuable insights into various aspects of the industry.

  • Updated Mar 24, 2024
  • Jupyter Notebook

DMI Class implements the DMI imputation algorithm for imputing missing values in a dataset from Rahman, M. G., and Islam, M. Z. (2013): Missing Value Imputation Using Decision Trees and Decision Forests by Splitting and Merging Records: Two Novel Techniques

  • Updated Mar 24, 2023
  • Java

kDMI employs two levels of horizontal partitioning (based on a decision tree and k-NN algorithm) of a data set, in order to find the records that are very similar to the one with missing value/s. Additionally, it uses a novel approach to automatically find the value of k for each record.

  • Updated Mar 25, 2023
  • Java

This repository demonstrates data cleaning with a layoffs dataset. It covers handling missing values, detecting outliers, and encoding categorical data, using visualizations like boxplots and distplots to enhance data quality. Check out the code to see these techniques in action.

  • Updated Sep 11, 2024
  • Python

Improve this page

Add a description, image, and links to the missing-value-handling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the missing-value-handling topic, visit your repo's landing page and select "manage topics."

Learn more