Skip to content

ruchir321/Data-Cleaning-101

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Cleaning 101

In this repo, I have used the Kaggle Dataset to explore data preparation techniques.

Code

The missing_data_practice.ipynb notebook contains the code for the data preparation techniques.

Concepts

  • Missingness Types: Missing Completely at Random (MCAR), Missing at Random (MAR), Missing Not at Random (MNAR)
  • Univariate Imputation Techniques: Mean/Median/Mode Imputation, Random Sample Imputation
  • Multivariate Imputation Techniques: KNN Imputation, MICE Imputation

Pyhton libraries used:

  • pandas
  • numpy
  • matplotlib
  • missingno
  • fastimpute
  • sklearn

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published