MissNoMore is a Python-based missing value imputation tool designed to handle CSV datasets with missing data. It offers a range of imputation techniques, from simple mean and median strategies to more advanced methods like K-Nearest Neighbors (KNN), iterative imputation, and decision tree-based imputations. The tool provides both a "Basic" mode for quick imputations and an "Advanced" mode for more sophisticated approaches.
Try out the deployed MissNoMore application here.
- Missing value imputation for CSV datasets
- Two modes: "Basic" (mean, median, interpolate) and "Advanced" (KNN, iterative, decision tree)
- Easy-to-use Streamlit interface
- Leveraging the power of Pandas for data manipulation
- Improves data quality by handling missing data effectively
- Python 3.8+
- Pandas library
- Streamlit library
- Scikit library
-
Clone the repository:
git clone https://github.com/souravsuvarna/MissNoMore.git cd MissNoMore cd app
-
Install the required libraries:
pip install pandas pip install streamlit pip install scikit-learn
-
Run the Streamlit app:
streamlit run app.py
-
Choose the CSV dataset with missing values you want to impute.
-
Select the desired imputation mode (Basic or Advanced).
-
For Basic mode, choose the column from drop-down list then choose appropriate imputation technique.(mean, median,interpolate,etc) .
-
For Advanced mode, appropriate imputation techniques.( KNN, iterative, decision tree-based imputations,etc).
-
Click the "Submit" button to process the data and generate the imputed dataset.
-
Download the imputed dataset for further analysis.
Contributions to MissNoMore are welcome! If you find any issues or have suggestions for improvements, please feel free to create a pull request or raise an issue.
- The MissNoMore project was inspired by the need for a user-friendly missing value imputation tool.
- Thanks to the developers of Pandas, Streamlit, and other open-source libraries used in this project.