forked from Devanik21/ISRO_Mining_Site_FINAL_APP
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
fa7e783
commit 1c9e4ba
Showing
5 changed files
with
10,383 additions
and
0 deletions.
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
<h1 align="center"> Water Potability Project </h1> | ||
|
||
### Introduction | ||
The Water Potability Prediction project aims to predict the potability of water based on various physicochemical properties. The project involves analyzing various aspects of water quality, including pH, hardness, solids, and other chemical properties, to determine whether the water is potable or not. This was a semester-end project for the machine learning course offered by Jai Hind College Mumbai. I did the analysis and preprocessing in jupyter notebook and created a web application using python's Streamlit library. This project will provide a powerful tool for researchers, policymakers, and the general public to analyze the potability of water and make informed decisions based on the insights gained from the analysis. The project will help identify which physicochemical properties are most important in determining water potability, while the web app will provide a user-friendly tool for predicting water potability based on the input of various physicochemical properties. | ||
|
||
|
||
### Workflow: | ||
1) Importing required libraries and functions from models and loading the data. | ||
2) Exploratory data analysis and data visualization to understand the relationship between the attributes. | ||
3) Data preprocessing which involves filling in the missing values and scaling the numerical columns. | ||
4) Splitting the model into train & test data and evaluating its performance using a confusion matrix. | ||
5) Visualizing the decision tree and getting the feature importances. | ||
6) Saving the model for further use and concluding the notebook. | ||
|
Oops, something went wrong.