GitHub - mahavirbha/spamclassifier: A Spam Classifier for SMS/Emails using ML

A Spam Classifier for SMS/Emails

Data Cleaning
- renaming
- missing values
- remove duplicates
EDA (to understand underlying data)
- plotting charts (ham vs spam)
- wordcount
Text Pre-Processing (with the help of nltk library)
- Lower case
- Tokenization
- Removing special characters
- Removing stop words and punctuation
- Stemming
Model Building (with the help of sklearn library)
- train-test data
- tfidf vectorization
- model training on various ML classifiers
Evaluation
- compare and choose on best model
Improvement
- re-train model by hyper parameter tuning (here TfidfVectorizer(max_features=3000))
Website
- create & open a project in editor
- crete & code app.py file
- import .pkl files & functions from ipynb file
- integrate it in streamlit

Example Outputs:

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
Screenshot (234).png		Screenshot (234).png
Screenshot (235).png		Screenshot (235).png
Screenshot (236).png		Screenshot (236).png
app.py		app.py
model.pkl		model.pkl
spamClassifier.ipynb		spamClassifier.ipynb
vectorizer.pkl		vectorizer.pkl