spam-sms-classification

In this project, we model a classifier to label a given piece of text as spam or not a spam. Also, we created an API for the model, using Flask, the Python micro framework for building web applications. The model is currently being hosted on herokuapp server - here

Dataset

The classifier is trained offline with spam and non-spam messages. The trained model is deployed as a service to serve users. We have used the famous SPAM or HAM Dataset by UCI-ML.

The dataset consists of a file naming data.csv which contains one message per line. Each line is composed by two columns: v1 contains the label (ham or spam) and v2 contains the raw text.

Features

We have used Naive Bayes and Count Vectorizer. Later its accuracy has been improved by using SVM Classifier and TF-IDF Vectorizer. It is further improved by using LSTM classifier implemented using Keras API in Python.

Tools

Main modules used are - Flask, Pandas, sklearn, numpy, PIL, Keras, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
static		static
templates		templates
LSTM_spam_model.ipynb		LSTM_spam_model.ipynb
LSTM_spam_model.pkl		LSTM_spam_model.pkl
NB_spam_model.ipynb		NB_spam_model.ipynb
NB_spam_model.pkl		NB_spam_model.pkl
Procfile		Procfile
README.md		README.md
SVM_spam_model.ipynb		SVM_spam_model.ipynb
SVM_spam_model.pkl		SVM_spam_model.pkl
app.py		app.py
requirements.txt		requirements.txt
runtime.txt		runtime.txt
spam.csv		spam.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spam-sms-classification

Dataset

Features

Tools

About

Releases

Packages

Languages

kumargauravsingh14/spam-sms-classifier

Folders and files

Latest commit

History

Repository files navigation

spam-sms-classification

Dataset

Features

Tools

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages