Spam is a major concern: beyond its often offensive content, it wastes time, and end users risk deleting legitimate mail by mistake. Spam also has an economic impact, which has led some countries to adopt legislation against it.
This is a machine learning application that classifies emails as spam or ham (legitimate mail).
1. Data Processing & Manipulation:
- Pandas – Data analysis and manipulation.
- NumPy – Numerical computing and array operations.
- Pickle – Object serialization and deserialization.
2. Natural Language Processing (NLP):
- NLTK – Text preprocessing and linguistic analysis.
3. Machine Learning:
- Scikit-learn – Machine learning algorithms and model evaluation.
- Joblib – Model persistence and efficient computation.
4. Data Visualization:
- Altair – Declarative statistical visualization.
- PyDeck – 3D geospatial data visualization.
- Pillow – Image processing.
5. Utilities & Supporting Tools:
- GitPython – Git repository interaction.
- JSONSchema – JSON validation.
- Protobuf – Data serialization.
- PyArrow – Columnar data format support.
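
To illustrate the kind of spam/ham classification this application performs, here is a minimal, dependency-free sketch. It is not the project's actual code. It assumes a multinomial Naive Bayes model with add-one smoothing (this README does not say which algorithm is used), a simple regex tokenizer standing in for NLTK preprocessing, and a tiny made-up training set. The names `tokenize`, `train`, and `predict` are hypothetical. Of the libraries listed above, only the standard-library `pickle` module is used, to show a trained model being saved and restored.

```python
import math
import pickle
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (stand-in for NLTK)."""
    return re.findall(r"[a-z']+", text.lower())


def train(texts, labels):
    """Count words per label and return the model state as a plain dict."""
    word_counts = {}
    doc_counts = Counter(labels)
    for text, label in zip(texts, labels):
        word_counts.setdefault(label, Counter()).update(tokenize(text))
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    return {"word_counts": word_counts, "doc_counts": doc_counts, "vocab": vocab}


def predict(model, text):
    """Return the label with the highest log-probability for the text."""
    total_docs = sum(model["doc_counts"].values())
    vocab_size = len(model["vocab"])
    scores = {}
    for label, counts in model["word_counts"].items():
        # Log prior: share of training emails carrying this label.
        score = math.log(model["doc_counts"][label] / total_docs)
        # Add-one smoothing keeps unseen (label, word) pairs from zeroing the score.
        denom = sum(counts.values()) + vocab_size
        for word in tokenize(text):
            if word in model["vocab"]:  # words never seen in training are skipped
                score += math.log((counts[word] + 1) / denom)
        scores[label] = score
    return max(scores, key=scores.get)


# Made-up toy data, for illustration only.
train_texts = [
    "Win a free prize now, click here",
    "Limited offer, claim your free cash",
    "Meeting moved to Monday at ten",
    "Can you review my report before lunch",
]
train_labels = ["spam", "spam", "ham", "ham"]

model = train(train_texts, train_labels)

# Round-trip through pickle, as one would when saving a trained model to disk.
restored = pickle.loads(pickle.dumps(model))

print(predict(restored, "claim your free prize"))            # spam
print(predict(restored, "review the report before monday"))  # ham
```

In a real setup the tokenizer and classifier would more likely come from NLTK and Scikit-learn, with Pickle or Joblib handling persistence. The sketch only shows how the pieces fit together.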