Includes Decision Trees (DT) for interpretability Logistic Regression (LR) for baseline performance K-Nearest Neighbors (KNN) for non-linear patterns and Random Forests (RF) for robust classification. This project requires internet connection to download the dataset from Google Drive. Dataset Source: https://www.kaggle.com/datasets/purusinghvi/email-spam-classification-dataset
1. Create the virtual environment:
python -m venv venv
2. Activate the virtual environment:
venv\Scripts\activate
3. Install packages from requirements.txt:
pip install -r requirements.txt
4. Open your prefered Editor
5. Change kernel to the virtual environment
Ninis - Ryangga - Andhika