Phishing Email Classifier

Overview

I built this project to classify phishing emails based on the text of the email using machine learning. I trained a logistic regression model.

Each email has the following columns:

EDA, Feature Engineering
- Extract features from email text, visualize.
Modeling
- Apply logistic regression for classification.
- Use cross-validation to evaluate model performance and prevent overfitting.
Evaluation
- acuraccy, precision, recall, F-1 score.
Test on unlabeled data

The dataset is from a Data Science class at UC Berkeley.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
phishingemails.ipynb		phishingemails.ipynb
test.csv		test.csv