GitHub - Ashish25/ML_Spam_Detection: Machine Learning Project to build an algorithm which identifies Enron Employees who may have committed fraud based on the public Enron financial and email dataset.

Identify Fraud from Enron Email

Enron Scandal: The Fall of a Wall Street Darling

Project Overview

I played the role of detective and put my machine learning skills to use by building an algorithm that identifies Enron employees who may have committed fraud, based on the public Enron financial and email dataset.

My Report 🔗

In 2000, Enron was one of the largest companies in the United States. By 2002, it had collapsed into bankruptcy due to widespread corporate fraud. In the resulting federal investigation, a significant amount of typically confidential information entered the public record, including tens of thousands of emails and detailed financial data for top executives. In this project, you will play detective and put your new skills to use by building a person of interest (POI) identifier based on financial and email data made public as a result of the Enron scandal. To assist you in your detective work, we've combined this data with a hand-generated list of persons of interest in the fraud case, that is, individuals who were indicted, reached a settlement or plea deal with the government, or testified in exchange for prosecution immunity.

Highlights of the project:

Deal with an imperfect, real-world dataset (Class Imbalance problem)
Validate a machine learning result using test data (K-fold cross validation, SelectKBest)
Evaluate a machine learning result using quantitative metrics (accuracy, precision, recall)
Create, select and transform features (sklearn.preprocessing)
Compare the performance of a few machine learning algorithms (Naive Bayes, SVM, DecisionTree)
Tune machine learning algorithms for maximum performance
Communicate your machine learning algorithm results clearly
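The class imbalance and the accuracy, precision, and recall metrics listed above interact in a way that a small example may make clearer. The following is a minimal sketch in plain Python, not the code from `poi_id.py` or `tester.py`. The label counts (10 POIs among 100 people) and the two sets of predictions are made up for illustration and are not taken from the Enron dataset; the helper names are hypothetical.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives, treating 1 as 'POI'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy_precision_recall(y_true, y_pred):
    """Return (accuracy, precision, recall); undefined ratios become 0.0."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Assumption: illustrative labels only, 10 POIs (1) and 90 non-POIs (0).
y_true = [1] * 10 + [0] * 90

# A "classifier" that never flags anyone.
always_negative = [0] * 100

# A hypothetical model that finds 6 of the 10 POIs and wrongly flags 4 others.
some_model = [1] * 6 + [0] * 4 + [1] * 4 + [0] * 86

baseline_scores = accuracy_precision_recall(y_true, always_negative)
model_scores = accuracy_precision_recall(y_true, some_model)

print("always negative:", baseline_scores)
print("some model:     ", model_scores)
```

In this made-up setting the never-flag baseline scores high on accuracy while finding no POIs at all, which is why precision and recall are reported alongside accuracy when the positive class is rare.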

Folders and files (9 commits):
tools
License.md
MyEnronProject.pdf
Output.png
POI labels plot.png
README.md
Salary vs bonus plot.png
index.html
my_classifier.pkl
my_dataset.pkl
my_feature_list.pkl
poi_id.py
sorted_Kbest.py
tester.py
