sbsousa/LoanEDA

Loan Exploratory Data Analysis

This project is an Exploratory Data Analysis (EDA) with R. The project was created for Udacity's Data Analyst Nanodegree.

Access the final report here: https://sbsousa.github.io/LoanEDA

Project Description

Per Udacity, the goal of this project is to "use R and apply exploratory data analysis techniques to explore relationships in one variable to multiple variables and to explore a selected data set for distributions, outliers, and anomalies". Udacity asks that 10 to 15 variables be explored.

I selected the Prosper Loans data set from the list of available choices. The data set contains 113,937 loans with 81 variables on each loan.

Approach

The EDA was conducted in RStudio using several R packages, including knitr, dplyr, and ggplot2. The repository includes the R Markdown file as well as the final knitted report in HTML format.
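
As a rough illustration of how dplyr and ggplot2 fit together in this kind of analysis, here is a minimal sketch. It is not taken from the report: the data frame is a toy stand-in rather than the Udacity CSV, and the column names `Term` and `LoanOriginalAmount` are assumptions made for the example.

```r
library(dplyr)
library(ggplot2)

# Toy stand-in for the loan data (illustrative values only).
# The actual project reads the CSV provided by Udacity instead.
loans <- data.frame(
  Term = c(12, 36, 36, 60, 60, 60),
  LoanOriginalAmount = c(1000, 4000, 6000, 10000, 15000, 12000)
)

# One-variable view: distribution of loan amounts.
amount_hist <- ggplot(loans, aes(x = LoanOriginalAmount)) +
  geom_histogram(bins = 10)

# Two-variable view: loan count and median amount for each term.
amount_by_term <- loans %>%
  group_by(Term) %>%
  summarise(
    n = n(),
    median_amount = median(LoanOriginalAmount),
    .groups = "drop"
  )
```

Printing `amount_hist` draws the histogram, and `amount_by_term` holds one summary row per loan term. In an R Markdown file, chunks like these are what knitr renders into the HTML report.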

License

This project is publicly available for educational purposes. Please acknowledge this source if you use it.

Sources

The CSV file and EDA project outline were provided by Udacity:

https://www.udacity.com/course/data-analyst-nanodegree--nd002

Additional sources are acknowledged in the code and report.