Skip to content

Latest commit

 

History

History
56 lines (42 loc) · 3.02 KB

README.md

File metadata and controls

56 lines (42 loc) · 3.02 KB

Data-Science-Project

These notebooks are my data science portfolio in Python programming language. These notebooks utilize data science libraries such as Pandas, NumPy, and scikit-learn. In addition, these notebooks also include the utilization of data visualization libraries such as seaborn, matplotlib, plotly, and many more.

Project 1 : 911 Calls Capstone Project

For this project, I will be analyzing some 911 data from Kaggle. The data contains the following fields :

  • lat : String variable, Latitude
  • lng: String variable, Longitude
  • desc: String variable, Description of the Emergency Call
  • zip: String variable, Zipcode
  • title: String variable, Title
  • timeStamp: String variable, YYYY-MM-DD HH:MM:SS
  • twp: String variable, Township
  • addr: String variable, Address
  • e: String variable, Dummy variable (always 1)

I also made several changes by created a new features in the dataframe :

  • Reason
  • Hour
  • Month
  • Day of Week
  • Date

Project Highlight

911-Calls_Capstone Project 911-Calls_Capstone Project 911-Calls_Capstone Project 911-Calls_Capstone Project 911-Calls_Capstone Project 911-Calls_Capstone Project 911-Calls_Capstone Project

Project 2 : Finance Data Project

This data project will focus on exploratory data analysis of stock prices. This project is meant to practice my visualization and pandas skills, and it is not meant to be a robust financial analysis or be taken as financial advice.

I focused on the bank stocks and visualized how they progressed throughout the financial crisis all the way to early 2016.

To get the data, I used pandas datareader. I got stocks information for the following banks :

  • Bank of America
  • CitiGroup
  • Goldman Sachs
  • JPMorgan Chase
  • Morgan Stanley
  • Wells Fargo

EDA

screenshots screenshots screenshots screenshots