Twitter Data Wrangling

Data wrangling project based on Twitter data

by Sooyeon Won

Keywords

Gathering data from different sources: Flat file, URL, API
Handling Data Quality and Tidiness Issues

Summary of Findings

This project is mainly focuses on how I, as a data analyst, get the proper data. In this analysis, I collected datasets from Udacity URL, directly-downloaded flatfiles and also using Tweeter API. In the second part, I made a small report to answer the following questions based on the obtained datasets.

The popularity of each dog "stage" (i.e. doggo, floofer, pupper, and puppo)
The method of accessing to twitter
The number of counts for retweet and favorite to get insight into popularity of tweets
Relationship between retweet_count and favorite_count
The proportion of image predictions that predict dog images as the first stage

References

Getting Twitter Data in Python
Accessing the Twitter API with Python
Learn Python by analyzing Donald Trump’s tweets

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Data_Wrangling_Project_Part1.ipynb		Data_Wrangling_Project_Part1.ipynb
Data_Wrangling_Project_Part2_Report.ipynb		Data_Wrangling_Project_Part2_Report.ipynb
README.md		README.md
predictions_master.csv		predictions_master.csv
twitter_archive_master.csv		twitter_archive_master.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter Data Wrangling

by Sooyeon Won

Keywords

Summary of Findings

References

About

Releases

Packages

Languages

SooyeonWon/Twitter_data_analysis

Folders and files

Latest commit

History

Repository files navigation

Twitter Data Wrangling

by Sooyeon Won

Keywords

Summary of Findings

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages