GitHub - erinsteiner-NOAA/rHD-Data-Import-Export: First lesson in the R for Social Science and Human Dimensions Research Course. This lesson covers importing data to R workspaces from a variety of raw sources (spreadsheet-type applications, STATA & SAS formats, APIs, and Databases). Lesson also offers a brief diversions into writing data to .csv files and reading from Excel Workbooks.

The primary objective of this lesson is to show how data from spreadsheet-type applications can be loaded into R.

Instructions

This lesson will use the following .Rmd files

skill-1-Import-Spreadsheet-Data.Rmd
skill-2-Export-Spreadsheet-Data.Rmd
skill-3-More-Data-Import-Examples.Rmd
skill-4-Working-with-APIs.Rmd
skill-5-Working-with-Databases.Rmd

These files are located in the skills directory with the project.

Additionally, there is a single .Rmd file in the recipes directory:

Recipe-Build-a-Dataset.Rmd

Lesson Narrative

If you care for some insight into why I organized this lesson the way I did, it is provided here. If this does not interest you and you wish to skip this section nothing bad will happen.

In my experience Social Scientists work with spreadsheet-type data A LOT. Particularly within NMFS, I observe that many of our research projects are centered around data that we have had to collect (either by scraping the web, going to a data warehouse like the St. Louis Fed, or conducting our own primary data collection). So I see the task of getting data from a .csv file into R to be a near universally important task for this group.

I wanted to lead off the course with this lesson in order to try and make things immediately “fun.” I suppose fun is pretty subjective but I suspect we all consider ourselves “data” people. And as “data” people, I imagine that we would all rather be playing with real-life data than talking about syntax or the difference between a vector and a list.

There will be some discussion in this course about data types and structures. For this very first lesson however, I wanted to provide as much of a feeling of accomplishment and excitement as I could.

This project repository also provides lessons on accessing data via APIs and connecting R to databases.

Databases

For me, the value proposition of learning about database access is pretty simple:

There are lots of primary data sources that we work with (VMS data, Fish Tickets) that are just too big to be efficiently stored as flat files in spreadsheet-type applications.

APIs

I claim no expertise in developing data streams using APIs. In fact, “upping my API-game” is probably my biggest coding resolution for 2020/2021. I think familiarity with R-API relationships is important because:

The amount of data available through APIs is growing rapidly. A lot of data sources that Social Scientists have traditonally relied on (Census Bureau, Federal Reserve, Bureau of Labor Statistics) are already available through APIs, with new stuff coming online every day.
Connecting to databases can be hard. Like really hard. I have database connections that took weeks to set up and involved complicated coordination between multiple IT professionals at multiple organizations. In many cases, APIs can be developed to simplify access to databases that might otherwise be really hard to connect to.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
data		data
figures		figures
recipes		recipes
skills		skills
Lesson-3-More-Data-Import-Examples.html		Lesson-3-More-Data-Import-Examples.html
README.Rmd		README.Rmd
README.md		README.md
Resources-and-Readings.Rmd		Resources-and-Readings.Rmd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instructions

Contents

Code

Data

Lesson Narrative

Databases

APIs

About

Releases

Packages

Languages

erinsteiner-NOAA/rHD-Data-Import-Export

Folders and files

Latest commit

History

Repository files navigation

Instructions

Contents

Code

Data

Lesson Narrative

Databases

APIs

About

Resources

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages