In the ever-growing universe of dry academic texts, impressive and efficient graphics are quite rare. Driven by widespread software legacy issues and mostly outdated limitations imposed by traditional scientific publishers, researchers often consider producing high quality graphics as a peripheral optional task – “if time allows” (spoiler: it won’t). Yet, modern tools place data visualization in the focus of research workflows when it comes to conveying the results. Hence, the ability to turn a large dataset into an insightful visualization is an increasingly valuable skill in academia.
The course aims to empower the participants with the flexibility that the R+tidyverse framework gives to visualize data (the practical examples mostly use demographic data). The course covers some aspects of data visualization theory and best/worst practice examples, but it's primarily practice oriented including live coding sessions and short lecture/showcase parts.
Practical coding sessions start from basic introduction to tidy data manipulation and ggplot2 basics. Next, practical examples cover the creation of certain most useful types of plots. Important data visualization choices and caveats are discussed along the way. Special attention is devoted to producing geographical maps, which are no longer the luxury of professional cartographers but have turned, with the help of R, into yet another data visualization type. Going beyond ggplot2, the course presents an introduction to interactive data visualization.
Course BlueSky account: @datavizartskill.ikashnitsky.phd
- R
- Rstudio
- Git (help page)
- Github Desktop
- R {packages}
- Basic familiarity with R, consider RStudio Primers
- Basic dataviz principles (slides)
- Impressive dataviz showcases
tidyverse
: Tidy approach to data
ggplot2
basics- Colors and themes in dataviz
- Arranging and exporting plots
- Useful types of plots
- Heatmap, dotplot, ggridges, treemap, geofacet
- Interactivity:
plotly
,gganimate
- The basics of map projections (slides)
- With
geom_sf
maps become yet another type of dataviz - Useful spatial processing tricks with
rmapshaper
- dataviz challenge in teams
- brief presentations by teams and discussion
- course wrap-up
- Ugly
ggplot2
theme -- https://bit.ly/bssd24-ugly (day 2) - Any plot with own data -- https://bit.ly/bssd24-own (day 3)
- Geocoding -- https://bit.ly/bssd24-geocoding (day 4, in class)
- Maps -- https://bit.ly/bssd24-maps (day 4)
- Challenge -- https://bit.ly/bssd24-challenge (day 5)
- Oscar Baruffa's Big Book of R
- Hadley Wickham's R For Data Science
- Kyle Walker's dataviz book chapter
- Neal Grantham's TidyTuesday Rocks app
- Andy Kirk's The little of visualisation design
- Jonas Schoeley's 2017 IDEM dataviz course
- Garrick Aden-Buie's Gentle Guide to the Grammar of Graphics
- R Graph Gallery
- Yan Holtz's tool for selecting color palletes
- Hans Rosling: sample talk; Factfulness; gapminder.org
- Nathaniel Smith's talk on the principles of viridis palettes
- David Robinson's RStudio Conference 2019 talk on the immense benefits of being open and sharing stuff freely
- John Burn-Murdoch's RStudio 2021 talk
- Thread on excess deaths plots
- Thread on log transformation of the ratios
- Post on efficient RStudio layout
- Replication materials for Zarulli etal 2021 PNAS paper
- Boxplot + jitter example: tweet
- Jonas Schoeley's HMD explorer app
- US names shiny app: repository; tweet; shiny app
- Jim Vaupel's brilliant talk on the unique central positioning of demography in science
- Years of life stolen by gun shooting in the US -- dataviz by Periscopic
- NYT Giorgia Lupi's Long COVID data storytelling
- NYT (Motoko Rich, Amanda Cox, Matthew Bloch) US districts school kids performance against parent's wealthiness
This course was offered several times in past:
- Barcelona Summer School of Demography, Universitat Autònoma de Barcelona, Barcelona, Spain, 2019, 2021, 2022, 2024
- Max Planck Institute for Demographic Research & International Max Planck Research School for Population, Health and Data Science, Rostock, Germany, 2020, 2021
- Leverhulme Centre for Demographic Science, Oxford, UK 2022