A collection of images and captions to explain core data concepts

MrPowers/data-scrapbook

data-scrapbook

This repo contains a collection of images that explain core data computing concepts, covering Apache Spark, file formats, Delta Lake, and associated libraries.

Images paired with descriptive captions are a quick way to learn data computing concepts. See these pages to learn more:

  • Spark
  • Delta Lake

PySpark:

  • quinn - TODO
  • chispa
  • ceja - TODO
  • mack
  • farsante
  • unicron - TODO

Scala Spark:

  • spark-sbt.g8 - TODO
  • bebe - TODO
  • spark-daria
  • spark-fast-tests

Pandas:

  • beavis - TODO
