This repo contains a collection of images that explain core data computing concepts like Apache Spark, file formats, Delta Lake and associated libraries.
You can learn a lot about data computing with just some well designed images with descriptive captions. See these pages to learn more:
- Spark
- Delta Lake
- Chispa
- Mack
- Farsante