This repo is developed by Emanuel Fontelles.
The notebooks contained here are implementations of basic structures in PySpark.
This repository contains some of my code for working with Big Data tools from the Hadoop ecosystem, including Spark, Pig, Sqoop, and HBase.
Here we focus on PySpark, the Spark API for Python, and develop some code to exemplify some of its basic structures.
Index: