This repository is dedicated to the Big Data and Analytics course at Politecnico di Torino. It covers the following topics: Hadoop, Spark, Spark SQL, Spark MLlib, and Spark Streaming. Currently, the repository contains only the BD-Lab folder, which holds the completed lab exercises.
The repository is organized as follows:
BD-Lab
: Contains the lab exercises.
If you encounter any issues or have suggestions for improvement, feel free to open an issue or submit a pull request.
This project is licensed under the MIT License.