By Gianluca Campanella (g.campanella@estimand.com)
This repository contains teaching materials for the "Getting to grips with Databricks" tutorial presented at the Applied AI Conference 2019 in London, UK.
By the end of the session, you should be able to:
- Decide when (not) to use Spark
- Load and explore data files using PySpark
- Use Spark ML to fit and cross-validate machine learning models