A guide on how to install and run PySpark on your local machine
Using Docker is the recommended way, as it gives you a setup that is replicable across a wide variety of operating systems.
Have a look at the docker_setup folder for instructions.
If you can't use Docker, a conda-based environment is your next option.
Have a look at the anaconda_setup folder for further instructions.
Please note that the instructions differ between Windows and Mac/Linux systems.
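
Whichever setup you follow, a quick check along these lines can help confirm that PySpark actually runs on your machine. This is only a minimal sketch and is not taken from the setup folders: the function names `pyspark_available` and `run_smoke_test` and the app name `install-check` are made up for illustration, and it assumes that a Java runtime is already available to Spark.

```python
import importlib.util


def pyspark_available() -> bool:
    """Return True if the pyspark package can be found in this environment."""
    return importlib.util.find_spec("pyspark") is not None


def run_smoke_test() -> int:
    """Start a local Spark session and count the rows of a tiny DataFrame."""
    # Imported lazily so that pyspark_available() works even without PySpark.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master("local[1]")        # one local worker thread, no cluster needed
        .appName("install-check")  # hypothetical app name, pick any
        .getOrCreate()
    )
    try:
        # spark.range(5) builds a DataFrame with the ids 0..4.
        return spark.range(5).count()
    finally:
        spark.stop()  # always release the local Spark resources
```

If `pyspark_available()` returns `True`, calling `run_smoke_test()` inside the Docker container or the conda environment should return `5`. An import error or a Java-related error at that point usually means the setup steps need another look.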