The repository will include code for the project. This includes data, notebooks and reports.
At this notebook, Foursquare API was used to cluster subway station (Metro) neighborhoods in Sao Paulo regarding closely venues. The amount of venues per category in each subway station was acquired by a HTTP request to Foursquare API, and then used to group the stations into clusters. For this notebook k-means clustering algorithm was selected to complete this task. Finally, Folium library has been used to visualize the neighborhoods in Sao Paulo and their emerging clusters.
- Clone the repository
git clone git@github.com:felipetestaaa/ml-sao-paulo-subway-clustering.git
- Move inside the newly created repository
cd ml-sao-paulo-subway-clustering
- Create a virtual environment and activate it
virtualenv -p python3 .venv
- Activate the virtual environment
source .venv/bin/activate
- Install all libraries
pip install -r requirements.txt
- Start jupyter
jupyter notebook
- Deactivate environment (if created and activated before)
deactivate