Project to link national park hiking trail data with US census data
Analyzing National Park Suitability
Summary Data were pulled out from US National Park Service and US Census Bureau website to analyze the location of trails within parks and calculate population within that radius. The study aims to show the population density around the park and the suitability of the park to recreational actvities based on the weather and the infrastructure. Data analystics tools such as python, Matplotlib, Pandas along with statistical tools were used to analyse the data from the entire United States. Results show poor relation between Trail number and park size, visitor number and population density.
Assumptions
- The radius around the park for trail selection will be 2 times the circular surface area of the park.
- Any linkage with census should use a zip code radius of 60 miles (possible for a day trip)
Talking in Code Python, Pandas, Matplotlib, API and Jupyter notebook were used to explore and process the data.
GitHub Link: https://github.com/theaddies/bootcamp_project_1.git
Data Collection: API Sources Hikingproject.com (trail information) National Park Service US Census Bureau Wikipedia (National Park list check)
Methodology: The methodology consists on: Radius of park zip code set to 60 miles Locate trails within parks Calculate population within that radius Radius for trail search is 2x circular radius of park surface area
Retrieving & Cleaning Jupyter notebook (Python) were used to retrieve and clean data Zip-Code Coding! Pinpointing Park Locations & Census Data Pinpointing Park Locations & Census Data Total Pop. Per National Park Zip Code
Results Different maps and statistical analysis were done:
- Visitor Count Per Park
- Population Within 60 Miles Radius of Parks
- Visitor Count Per Park Radius
- Park Visitors vs. Surrounding Population
- Number of Trails Per National Park
- Number of Trails Per National Park
The statistical analysis show. No correlations found for:
- Trail number and park size
- Visitor number and population density 495 National Park installations
- Only 60 “National Park” properties
- Excluded monuments, trails, rivers, reserves
Coolness Factor:
- Popularity due to park characteristics → visitor ratings vs. surrounding population
Future Directions
If we had more time:
- GEOJSON
- Categorize trails by specific use
- Prices
- Park attributes
- Trail attributes
- Weather information
- Visitor demographics
- Visitor count and wildlife habitat Applications:
- No API (of which we are aware) allowing users to search exclusively on National Park trails
- Expand to a website or application allowing users to look for specific trails across parks