Himanvi Kopuri & Samidha Sane
Data Sources

Yelp is an American multinational corporation that develops, hosts and markets crowd-sourced reviews about local activities.
We initially scraped 100 data points covering trails in Pennsylvania, reviews, and addresses.

AllTrails is a platform with trail information, maps, detailed reviews, and photos curated by millions of hikers, campers, and nature lovers.
We initially scraped 958 data points in Pennsylvania, 100 in Delaware, and 996 in New Jersey with corresponding trail names, reviews, elevation, difficulty, and region.
TrailLink is a platform with trail maps, photos, reviews, and driving directions to help hikers and walkers find the best outdoor activities.
We initially scraped 133 data points with trail names and length, state, and reviews.
Methodology
1. Tableau Prep: All data was scraped with the Webscraper Chrome Extension. We then cleaned and edited the data in Tableau Prep to cluster similar regions together and to separate latitude and longitude coordinates for visualization purposes. After cleaning the data, we ended with 169 total data points.
2. Microsoft Excel: Next, we used Microsoft Excel to split certain strings into two different columns using LEFT, RIGHT, and SEARCH functions to get drive times and distance from campus in a standardized format.
3. Tableau Desktop: Using Tableau Desktop to merge data, we created visualizations based on the most popular concerns or preferences that Penn students might have when it comes to planning hikes. For example—proximity, quality, difficulty, and length of trail.