Saturday, February 7, 2015

MLB Ballparks & Total Population within 30 Minutes

After reading "The Importance of the 30 minute Population Radius on MLB Attendance" on Fangraphs - http://www.fangraphs.com/community/the-importance-of-the-30-minute-population-radius-on-mlb-attendance/ I decided 2 things:

1.  I would use Geographic Information Systems (GIS) to run this process using more current US Census Bureau data (2010 as opposed to 2000).

2.  I decided to create a blog to share this information.  I plan on running similar analyses in the future and will take a deeper look at demographics and the relationships between Geography and Baseball.  

An interactive map displaying the results can be found here:  http://bit.ly/1EOz6B5

Table with results:  https://docs.google.com/spreadsheets/d/1oI-uAU0CjxNgtAEieNOUmayXJemIw3e_WXdivojQ2IU/edit#gid=0

The numbers are quite a bit higher than the original study referenced in the Fangraphs article.  Part of this might be due to the fact that I did not use rush hour traffic as a variable. I might take that into consideration the next time I run this analysis.  Using ArcGIS and Network Analyst I used the process below:

1.  Identified the locations of all 29 ball parks.
2.  Using Network Analyst, I used the Generate Service Areas tool to create a 30 minute drive time polygon to the identified ballparks.  For the most part I used the default settings, and these polygons do not account for rush hour traffic.
3.  Using the Union tool, I joined the 2010 Census data with the polygon drive time polygons.
4.  I then subtracted out the FIDs with -1 to remove the polygons outside of the 30 minute distance.
5.  Used the Field Calculator to obtain population totals.