Project title
Preregistration of analyses
Analysis #1
We will determine if we can reliably use population size as an indicator of whether a county is rural or urban by comparing population size of counties with the percentage of county total population that lives over half a mile from a grocery store. We will use a scatter plot to visualize this relationship, with the percentage of total population that lives over half a mile from the grocery store on the y axis, and the total population size on the x axis. Data from all counties in New York will be used for this analysis. We will plot a linear regression of the trend between the two variables. We hypothesize that there will be a negative correlation between the percentage of the population living over half a mile from a grocery store and the total population size. If our hypothesis is correct, this is enough to assume that low population counties can be considered rural for the purposes of this study. If our hypothesis is incorrect or inconclusive, we will not make this assumption.
Analysis #2
We will analyze the relationship between total low access population and distance from grocery stores of the largest 10 counties and smallest 10 counties in New York. We will use a stacked bar chart where each stack of the bar represents the population of low access people of a certain distance (1/2 mile, 1 mile, 5 mile etc.) and each bar represents a county in New York. There will be two graphs to differentiate the largest 10 and smallest 10 counties based on total population size. We hypothesize that there will be more people that are farther away from grocery stores in the smallest 10 counties compared to the largest 10 counties in New York.