Project title

Preregistration of analyses

Analysis #1

Research question: Does Fantano generally give higher scores than Pitchfork?

When recontextualized, this research question can be stated as: is the source of a review a significant factor in determining the review’s score?

To perform this analysis, we will join the data frames by album name (leaving only albums both sources have reviewed), as well as cut out every variable that isn’t album name or score. We will then investigate which source tends to rate albums higher overall, and perform hypothesis testing to determine if any difference we find is likely to be explained by chance, or if the source of the review is a driving factor behind the review score.

Hypotheses:

Null Hypothesis: The score of a review is independent of what source the review originates from.

Alternative Hypothesis: The score of a review is not independent from what source it originates from. Any difference observed is not due to natural variability, but rather due to Fantano or Pitchfork having different standards.

Analysis #2

Research Question: What’s the difference in pitchfork people’s taste over time for folk/country music and rock music?

To perform the analysis, we focus on release year, genre, and score, while only keep observation for folk/country and rock genre in pitchfork data frame. We will do preliminary analysis by making a plot to compare the score V.S. year and faceting on the two genres. Then we will perform the hypothesis test to determine if the difference between average score of the two genres over the years is significant.

Null hypothesis: The difference between average score of the two genres over the years is insignificant.

Alternative hypothesis: The difference between average score of the two genres over the years is significant.