Brewed to Perfection

A Case Study on Coffee Rating and Other Factors

Skillful Hitmontop
Yvie Cai, Amy Zhang, Cathryn Li, David Yang, Serena Zhao

5/4/23

Introduce the data

We are using the data set retrieved from https://think.cs.vt.edu/corgis/csv/coffee/. The original creator of this data set is Buzzfeed Data Scientist James LeDoux who collected the data from the Coffee Quality Institute’s review pages in January 2018.

Research Question: What factors affect the rating of coffee?

Exploratory Analysis & Data Cleaning

Analysis #1:

We aim to test whether variables altitude and country have an impact on the rating scores of coffee. We will be using coffee rating as the dependent variable and country and altitude as independent variables.

\[ H_0: There~is~not~a~significant~relationship~between~\\altitude~,country,~and~coffee~rating. \]

\[ H_a: There~is~a~significant~relationship~between~\\altitude~,country,~and~coffee~rating \]

We got a p-value that is lower than the significance level based on our additive and interactive model. There is a significant interaction between altitude, country, and coffee rating. Upon further analysis, we examined altitude alone and got a p-value that is higher than the significance level, which means it does not have a relationship with the coffee rating.

Analysis #2:

We are testing for whether the processing and storage method of the coffee beans will have an impact on the rating scores of the coffee. We will use coffee rating as the dependent variable and processing method as the independent variable.

\[ H_0: There~is~not~a~significant~relationship~between~processing~method~and~coffee~rating. \]

\[ H_a: There~is~a~significant~relationship~between~processing~method~and~coffee~rating \]

The p values for processing methods are lower than 0.02, so we can reject the null hypothesis. Interestingly, only the Washed / Wet processing method and Other methods have a significant impact on the coffee rating. The remaining methods are mostly attributed to randomness.

Conclusions + future work

Based on our analysis, we came to the conclusion that while altitude has no effect on ratings, a few countries and certain processing methods have an impact on ratings. There is no interaction between the countries and the altitude.

----

Limitations of our analysis:

  • We are not able to determine if the coffee has a more sour flavor or bitter flavor.

  • All the flavors are rated on the numerical scale and flavor is usually a preference thing for users so the numerical scales only a reflection of the reviewers’ taste.