Project title

Preregistration of analyses

Analysis #1

Research Question: Are individuals more likely to abuse one substance over another based on their age?

Hypothesis: Older age groups will more likely abuse more “hardcore” drugs over younger age groups.

Null Hypothesis: There is no significant relationship between age and the likelihood of abusing one substance over another. In the equation, ρ is the population correlation coefficient between age and the likelihood of abusing one substance over another.

\[ H_0: p = 0 \]

Alternative Hypothesis: There is a significant relationship between age and the likelihood of abusing one substance over another.

\[ H_A: p \neq 0 \]

Detailed Analysis Plan:

  • We cleaned our data using inner_join and mutate in order to create the data set that we desire.

  • We plan on furthering our descriptive analysis for this question through numerous ways.

    • we can look at the distribution of substance abuse across all of the different age groups. We can utilize visual analysis (histogram, scatterplots, or other forms of frequency distribution) in order to visualize the substance abuse data for each age group

    • we can calculate the summary statistics; this includes the mean, median, standard deviation, and other things that may be useful. This will help us identify important patterns and trends in the data

    • other forms of visual analysis can be used such as box plots (this will help us identify outliers or the spread of the data between each individual age group which is something that the histogram is not very efficient at doing)

  • Inferential Analysis: we plan on conducting a hypothesis test and utilize regression analysis in order to determine if the relationship between age groups and substance abuse is statistically significant

Data Source:

  • The Drugs data set from CORGIS, which was compiled from data from the National Survey on Drug Use and Health (NSDUH).

Analysis #2

Research Question: Are individuals, specifically adolescents, more likely to abuse marijuana as legalization efforts succeed throughout the country, and does marijuana abuse increase in states after it is legalized?

Hypothesis: Adolescents are more likely to abuse marijuana in response to legalization efforts, and overall, marijuana abuse is more likely to increase after state legalization.

Null Hypothesis: There is no significant relationship between legalization efforts and marijuana abuse. In the equation ρ is the correlation coefficient between legalization and marijuana abuse.

\[ H_0: p = 0 \]

Alternative Hypothesis: There is a significant relationship between between legalization efforts and marijuana abuse.

\[ H_A: p \neq 0 \]

Detailed Analysis Plan:

  • We have previously done some data tidying, such as joining data frames with both the information on drug use as well as the information on legal status of marijuana by state, which will allow us to easily analyze this question

  • We will conduct a hypothesis test to determine whether or not marijuana use amongst adolescents increases with legalization. The null hypothesis will be that the probability is equal for those in states without legalization and those with, and the alternative hypothesis is that the rates of marijuana use are higher amongst adolescents in states that have legalized marijuana.

  • We want to create visualizations that will reflect what our research question is asking. This may include a bar graph showing the rates of marijuana usage amongst different age groups, separated by legalization status. We may also include a line graph that shows the change in marijuana usage over time amongst adolescents, particularly before and after legalization.

Data Source:

  • The Drugs data set from CORGIS, which was compiled from data from the National Survey on Drug Use and Health (NSDUH).

  • The State Marijuana Laws (2019) data compiled by Selene Arrazolo.