a) Proportion of Users:
The total number of people in the sample is $63 + 357 = 420$. The proportion of users is:
$$ \text{Proportion} = \frac{\text{Number of Users}}{\text{Total Sample Size}} = \frac{63}{420} = 0.15 $$
So, 15% of the sample reported using recreational marijuana.
b) Two-Sample T-test:
The null hypothesis is $H_0: \mu_{users} = \mu_{non-users}$. The alternative is $H_1: \mu_{users} \neq \mu_{non-users}$.
$$ t = \frac{\bar{x}_{users} - \bar{x}_{non-users}}{\sqrt{\frac{s_{users}^2}{n_{users}} + \frac{s_{non-users}^2}{n_{non-users}}}} = \frac{15.6 - 16.3}{\sqrt{\frac{7.1^2}{63} + \frac{7.4^2}{357}}} $$
$$ t = \frac{-0.7}{\sqrt{\frac{50.41}{63} + \frac{54.76}{357}}} = \frac{-0.7}{\sqrt{0.8001 + 0.1534}} = \frac{-0.7}{\sqrt{0.9535}} = \frac{-0.7}{0.976} \approx -0.717 $$
Since the absolute value of our test statistic, |-0.717|, is less than the critical value of 1.96, we
fail to reject the null hypothesis. There is not sufficient evidence to conclude a difference in mean PSS scores between users and non-users.
c) Regression Model:
Using the indicator variable $MJ=1$ for recreational marijuana users and $MJ=0$ for non-users, the model is $E[PSS] = \beta_0 + \beta_1 \times MJ$.
- The intercept, $\beta_0$, is the mean PSS for the reference group (non-users, where $MJ=0$). So, $\mathbf{\beta_0 = 16.3}$.
- The slope, $\beta_1$, is the difference in means between the two groups (Users - Non-users). So, $\mathbf{\beta_1 = 15.6 - 16.3 = -0.7}$.
The fitted equation is: $\mathbf{E[PSS] = 16.3 - 0.7 \times MJ}$.
d) Testing with the Regression Model:
To test for a difference in mean PSS scores using this model, you would perform a hypothesis test on the slope parameter, $\beta_1$. The null hypothesis would be $H_0: \beta_1 = 0$, which states that there is no difference in mean PSS between the groups. The alternative would be $H_1: \beta_1 \neq 0$. This test is mathematically equivalent to the two-sample t-test performed in part (b).