Introduction
This post is for the week 2 assignment of the Coursera course Data Analysis Tools by Wesleyan University. It’s the 2nd course for the Data Analysis and Interpretation specialization.
I will use the NESARC dataset. This time, I will use Chi-Square to examine whether there is a relationship between region and depression. Since I have four levels, I will also use Bonferroni Adjustment to make pair comparisons.
Explanatory variable:
REGION
————-
1. Northeast
2. Midwest
3. South
4. West
Response variable:
MAJORDEPLIFE
label: MAJOR DEPRESSION – LIFETIME (NON-HIERARCHICAL)
———————————————-
0. No
1. Yes
CODE
Results
A chi-square test of independence was conducted to examine the relationship between region and depression among American adults. The results indicated that life-time occurrence of major depression was significantly associated with region, X²(3)=46.70, p < .0001.
Post hoc comparisons of major depression prevalence by pairs of region categories revealed that the prevalence of major depression was significantly higher for those who were in the midwest (20.06%) compared to Northeast (17.74%) and South (16.85%). In addition, the prevalence of major depression in the South (16.85%) was significantly lower than that in the West (19.06%).