Depression Incidence and Vitamin D Concentration in NHANES Survey Data
A Case Study in Collaboration and Causal Inference
Julia Piaskowski & Yimin Chen
October 10, 2024
https://jpiaskowski.gitlab.io/talks/nhanes-magic/
Statistical Consulting
Frequently
linear model -> ANOVA -> p-values
correlation -> coefficient of correlation
t-test -> p-values
test every column in a spreadsheet
Problems with this Approach
Single-minded focus on p-values and/or correlation at the expense of understanding systems or correctly reporting results.
An underlying assumption that statistics (magically) extracts meaningful results.
Fishing for any pairwise association without fully considering what that relationship implies.
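To make the "test every column" problem concrete, here is a minimal simulation sketch, not an analysis from this study. It assumes a hypothetical spreadsheet of 200 rows and 100 columns in which every column, and the outcome itself, is pure random noise. It uses the large-sample approximation that |r| > 1.96 / sqrt(n) corresponds to p < 0.05.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

# Hypothetical spreadsheet: 200 people, 100 unrelated columns (all assumptions)
n_rows, n_cols = 200, 100
outcome = rng.normal(size=n_rows)            # outcome is pure noise
columns = rng.normal(size=(n_rows, n_cols))  # every "predictor" is pure noise too

# Pearson correlation of each column with the outcome
centered = columns - columns.mean(axis=0)
out_c = outcome - outcome.mean()
r = (centered.T @ out_c) / (np.linalg.norm(centered, axis=0) * np.linalg.norm(out_c))

# Large-sample approximation: |r| > 1.96 / sqrt(n) is roughly p < 0.05 (two-sided)
threshold = 1.96 / np.sqrt(n_rows)
n_significant = int(np.sum(np.abs(r) > threshold))

print(f"{n_significant} of {n_cols} noise columns look 'significant' at p < 0.05")
```

On average about 5 of every 100 such tests cross the threshold even though no real relationship exists, which is why an unplanned sweep of pairwise tests will nearly always "find" something.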
Case Study of (Maybe) a Better Approach
Depression During Pregnancy and Postpartum
New moms experience new stress
Antepartum depression associated with stunted infant growth
Postpartum depression associated with child behavioral issues and developmental challenges
Vitamin D concentration negatively associated with depression, although literature is inconsistent
Vitamin D Mechanism
Potential protective mechanism through serotonin, the “happy” neurotransmitter
Serotonin has a consistent negative association with depression, anxiety, etc.
The brain makes serotonin from tryptophan, a conversion that requires vitamin D to activate a transcription factor
Vitamin D inhibits monoamine oxidase (which breaks down neurotransmitters)
Vitamin D inhibits serotonin reuptake transporters (which terminate serotonin signaling)
NHANES
The National Health and Nutrition Examination Survey (NHANES) is a program of studies designed to assess the health and nutritional status of adults and children in the United States. The survey is unique in that it combines interviews and physical examinations. -CDC Website
NHANES
Survey run by the CDC since the early 1960s; now samples ~5,000 individuals/year, with data released in two-year cycles
Results intended to reflect diversity of the U.S. population.
Allows assessment of nutritional status associated with health promotion & disease prevention
Widely used as a large epidemiological data set that’s representative of the entire US population
We used data from 2007-2018 because the assay protocol for Vitamin D was consistent in that time period
Define What You Want to Estimate
Our Estimand
The impact of vitamin D on depression risk in pregnant and postpartum women
AND
The differential impact of vitamin D on depression risk in postpartum women, stratified by breastfeeding status
Understand the System
Consider possible confounders that prevent proper estimation of an estimand.
Confounders are things that influence both the ‘exposure’ and the outcome
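The definition of a confounder can be illustrated with a small simulation. This is a sketch on made-up data, not the NHANES analysis: `confounder`, `exposure`, and `outcome` are hypothetical variables, and the true effect of the exposure on the outcome is set to zero by construction.

```python
import numpy as np

rng = np.random.default_rng(seed=42)
n = 50_000

# Hypothetical data-generating process (all assumptions):
# the confounder drives both the exposure and the outcome.
confounder = rng.normal(size=n)
exposure = confounder + rng.normal(size=n)
true_effect = 0.0  # the exposure has no causal effect on the outcome
outcome = true_effect * exposure + confounder + rng.normal(size=n)

def ols_coefs(y, *predictors):
    """Ordinary least squares with an intercept; returns [intercept, slopes...]."""
    X = np.column_stack([np.ones(len(y)), *predictors])
    coefs, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coefs

unadjusted = ols_coefs(outcome, exposure)[1]            # ignores the confounder
adjusted = ols_coefs(outcome, exposure, confounder)[1]  # conditions on it

print(f"unadjusted slope: {unadjusted:.3f}")
print(f"adjusted slope:   {adjusted:.3f}")
```

Under this setup the unadjusted slope lands near 0.5 even though the true effect is 0, while the adjusted slope lands near 0. The apparent association comes entirely from the shared cause.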
Avoid the ‘Causal Salad’
Depression = Vitamin D + Food Security + Diet + Income/Poverty Ratio + BMI + Data Collection Season + Marital Status + Education + Race
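One reason a "salad" model like this can mislead is that not every available covariate is a confounder. The sketch below uses simulated data to show one such failure, collider bias. The variables are hypothetical and are not claimed to correspond to any covariate listed above: `exposure` and `outcome` are independent by construction, and `collider` is caused by both.

```python
import numpy as np

rng = np.random.default_rng(seed=7)
n = 50_000

# Hypothetical data-generating process (all assumptions):
# exposure and outcome are unrelated, but both influence the collider.
exposure = rng.normal(size=n)
outcome = rng.normal(size=n)
collider = exposure + outcome + rng.normal(size=n)

def ols_coefs(y, *predictors):
    """Ordinary least squares with an intercept; returns [intercept, slopes...]."""
    X = np.column_stack([np.ones(len(y)), *predictors])
    coefs, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coefs

without_collider = ols_coefs(outcome, exposure)[1]
with_collider = ols_coefs(outcome, exposure, collider)[1]

print(f"exposure slope, collider left out: {without_collider:.3f}")
print(f"exposure slope, collider included: {with_collider:.3f}")
```

Leaving the collider out gives a slope near 0, the correct answer here. Including it produces a slope near -0.5, an association created purely by the adjustment. Deciding which covariates belong in the model therefore depends on the assumed causal structure, not on which columns happen to be in the data set.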