Do the PHQ-8 and the PHQ-2 accurately screen for depressive disorders in a sample of pregnant women?

Abstract Objective The aim of this study was to assess the psychometric properties of the Patient Health Questionnaire (PHQ)-8, and the PHQ-2, a two-item version of the PHQ, respectively, in pregnancy. These screeners were compared to a structured diagnostic interview in a cohort of pregnant women a...

Full description

Saved in:
Bibliographic Details
Published inGeneral hospital psychiatry Vol. 32; no. 5; pp. 544 - 548
Main Authors Smith, Megan V., Dr.P.H. M.P.H, Gotman, Nathan, M.S, Lin, Haiqun, M.D., Ph.D, Yonkers, Kimberly A., M.D
Format Journal Article
LanguageEnglish
Published New York, NY Elsevier Inc 01.09.2010
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Objective The aim of this study was to assess the psychometric properties of the Patient Health Questionnaire (PHQ)-8, and the PHQ-2, a two-item version of the PHQ, respectively, in pregnancy. These screeners were compared to a structured diagnostic interview in a cohort of pregnant women attending prenatal care. B ased upon studies documenting high sensitivity and specificity on the PHQ-8 and PHQ-2 in the general adult population, we hypothesized that both instruments would be effective in this population. Methods Two hundred eighteen women, 13 of them depressed, were given the Composite International Diagnostic Interview and the PHQ-8 before 17 weeks of pregnancy. Receiver Operating Characteristic curves determined optimal thresholds and sensitivity and specificity were calculated using both dimensional and categorical approaches. Agreement between the PHQ-2 and PHQ-8 was measured using Cohen's kappa. Results Optimal cutoffs for the PHQ-8 and PHQ-2 were 11 and 4, respectively. Using these cutoffs, the PHQ-8 had a sensitivity of 77% and a specificity of 62% while the PHQ-2 had a sensitivity of 62% and a specificity of 79%. The categorical method of scoring the PHQ-8 yielded a sensitivity of 54% and a specificity of 84%. Conclusions In our sample, the PHQ-8 and PHQ-2 performed almost equally in detecting probable major depressive disorder in a sample of pregnant women. The categorical scoring method for the PHQ-8 had lower sensitivity but slightly higher specificity than the dimensional version. We found the PHQ-8 and PHQ-2 to have lower sensitivity and specificity in our pregnant population as compared to findings in nonpregnant populations; however, characteristics of our sample and choice of diagnostics instrument could explain these discrepant findings.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0163-8343
1873-7714
DOI:10.1016/j.genhosppsych.2010.04.011