Between‐ and Within‐Cluster Spearman Rank Correlations

ABSTRACT Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with clustered data. There are three commonly used Pearson...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 44; no. 3-4; pp. e10326 - n/a
Main Authors Tu, Shengxin, Li, Chun, Shepherd, Bryan E.
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 10.02.2025
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text
ISSN0277-6715
1097-0258
1097-0258
DOI10.1002/sim.10326

Cover

Loading…
More Information
Summary:ABSTRACT Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with clustered data. There are three commonly used Pearson correlation coefficients (total, between‐, and within‐cluster), which together provide an enriched perspective of the correlation. However, these Pearson correlation coefficients are sensitive to extreme values and skewed distributions. They also vary with data transformation, which is arbitrary and often difficult to choose, and they are not applicable to ordered categorical data. Current nonparametric correlation measures for clustered data are only for the total correlation. Here we define population parameters for the between‐ and within‐cluster Spearman rank correlations. The definitions are natural extensions of the Pearson between‐ and within‐cluster correlations to the rank scale. We show that the total Spearman rank correlation approximates a linear combination of the between‐ and within‐cluster Spearman rank correlations, where the weights are functions of rank intraclass correlations of the two random variables. We also discuss the equivalence between the within‐cluster Spearman rank correlation and the covariate‐adjusted partial Spearman rank correlation. Furthermore, we describe estimation and inference for the three Spearman rank correlations, conduct simulations to evaluate the performance of our estimators, and illustrate their use with data from a longitudinal biomarker study and a clustered randomized trial.
Bibliography:Funding
This study was supported by the National Institutes of Health, K23AI120875, P30AI110527, R01AI093234, R01MH113478.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Funding: This study was supported by the National Institutes of Health, K23AI120875, P30AI110527, R01AI093234, R01MH113478.
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.10326