Between‐ and Within‐Cluster Spearman Rank Correlations

ABSTRACT Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with clustered data. There are three commonly used Pearson...

Full description

Saved in:

Bibliographic Details
Published in	Statistics in medicine Vol. 44; no. 3-4; pp. e10326 - n/a
Main Authors	Tu, Shengxin, Li, Chun, Shepherd, Bryan E.
Format	Journal Article
Language	English
Published	Hoboken, USA John Wiley & Sons, Inc 10.02.2025 Wiley Subscription Services, Inc
Subjects	Cluster Analysis clustered data Computer Simulation Correlation of Data Data Interpretation, Statistical Humans Models, Statistical nonparametric correlation measures Random variables rank association measures Statistics, Nonparametric clustered data rank association measures nonparametric correlation measures
Online Access	Get full text
ISSN	0277-6715 1097-0258 1097-0258
DOI	10.1002/sim.10326

Cover

Loading…

More Information
Summary:	ABSTRACT Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with clustered data. There are three commonly used Pearson correlation coefficients (total, between‐, and within‐cluster), which together provide an enriched perspective of the correlation. However, these Pearson correlation coefficients are sensitive to extreme values and skewed distributions. They also vary with data transformation, which is arbitrary and often difficult to choose, and they are not applicable to ordered categorical data. Current nonparametric correlation measures for clustered data are only for the total correlation. Here we define population parameters for the between‐ and within‐cluster Spearman rank correlations. The definitions are natural extensions of the Pearson between‐ and within‐cluster correlations to the rank scale. We show that the total Spearman rank correlation approximates a linear combination of the between‐ and within‐cluster Spearman rank correlations, where the weights are functions of rank intraclass correlations of the two random variables. We also discuss the equivalence between the within‐cluster Spearman rank correlation and the covariate‐adjusted partial Spearman rank correlation. Furthermore, we describe estimation and inference for the three Spearman rank correlations, conduct simulations to evaluate the performance of our estimators, and illustrate their use with data from a longitudinal biomarker study and a clustered randomized trial.
Bibliography:	Funding This study was supported by the National Institutes of Health, K23AI120875, P30AI110527, R01AI093234, R01MH113478. ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Funding: This study was supported by the National Institutes of Health, K23AI120875, P30AI110527, R01AI093234, R01MH113478.
ISSN:	0277-6715 1097-0258 1097-0258
DOI:	10.1002/sim.10326