Between‐ and Within‐Cluster Spearman Rank Correlations
ABSTRACT Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with clustered data. There are three commonly used Pearson...
Saved in:
Published in | Statistics in medicine Vol. 44; no. 3-4; pp. e10326 - n/a |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Hoboken, USA
John Wiley & Sons, Inc
10.02.2025
Wiley Subscription Services, Inc |
Subjects | |
Online Access | Get full text |
ISSN | 0277-6715 1097-0258 1097-0258 |
DOI | 10.1002/sim.10326 |
Cover
Loading…
Summary: | ABSTRACT
Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with clustered data. There are three commonly used Pearson correlation coefficients (total, between‐, and within‐cluster), which together provide an enriched perspective of the correlation. However, these Pearson correlation coefficients are sensitive to extreme values and skewed distributions. They also vary with data transformation, which is arbitrary and often difficult to choose, and they are not applicable to ordered categorical data. Current nonparametric correlation measures for clustered data are only for the total correlation. Here we define population parameters for the between‐ and within‐cluster Spearman rank correlations. The definitions are natural extensions of the Pearson between‐ and within‐cluster correlations to the rank scale. We show that the total Spearman rank correlation approximates a linear combination of the between‐ and within‐cluster Spearman rank correlations, where the weights are functions of rank intraclass correlations of the two random variables. We also discuss the equivalence between the within‐cluster Spearman rank correlation and the covariate‐adjusted partial Spearman rank correlation. Furthermore, we describe estimation and inference for the three Spearman rank correlations, conduct simulations to evaluate the performance of our estimators, and illustrate their use with data from a longitudinal biomarker study and a clustered randomized trial. |
---|---|
Bibliography: | Funding This study was supported by the National Institutes of Health, K23AI120875, P30AI110527, R01AI093234, R01MH113478. ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Funding: This study was supported by the National Institutes of Health, K23AI120875, P30AI110527, R01AI093234, R01MH113478. |
ISSN: | 0277-6715 1097-0258 1097-0258 |
DOI: | 10.1002/sim.10326 |