Linear regression and its inference on noisy network‐linked data

Linear regression on network‐linked observations has been an essential tool in modelling the relationship between response and covariates with additional network structures. Previous methods either lack inference tools or rely on restrictive assumptions on social effects and usually assume that netw...

Full description

Saved in:
Bibliographic Details
Published inJournal of the Royal Statistical Society. Series B, Statistical methodology Vol. 84; no. 5; pp. 1851 - 1885
Main Authors Le, Can M., Li, Tianxi
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 01.11.2022
Subjects
Online AccessGet full text
ISSN1369-7412
1467-9868
DOI10.1111/rssb.12554

Cover

More Information
Summary:Linear regression on network‐linked observations has been an essential tool in modelling the relationship between response and covariates with additional network structures. Previous methods either lack inference tools or rely on restrictive assumptions on social effects and usually assume that networks are observed without errors. This paper proposes a regression model with non‐parametric network effects. The model does not assume that the relational data or network structure is exactly observed and can be provably robust to network perturbations. Asymptotic inference framework is established under a general requirement of the network observational errors, and the robustness of this method is studied in the specific setting when the errors come from random network models. We discover a phase‐transition phenomenon of the inference validity concerning the network density when no prior knowledge of the network model is available while also showing a significant improvement achieved by knowing the network model. Simulation studies are conducted to verify these theoretical results and demonstrate the advantage of the proposed method over existing work in terms of accuracy and computational efficiency under different data‐generating models. The method is then applied to middle school students' network data to study the effectiveness of educational workshops in reducing school conflicts.
Bibliography:Funding information
College and Graduate School of Arts and Sciences, Grant/Award Number: 3Caverliers Award; National Science Foundation, DMS‐2015134; Grant/Award Numbers: DMS‐2015298
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1369-7412
1467-9868
DOI:10.1111/rssb.12554