Bayesian Spanning Tree: Estimating the Backbone of the Dependence Graph
In multivariate data analysis, it is often important to estimate a graph characterizing dependence among (p) variables. A popular strategy uses the non-zero entries in a (p\times p) covariance or precision matrix, typically requiring restrictive modeling assumptions for accurate graph recovery. To i...
Saved in:
Published in | arXiv.org |
---|---|
Main Authors | , |
Format | Paper |
Language | English |
Published |
Ithaca
Cornell University Library, arXiv.org
30.06.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | In multivariate data analysis, it is often important to estimate a graph characterizing dependence among (p) variables. A popular strategy uses the non-zero entries in a (p\times p) covariance or precision matrix, typically requiring restrictive modeling assumptions for accurate graph recovery. To improve model robustness, we instead focus on estimating the {\em backbone} of the dependence graph. We use a spanning tree likelihood, based on a minimalist graphical model that is purposely overly-simplified. Taking a Bayesian approach, we place a prior on the space of trees and quantify uncertainty in the graphical model. In both theory and experiments, we show that this model does not require the population graph to be a spanning tree or the covariance to satisfy assumptions beyond positive-definiteness. The model accurately recovers the backbone of the population graph at a rate competitive with existing approaches but with better robustness. We show combinatorial properties of the spanning tree, which may be of independent interest, and develop an efficient Gibbs sampler for Bayesian inference. Analyzing electroencephalography data using a Hidden Markov Model with each latent state modeled by a spanning tree, we show that results are much more interpretable compared with popular alternatives. |
---|---|
AbstractList | In multivariate data analysis, it is often important to estimate a graph characterizing dependence among (p) variables. A popular strategy uses the non-zero entries in a (p\times p) covariance or precision matrix, typically requiring restrictive modeling assumptions for accurate graph recovery. To improve model robustness, we instead focus on estimating the {\em backbone} of the dependence graph. We use a spanning tree likelihood, based on a minimalist graphical model that is purposely overly-simplified. Taking a Bayesian approach, we place a prior on the space of trees and quantify uncertainty in the graphical model. In both theory and experiments, we show that this model does not require the population graph to be a spanning tree or the covariance to satisfy assumptions beyond positive-definiteness. The model accurately recovers the backbone of the population graph at a rate competitive with existing approaches but with better robustness. We show combinatorial properties of the spanning tree, which may be of independent interest, and develop an efficient Gibbs sampler for Bayesian inference. Analyzing electroencephalography data using a Hidden Markov Model with each latent state modeled by a spanning tree, we show that results are much more interpretable compared with popular alternatives. |
Author | Dunson, David B Duan, Leo L |
Author_xml | – sequence: 1 givenname: Leo surname: Duan middlename: L fullname: Duan, Leo L – sequence: 2 givenname: David surname: Dunson middlename: B fullname: Dunson, David B |
BookMark | eNqNjEEOgjAURBujiajcoYlrEmjBqksUcS97UvEjoP7Wtiy8vWg8gKvJvHmZGRmjQhgRj3EeBeuYsSnxre3CMGQrwZKEeyRP5QtsK5GetERs8UoLA7ClmXXtQ7oPcA3QVFa38_BGVf3te9CAF8AKaG6kbhZkUsu7Bf-Xc7I8ZMXuGGijnj1YV3aqNzhMJUtiEQkhNpz_Z70B9LE8pA |
ContentType | Paper |
Copyright | 2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
Copyright_xml | – notice: 2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College ProQuest Central Korea SciTech Premium Collection ProQuest Engineering Collection Engineering Database Publicly Available Content Database ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection |
DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest One Academic Engineering Collection |
DatabaseTitleList | Publicly Available Content Database |
Database_xml | – sequence: 1 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 2331-8422 |
Genre | Working Paper/Pre-Print |
GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
ID | FETCH-proquest_journals_25471777933 |
IEDL.DBID | 8FG |
IngestDate | Thu Oct 10 17:51:23 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-proquest_journals_25471777933 |
OpenAccessLink | https://www.proquest.com/docview/2547177793?pq-origsite=%requestingapplication% |
PQID | 2547177793 |
PQPubID | 2050157 |
ParticipantIDs | proquest_journals_2547177793 |
PublicationCentury | 2000 |
PublicationDate | 20210630 |
PublicationDateYYYYMMDD | 2021-06-30 |
PublicationDate_xml | – month: 06 year: 2021 text: 20210630 day: 30 |
PublicationDecade | 2020 |
PublicationPlace | Ithaca |
PublicationPlace_xml | – name: Ithaca |
PublicationTitle | arXiv.org |
PublicationYear | 2021 |
Publisher | Cornell University Library, arXiv.org |
Publisher_xml | – name: Cornell University Library, arXiv.org |
SSID | ssj0002672553 |
Score | 3.3464668 |
SecondaryResourceType | preprint |
Snippet | In multivariate data analysis, it is often important to estimate a graph characterizing dependence among (p) variables. A popular strategy uses the non-zero... |
SourceID | proquest |
SourceType | Aggregation Database |
SubjectTerms | Backbone Bayesian analysis Combinatorial analysis Covariance Data analysis Electroencephalography Estimation Graph theory Markov chains Multivariate analysis Robustness Statistical inference |
Title | Bayesian Spanning Tree: Estimating the Backbone of the Dependence Graph |
URI | https://www.proquest.com/docview/2547177793 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LS8NAEB60QfDmEx-1LOg1aF5b40WIJilCS9EKvZXdzeQiNDGpBy_-dmfWVA9Cj7MLu-wyO6-dmQ_gylcMYK3RlaWJ3LCUXKwsIzdWGoMgLuMi5trh8USOXsOneTTvAm5tl1a5lolWUBeV4Rj5NTky5HkMiZ3u63eXUaP4d7WD0NgGx-NOeFwpnuW_MRZfDsliDv6JWas7sj1wpqrGZh-2cHkAOzbl0rSHkCfqE7mEUbzUP8BBYtYg3omUXh3bkTRA1plIlHnT1RJFVVr6sYOtNShy7jZ9BJdZOnsYuevdFx1_tIu_0wTH0CNHH09AMNib9pRR3o0JizLWoU8KOFLKkBETqdtT6G9a6Wzz9Dns-pyPYXPd-tBbNR94QQp1pQf21gbgJOlk-kzU-Cv9Bo1qgC0 |
link.rule.ids | 786,790,12792,21416,33406,33777,43633,43838 |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NS8MwFH9oi-jNT9RNDei16Pq11YtQ7VZ1K0Mr7FaS9PUyWGs7D_vvzauZHoQdk0BCwsv7yu_lB3BjcyKwFmj5hfQst_CpWNn3rIALdJygCPKAaocniR9_uC8zb6YTbo2GVa51Yquo81JSjvxWBTIq8ugrcXqoPi1ijaLXVU2hsQ0mfbk5MMAMo2T69ptlsf2-8pmdf4q2tR7DfTCnvML6ALZwcQg7LehSNkcwCvkKqYiRvVc_1EEsrRHvWaTuHXmSqkP5Zyzkci7KBbKyaNtPmrhWIhvRf9PHcD2M0sfYWq-eaQlpsr_9OCdgqFAfT4ER3Zvoccl7d9LNi0C4tjLBHudSuTEeH5xBd9NM55uHr2A3TifjbPycvHZgzyZ0Rot864KxrL_wQpnXpbjUZ_gN2jWBuQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bayesian+Spanning+Tree%3A+Estimating+the+Backbone+of+the+Dependence+Graph&rft.jtitle=arXiv.org&rft.au=Duan%2C+Leo+L&rft.au=Dunson%2C+David+B&rft.date=2021-06-30&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422 |