Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently

We propose a new framework for formulating optimal transport distances between Markov chains. Previously known formulations studied couplings between the entire joint distribution induced by the chains, and derived solutions via a reduction to dynamic programming (DP) in an appropriately defined Mar...

Full description

Saved in:
Bibliographic Details
Main Authors Calo, Sergio, Jonsson, Anders, Neu, Gergely, Schwartz, Ludovic, Segovia-Aguas, Javier
Format Journal Article
LanguageEnglish
Published 06.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract We propose a new framework for formulating optimal transport distances between Markov chains. Previously known formulations studied couplings between the entire joint distribution induced by the chains, and derived solutions via a reduction to dynamic programming (DP) in an appropriately defined Markov decision process. This formulation has, however, not led to particularly efficient algorithms so far, since computing the associated DP operators requires fully solving a static optimal transport problem, and these operators need to be applied numerous times during the overall optimization process. In this work, we develop an alternative perspective by considering couplings between a flattened version of the joint distributions that we call discounted occupancy couplings, and show that calculating optimal transport distances in the full space of joint distributions can be equivalently formulated as solving a linear program (LP) in this reduced space. This LP formulation allows us to port several algorithmic ideas from other areas of optimal transport theory. In particular, our formulation makes it possible to introduce an appropriate notion of entropy regularization into the optimization problem, which in turn enables us to directly calculate optimal transport distances via a Sinkhorn-like method we call Sinkhorn Value Iteration (SVI). We show both theoretically and empirically that this method converges quickly to an optimal coupling, essentially at the same computational cost of running vanilla Sinkhorn in each pair of states. Along the way, we point out that our optimal transport distance exactly matches the common notion of bisimulation metrics between Markov chains, and thus our results also apply to computing such metrics, and in fact our algorithm turns out to be significantly more efficient than the best known methods developed so far for this purpose.
AbstractList We propose a new framework for formulating optimal transport distances between Markov chains. Previously known formulations studied couplings between the entire joint distribution induced by the chains, and derived solutions via a reduction to dynamic programming (DP) in an appropriately defined Markov decision process. This formulation has, however, not led to particularly efficient algorithms so far, since computing the associated DP operators requires fully solving a static optimal transport problem, and these operators need to be applied numerous times during the overall optimization process. In this work, we develop an alternative perspective by considering couplings between a flattened version of the joint distributions that we call discounted occupancy couplings, and show that calculating optimal transport distances in the full space of joint distributions can be equivalently formulated as solving a linear program (LP) in this reduced space. This LP formulation allows us to port several algorithmic ideas from other areas of optimal transport theory. In particular, our formulation makes it possible to introduce an appropriate notion of entropy regularization into the optimization problem, which in turn enables us to directly calculate optimal transport distances via a Sinkhorn-like method we call Sinkhorn Value Iteration (SVI). We show both theoretically and empirically that this method converges quickly to an optimal coupling, essentially at the same computational cost of running vanilla Sinkhorn in each pair of states. Along the way, we point out that our optimal transport distance exactly matches the common notion of bisimulation metrics between Markov chains, and thus our results also apply to computing such metrics, and in fact our algorithm turns out to be significantly more efficient than the best known methods developed so far for this purpose.
Author Neu, Gergely
Segovia-Aguas, Javier
Schwartz, Ludovic
Jonsson, Anders
Calo, Sergio
Author_xml – sequence: 1
  givenname: Sergio
  surname: Calo
  fullname: Calo, Sergio
– sequence: 2
  givenname: Anders
  surname: Jonsson
  fullname: Jonsson, Anders
– sequence: 3
  givenname: Gergely
  surname: Neu
  fullname: Neu, Gergely
– sequence: 4
  givenname: Ludovic
  surname: Schwartz
  fullname: Schwartz, Ludovic
– sequence: 5
  givenname: Javier
  surname: Segovia-Aguas
  fullname: Segovia-Aguas, Javier
BackLink https://doi.org/10.48550/arXiv.2406.04056$$DView paper in arXiv
BookMark eNqFzr0OgjAUhuEOOvh3AU6eC1CsCsRZxLgYF5zJEQ7JSUrbtMXI3fsTd6dvefPlGYuBNpqEmG9kFO-TRK7RPfkRbWOZRjKWSToStwN7bjuFgY2GCwXHlQd0BFcbuEUFhUPtrXEBjuwD6or8ElDXkKGGO0FmWtsFqiFvGq6YdFD9VAwbVJ5mv52IxSkvsvPqCyitez-7vvxAyi9k9794AaumQAM
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
AKZ
EPD
GOX
DOI 10.48550/arxiv.2406.04056
DatabaseName arXiv Computer Science
arXiv Mathematics
arXiv Statistics
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2406_04056
GroupedDBID AKY
AKZ
EPD
GOX
ID FETCH-arxiv_primary_2406_040563
IEDL.DBID GOX
IngestDate Tue Jun 18 04:50:29 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2406_040563
OpenAccessLink https://arxiv.org/abs/2406.04056
ParticipantIDs arxiv_primary_2406_04056
PublicationCentury 2000
PublicationDate 2024-06-06
PublicationDateYYYYMMDD 2024-06-06
PublicationDate_xml – month: 06
  year: 2024
  text: 2024-06-06
  day: 06
PublicationDecade 2020
PublicationYear 2024
Score 3.8495162
SecondaryResourceType preprint
Snippet We propose a new framework for formulating optimal transport distances between Markov chains. Previously known formulations studied couplings between the...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Learning
Mathematics - Optimization and Control
Statistics - Machine Learning
Title Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently
URI https://arxiv.org/abs/2406.04056
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEB7anryIolLfc_Doapum-zhqbS1C7cXC3pbMNoEFW2S3iv57Z7Lr49JrMoQhD775ksk3AFeDpTANygPTk9sqF_cC0ioPYuM0Oav72slv5NlzOF3op3SYtgB__sKY8rP4qPWBqboVuLnhbTYM29BWSlK2Hudp_Tjppbga-z87jjF90z-QmOzBbhPd4V29HPvQsusDWNwXVbFqymThTEpY5RWa0uKcz-uK7X8VxvFBwjnJbL5Gpvg4Mmski03phSWOveAD48Tr1yFcTsYvo2ngHcneatWITHzMvI-DI-gwt7ddQFH_UQkxQ-XAJYmJIhflzAgoSsI-xfYYuttGOdnedQo7irHXZzSFZ9DZlO_2nLFzQxd-Ar8B__Bzcw
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bisimulation+Metrics+are+Optimal+Transport+Distances%2C+and+Can+be+Computed+Efficiently&rft.au=Calo%2C+Sergio&rft.au=Jonsson%2C+Anders&rft.au=Neu%2C+Gergely&rft.au=Schwartz%2C+Ludovic&rft.date=2024-06-06&rft_id=info:doi/10.48550%2Farxiv.2406.04056&rft.externalDocID=2406_04056