Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently
We propose a new framework for formulating optimal transport distances between Markov chains. Previously known formulations studied couplings between the entire joint distribution induced by the chains, and derived solutions via a reduction to dynamic programming (DP) in an appropriately defined Mar...
Saved in:
Main Authors | , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
06.06.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | We propose a new framework for formulating optimal transport distances
between Markov chains. Previously known formulations studied couplings between
the entire joint distribution induced by the chains, and derived solutions via
a reduction to dynamic programming (DP) in an appropriately defined Markov
decision process. This formulation has, however, not led to particularly
efficient algorithms so far, since computing the associated DP operators
requires fully solving a static optimal transport problem, and these operators
need to be applied numerous times during the overall optimization process. In
this work, we develop an alternative perspective by considering couplings
between a flattened version of the joint distributions that we call discounted
occupancy couplings, and show that calculating optimal transport distances in
the full space of joint distributions can be equivalently formulated as solving
a linear program (LP) in this reduced space. This LP formulation allows us to
port several algorithmic ideas from other areas of optimal transport theory. In
particular, our formulation makes it possible to introduce an appropriate
notion of entropy regularization into the optimization problem, which in turn
enables us to directly calculate optimal transport distances via a
Sinkhorn-like method we call Sinkhorn Value Iteration (SVI). We show both
theoretically and empirically that this method converges quickly to an optimal
coupling, essentially at the same computational cost of running vanilla
Sinkhorn in each pair of states. Along the way, we point out that our optimal
transport distance exactly matches the common notion of bisimulation metrics
between Markov chains, and thus our results also apply to computing such
metrics, and in fact our algorithm turns out to be significantly more efficient
than the best known methods developed so far for this purpose. |
---|---|
AbstractList | We propose a new framework for formulating optimal transport distances
between Markov chains. Previously known formulations studied couplings between
the entire joint distribution induced by the chains, and derived solutions via
a reduction to dynamic programming (DP) in an appropriately defined Markov
decision process. This formulation has, however, not led to particularly
efficient algorithms so far, since computing the associated DP operators
requires fully solving a static optimal transport problem, and these operators
need to be applied numerous times during the overall optimization process. In
this work, we develop an alternative perspective by considering couplings
between a flattened version of the joint distributions that we call discounted
occupancy couplings, and show that calculating optimal transport distances in
the full space of joint distributions can be equivalently formulated as solving
a linear program (LP) in this reduced space. This LP formulation allows us to
port several algorithmic ideas from other areas of optimal transport theory. In
particular, our formulation makes it possible to introduce an appropriate
notion of entropy regularization into the optimization problem, which in turn
enables us to directly calculate optimal transport distances via a
Sinkhorn-like method we call Sinkhorn Value Iteration (SVI). We show both
theoretically and empirically that this method converges quickly to an optimal
coupling, essentially at the same computational cost of running vanilla
Sinkhorn in each pair of states. Along the way, we point out that our optimal
transport distance exactly matches the common notion of bisimulation metrics
between Markov chains, and thus our results also apply to computing such
metrics, and in fact our algorithm turns out to be significantly more efficient
than the best known methods developed so far for this purpose. |
Author | Neu, Gergely Segovia-Aguas, Javier Schwartz, Ludovic Jonsson, Anders Calo, Sergio |
Author_xml | – sequence: 1 givenname: Sergio surname: Calo fullname: Calo, Sergio – sequence: 2 givenname: Anders surname: Jonsson fullname: Jonsson, Anders – sequence: 3 givenname: Gergely surname: Neu fullname: Neu, Gergely – sequence: 4 givenname: Ludovic surname: Schwartz fullname: Schwartz, Ludovic – sequence: 5 givenname: Javier surname: Segovia-Aguas fullname: Segovia-Aguas, Javier |
BackLink | https://doi.org/10.48550/arXiv.2406.04056$$DView paper in arXiv |
BookMark | eNqFzr0OgjAUhuEOOvh3AU6eC1CsCsRZxLgYF5zJEQ7JSUrbtMXI3fsTd6dvefPlGYuBNpqEmG9kFO-TRK7RPfkRbWOZRjKWSToStwN7bjuFgY2GCwXHlQd0BFcbuEUFhUPtrXEBjuwD6or8ElDXkKGGO0FmWtsFqiFvGq6YdFD9VAwbVJ5mv52IxSkvsvPqCyitez-7vvxAyi9k9794AaumQAM |
ContentType | Journal Article |
Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
DBID | AKY AKZ EPD GOX |
DOI | 10.48550/arxiv.2406.04056 |
DatabaseName | arXiv Computer Science arXiv Mathematics arXiv Statistics arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2406_04056 |
GroupedDBID | AKY AKZ EPD GOX |
ID | FETCH-arxiv_primary_2406_040563 |
IEDL.DBID | GOX |
IngestDate | Tue Jun 18 04:50:29 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-arxiv_primary_2406_040563 |
OpenAccessLink | https://arxiv.org/abs/2406.04056 |
ParticipantIDs | arxiv_primary_2406_04056 |
PublicationCentury | 2000 |
PublicationDate | 2024-06-06 |
PublicationDateYYYYMMDD | 2024-06-06 |
PublicationDate_xml | – month: 06 year: 2024 text: 2024-06-06 day: 06 |
PublicationDecade | 2020 |
PublicationYear | 2024 |
Score | 3.8495162 |
SecondaryResourceType | preprint |
Snippet | We propose a new framework for formulating optimal transport distances
between Markov chains. Previously known formulations studied couplings between
the... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Learning Mathematics - Optimization and Control Statistics - Machine Learning |
Title | Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently |
URI | https://arxiv.org/abs/2406.04056 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEB7anryIolLfc_Doapum-zhqbS1C7cXC3pbMNoEFW2S3iv57Z7Lr49JrMoQhD775ksk3AFeDpTANygPTk9sqF_cC0ioPYuM0Oav72slv5NlzOF3op3SYtgB__sKY8rP4qPWBqboVuLnhbTYM29BWSlK2Hudp_Tjppbga-z87jjF90z-QmOzBbhPd4V29HPvQsusDWNwXVbFqymThTEpY5RWa0uKcz-uK7X8VxvFBwjnJbL5Gpvg4Mmski03phSWOveAD48Tr1yFcTsYvo2ngHcneatWITHzMvI-DI-gwt7ddQFH_UQkxQ-XAJYmJIhflzAgoSsI-xfYYuttGOdnedQo7irHXZzSFZ9DZlO_2nLFzQxd-Ar8B__Bzcw |
link.rule.ids | 228,230,783,888 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bisimulation+Metrics+are+Optimal+Transport+Distances%2C+and+Can+be+Computed+Efficiently&rft.au=Calo%2C+Sergio&rft.au=Jonsson%2C+Anders&rft.au=Neu%2C+Gergely&rft.au=Schwartz%2C+Ludovic&rft.date=2024-06-06&rft_id=info:doi/10.48550%2Farxiv.2406.04056&rft.externalDocID=2406_04056 |