Competitive Markov decision processes with partial observation

Published in 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583) Vol. 1; pp. 236-241
Main Authors Shun-Pin Hsu; Arapostathis, A.
Format Conference Proceeding
Language English
Published Piscataway, NJ: IEEE, 2004
Abstract We study a class of Markov decision processes (MDPs) over an infinite time horizon in which there are two controllers and the observations may be imperfect. The system state space and the action space are both assumed finite, and the controllers, whose interests conflict, make decisions independently, each seeking its own best long-run average profit. Under the hypothesis that at least one system state is perfectly observable and is accessible from every system state no matter what actions are taken, we prove the existence of optimal policies for both controllers and characterize them by min-max type dynamic programming equations. An example from a class of machine maintenance processes is presented to illustrate the results.
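For orientation, the block below sketches the generic shape that a min-max dynamic programming equation commonly takes for a two-controller problem with a long-run average criterion. It is not taken from the paper, whose exact formulation is not reproduced in this record. Every symbol is an assumption introduced here: $\pi$ is a belief (a probability distribution over the finite state space) summarizing past observations, which are assumed to be shared by both controllers; $A$ and $B$ are the finite action sets; $\mu$ and $\nu$ are mixed actions; $r$ is the expected one-step profit of the maximizing controller; $q(y \mid \pi, a, b)$ is the probability of observing $y$; $T(\pi, a, b, y)$ is the updated belief; $\rho$ is the long-run average value; and $h$ is a relative value function.

```latex
% Generic sketch only (assumed notation, not the paper's equation):
% average-profit optimality equation of min-max type on the belief space.
\[
  \rho + h(\pi)
  = \max_{\mu \in \Delta(A)} \, \min_{\nu \in \Delta(B)}
    \sum_{a \in A} \sum_{b \in B} \mu(a)\, \nu(b)
    \Bigl[ r(\pi, a, b)
      + \sum_{y} q(y \mid \pi, a, b)\, h\bigl(T(\pi, a, b, y)\bigr) \Bigr]
\]
```

In equations of this type, a pair $(\rho, h)$ solving the equation yields the value of the game, and the maximizing $\mu$ and minimizing $\nu$ at each belief define stationary policies for the two controllers. Whether the paper uses mixed or pure actions, and which information each controller has, cannot be determined from this record.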
Author Shun-Pin Hsu (Dept. of Electr. Eng., Nat. Chi-Nan Univ., Nantou)
Arapostathis, A.
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17523435 (record in Pascal Francis)
ContentType Conference Proceeding
Copyright 2006 INIST-CNRS
DOI 10.1109/ICSMC.2004.1398303
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEL
IEEE Proceedings Order Plans (POP) 1998-present
Pascal-Francis
Discipline Engineering
Sciences (General)
Applied Sciences
EISSN 2577-1655
EndPage 241
ExternalDocumentID 17523435
1398303
Genre orig-research
ISBN 0780385667
9780780385665
ISSN 1062-922X
IsPeerReviewed false
IsScholarly true
Keywords Markov process
Infinite time
Markov decision
Optimal control
Minimax method
Optimal policy
Defect
Infinite horizon
Maintenance
Dynamic programming
Modeling
Observable
Language English
License CC BY 4.0
MeetingName 2004 IEEE International Conference on Systems, Man & Cybernetics (The Hague, Netherlands, 10-13 October 2004)
PublicationPlace_xml – name: Piscataway NJ
PublicationTitle 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583)
PublicationTitleAbbrev ICSMC
PublicationYear 2004
Publisher IEEE
StartPage 236
SubjectTerms Airplanes
Applied sciences
Computer networks
Computer science; control theory; systems
Control systems
Control theory. Systems
Cost function
Equations
Exact sciences and technology
Nash equilibrium
Optimal control
Processor scheduling
Stochastic processes
Stochastic systems
Title Competitive Markov decision processes with partial observation
URI https://ieeexplore.ieee.org/document/1398303
Volume 1