Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control

This paper addresses distributed optimal tracking control of multi-agent linear systems subject to external disturbances. The concept of differential game theory is utilized to formulate this distributed control problem into a multi-player zero-sum differential graphical game, which provides a new p...

Full description

Saved in:
Bibliographic Details
Published inAutomatica (Oxford) Vol. 69; pp. 24 - 34
Main Authors Jiao, Qiang, Modares, Hamidreza, Xu, Shengyuan, Lewis, Frank L., Vamvoudakis, Kyriakos G.
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.07.2016
Subjects
Online AccessGet full text

Cover

Loading…
Abstract This paper addresses distributed optimal tracking control of multi-agent linear systems subject to external disturbances. The concept of differential game theory is utilized to formulate this distributed control problem into a multi-player zero-sum differential graphical game, which provides a new perspective on distributed tracking of multiple agents influenced by disturbances. In the presented differential graphical game, the dynamics and performance indices for each node depend on local neighbor information and disturbances. It is shown that the solution to the multi-agent differential graphical games in the presence of disturbances requires the solution to coupled Hamilton–Jacobi–Isaacs (HJI) equations. Multi-agent learning policy iteration (PI) algorithm is provided to find the solution to these coupled HJI equations and its convergence is proven. It is also shown that L2-bounded synchronization errors can be guaranteed using this technique. An online PI algorithm is given to solve the zero-sum game in real time. A simulation example is provided to show the effectiveness of the online approach.
AbstractList This paper addresses distributed optimal tracking control of multi-agent linear systems subject to external disturbances. The concept of differential game theory is utilized to formulate this distributed control problem into a multi-player zero-sum differential graphical game, which provides a new perspective on distributed tracking of multiple agents influenced by disturbances. In the presented differential graphical game, the dynamics and performance indices for each node depend on local neighbor information and disturbances. It is shown that the solution to the multi-agent differential graphical games in the presence of disturbances requires the solution to coupled Hamilton–Jacobi–Isaacs (HJI) equations. Multi-agent learning policy iteration (PI) algorithm is provided to find the solution to these coupled HJI equations and its convergence is proven. It is also shown that L2-bounded synchronization errors can be guaranteed using this technique. An online PI algorithm is given to solve the zero-sum game in real time. A simulation example is provided to show the effectiveness of the online approach.
This paper addresses distributed optimal tracking control of multi-agent linear systems subject to external disturbances. The concept of differential game theory is utilized to formulate this distributed control problem into a multi-player zero-sum differential graphical game, which provides a new perspective on distributed tracking of multiple agents influenced by disturbances. In the presented differential graphical game, the dynamics and performance indices for each node depend on local neighbor information and disturbances. It is shown that the solution to the multi-agent differential graphical games in the presence of disturbances requires the solution to coupled Hamilton-Jacobi-Isaacs (HJI) equations. Multi-agent learning policy iteration (PI) algorithm is provided to find the solution to these coupled HJI equations and its convergence is proven. It is also shown that L sub(2)L2-bounded synchronization errors can be guaranteed using this technique. An online PI algorithm is given to solve the zero-sum game in real time. A simulation example is provided to show the effectiveness of the online approach.
Author Xu, Shengyuan
Vamvoudakis, Kyriakos G.
Lewis, Frank L.
Modares, Hamidreza
Jiao, Qiang
Author_xml – sequence: 1
  givenname: Qiang
  surname: Jiao
  fullname: Jiao, Qiang
  email: qjiao0312@gmail.com
  organization: School of Automation, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, PR China
– sequence: 2
  givenname: Hamidreza
  surname: Modares
  fullname: Modares, Hamidreza
  email: modares@uta.edu
  organization: University of Texas at Arlington Research Institute, 7300 Jack Newell Blvd. S., Ft. Worth, TX 76118, USA
– sequence: 3
  givenname: Shengyuan
  surname: Xu
  fullname: Xu, Shengyuan
  email: syxu@njust.edu.cn
  organization: School of Automation, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, PR China
– sequence: 4
  givenname: Frank L.
  surname: Lewis
  fullname: Lewis, Frank L.
  email: lewis@uta.edu
  organization: University of Texas at Arlington Research Institute, 7300 Jack Newell Blvd. S., Ft. Worth, TX 76118, USA
– sequence: 5
  givenname: Kyriakos G.
  surname: Vamvoudakis
  fullname: Vamvoudakis, Kyriakos G.
  email: kyriakos@ece.ucsb.edu
  organization: Center for Control, Dynamical-systems and Computation (CCDC), University of California, Santa Barbara, CA 93106-9560, USA
BookMark eNqNkE1LxDAQhoOs4O7qf-jRS2uSfmUvgi5-geJFz2GaTtYsbbMmqaC_3tQVBC96ymTmnQfmWZDZYAckJGE0Y5RVZ9sMxmB7CEZBxmMnozyjlB-QORN1nnKRVzMyp5SWKaMrcUQW3m_jt2CCz0nzMHbBpLDBISQf6Gzqxz5pjdboYstAl2wc7F4iPVbQo0-0dTHgw-gaGBQmDreogrFDYoavgTPNGLBNlB2Cs90xOdTQeTz5fpfk-frqaX2b3j_e3K0v7lOVlyykq7qknMdSgMgbqpVQumkYbxUUgrE8r0FphKKsVcxUKrZo2ZYaq1XBGRT5kpzuuTtnX0f0QfbGK-w6GNCOXsZ7y4lE8xgV-6hy1nuHWu6c6cG9S0blpFVu5Y9WOWmVlMuoNa6e_1pVJsB0fnBguv8ALvcAjC7eDDrplcEosjUuepStNX9DPgGgCJ7X
CitedBy_id crossref_primary_10_1016_j_cja_2021_08_005
crossref_primary_10_1002_oca_3174
crossref_primary_10_1016_j_neucom_2021_03_021
crossref_primary_10_1631_FITEE_2200010
crossref_primary_10_1109_ACCESS_2024_3400590
crossref_primary_10_1109_TCYB_2021_3090067
crossref_primary_10_1016_j_amc_2024_128979
crossref_primary_10_1109_JIOT_2023_3303448
crossref_primary_10_1109_TNNLS_2020_3044039
crossref_primary_10_1080_00207179_2020_1790663
crossref_primary_10_1109_TAES_2020_3010593
crossref_primary_10_1002_rnc_5263
crossref_primary_10_1109_TFUZZ_2018_2859904
crossref_primary_10_1109_TNNLS_2023_3291542
crossref_primary_10_1007_s11071_018_4228_8
crossref_primary_10_1109_TCYB_2018_2819695
crossref_primary_10_1109_TCYB_2018_2856089
crossref_primary_10_1109_TCNS_2024_3395852
crossref_primary_10_1016_j_jfranklin_2023_02_033
crossref_primary_10_1109_TCSI_2024_3366942
crossref_primary_10_1002_rnc_7710
crossref_primary_10_1109_TFUZZ_2021_3075501
crossref_primary_10_1016_j_neucom_2017_09_020
crossref_primary_10_1109_TAC_2018_2879568
crossref_primary_10_1109_TCYB_2025_3534463
crossref_primary_10_1007_s11063_022_11085_0
crossref_primary_10_1007_s11071_023_08496_6
crossref_primary_10_1002_acs_3945
crossref_primary_10_1109_TCYB_2018_2856510
crossref_primary_10_1016_j_automatica_2019_108656
crossref_primary_10_3390_app12157551
crossref_primary_10_1016_j_ast_2025_110080
crossref_primary_10_29252_joc_12_2_13
crossref_primary_10_1080_00207179_2018_1467044
crossref_primary_10_1109_TAC_2020_3027795
crossref_primary_10_1109_TSMC_2022_3177043
crossref_primary_10_1360_SST_2022_0208
crossref_primary_10_1002_rnc_7189
crossref_primary_10_1109_JSYST_2022_3223715
crossref_primary_10_1109_LCSYS_2020_3001240
crossref_primary_10_1016_j_isatra_2019_01_021
crossref_primary_10_1002_acs_2866
crossref_primary_10_1049_iet_cta_2017_0875
crossref_primary_10_1016_j_jfranklin_2025_107639
crossref_primary_10_1016_j_neunet_2022_08_010
crossref_primary_10_1109_TNSE_2022_3185019
crossref_primary_10_1016_j_ifacol_2020_12_2180
crossref_primary_10_1016_j_ins_2023_118949
crossref_primary_10_1016_j_neunet_2018_06_007
crossref_primary_10_1109_TSMC_2018_2810117
crossref_primary_10_1016_j_neucom_2021_03_017
crossref_primary_10_1109_TSMC_2019_2944259
crossref_primary_10_1109_TSMC_2018_2861470
crossref_primary_10_1109_TCYB_2024_3419056
crossref_primary_10_1049_iet_cta_2020_0259
crossref_primary_10_1002_rnc_7698
crossref_primary_10_1002_rnc_7574
crossref_primary_10_1109_TITS_2024_3362959
crossref_primary_10_1109_TNSE_2023_3309816
crossref_primary_10_1016_j_automatica_2023_111468
crossref_primary_10_1109_TCYB_2018_2868715
crossref_primary_10_1016_j_nahs_2019_03_003
crossref_primary_10_1109_ACCESS_2023_3239665
crossref_primary_10_1109_TCYB_2017_2788819
crossref_primary_10_1109_TCYB_2024_3372606
crossref_primary_10_1016_j_ast_2021_106568
crossref_primary_10_1109_TAC_2021_3122382
crossref_primary_10_1109_TCYB_2022_3215716
crossref_primary_10_1109_TCYB_2019_2950262
crossref_primary_10_1109_TII_2021_3137816
crossref_primary_10_1080_00207179_2018_1441550
crossref_primary_10_1109_TNNLS_2019_2958948
crossref_primary_10_1002_rnc_4650
crossref_primary_10_1016_j_jfranklin_2022_01_012
crossref_primary_10_1109_TAES_2024_3407735
crossref_primary_10_1016_j_isatra_2018_10_005
crossref_primary_10_1109_TSMC_2017_2693209
crossref_primary_10_1049_iet_cta_2017_0259
crossref_primary_10_1109_TCYB_2016_2611613
crossref_primary_10_1109_TNNLS_2023_3287881
crossref_primary_10_1109_TASE_2023_3327264
crossref_primary_10_3390_math10152728
crossref_primary_10_1109_TCNS_2022_3181550
crossref_primary_10_12677_AAM_2022_111022
crossref_primary_10_1016_j_jfranklin_2023_06_015
crossref_primary_10_1016_j_neunet_2022_09_024
crossref_primary_10_1109_JSYST_2024_3391766
crossref_primary_10_1109_TCYB_2022_3195361
crossref_primary_10_1109_TNNLS_2020_2969249
crossref_primary_10_1109_ACCESS_2020_2970760
crossref_primary_10_1016_j_automatica_2021_110076
crossref_primary_10_1109_TCSI_2024_3377641
crossref_primary_10_1016_j_ijepes_2023_109704
crossref_primary_10_1109_TNNLS_2020_3042508
crossref_primary_10_1080_00207179_2017_1300685
crossref_primary_10_1016_j_ast_2022_107759
crossref_primary_10_1016_j_jfranklin_2021_08_022
crossref_primary_10_1109_TSMC_2018_2889377
crossref_primary_10_1016_j_jfranklin_2024_107263
crossref_primary_10_1109_TCYB_2025_3530456
crossref_primary_10_1109_TSMC_2019_2914160
crossref_primary_10_1109_JSYST_2023_3318525
crossref_primary_10_1016_j_ins_2024_121234
crossref_primary_10_1049_iet_cta_2018_5832
crossref_primary_10_1002_rnc_7786
crossref_primary_10_1109_MSMC_2023_3282774
crossref_primary_10_1007_s11071_024_10493_2
crossref_primary_10_1016_j_jfranklin_2018_01_016
crossref_primary_10_1109_TNNLS_2020_3023711
crossref_primary_10_1016_j_isatra_2022_09_012
crossref_primary_10_1587_transfun_E99_A_1721
crossref_primary_10_1002_rnc_4538
crossref_primary_10_1016_j_neucom_2020_06_106
crossref_primary_10_1109_TAC_2022_3226142
crossref_primary_10_1177_09544100241277236
crossref_primary_10_1016_j_isatra_2020_10_043
crossref_primary_10_1109_TIE_2017_2782245
crossref_primary_10_1109_LCSYS_2022_3175665
crossref_primary_10_1080_00207179_2024_2447570
crossref_primary_10_1109_TCNS_2019_2959163
crossref_primary_10_1109_TCNS_2022_3203896
crossref_primary_10_1007_s40435_022_00983_9
crossref_primary_10_1016_j_ast_2020_105894
crossref_primary_10_1016_j_ast_2025_110052
crossref_primary_10_1016_j_neucom_2017_01_047
crossref_primary_10_1016_j_neucom_2021_01_063
crossref_primary_10_1002_rnc_7378
crossref_primary_10_1109_TICPS_2024_3373715
crossref_primary_10_1016_j_neunet_2024_106566
Cites_doi 10.1109/TSMCC.2007.913919
10.1080/00207170903267039
10.1007/s12555-012-0495-1
10.1007/s12555-011-0609-1
10.1109/TCSI.2004.835655
10.1109/TAC.2004.834433
10.1016/S0378-4371(02)00772-0
10.1109/TAC.2009.2017977
10.1002/rnc.1760
10.1109/TAC.1986.1104342
10.1109/TAC.2003.812781
10.1007/s11424-009-9145-y
10.1002/acs.2348
10.1016/j.automatica.2006.02.013
10.1109/ACC.2006.1656455
10.1109/RiiSS.2013.6607932
10.1016/j.automatica.2004.11.034
10.1016/j.automatica.2011.01.054
10.1109/TAC.2005.846556
10.1115/1.2764508
10.1080/00207179.2011.654264
10.1109/TMECH.2009.2014057
10.1109/TAC.2004.834113
10.1109/TSMCB.2008.920998
10.1109/TNN.2008.2000204
10.1016/j.sysconle.2008.01.002
10.1109/TAC.2006.884959
10.1016/S1389-0417(01)00015-8
10.1016/j.automatica.2012.05.074
ContentType Journal Article
Copyright 2016 Elsevier Ltd
Copyright_xml – notice: 2016 Elsevier Ltd
DBID AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
DOI 10.1016/j.automatica.2016.02.002
DatabaseName CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Technology Research Database
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 1873-2836
EndPage 34
ExternalDocumentID 10_1016_j_automatica_2016_02_002
S0005109816300346
GroupedDBID --K
--M
-~X
.DC
.~1
0R~
1B1
1~.
1~5
23N
3R3
4.4
457
4G.
5GY
5VS
6TJ
7-5
71M
8P~
9JN
9JO
AAAKF
AAAKG
AABNK
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AARIN
AAXUO
ABDEX
ABFNM
ABFRF
ABJNI
ABMAC
ABUCO
ABXDB
ABYKQ
ACBEA
ACDAQ
ACGFO
ACGFS
ACNNM
ACRLP
ADBBV
ADEZE
ADIYS
ADMUD
ADTZH
AEBSH
AECPX
AEFWE
AEKER
AENEX
AFFNX
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHJVU
AHPGS
AI.
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
APLSM
ASPBG
AVWKF
AXJTR
AZFZN
BJAXD
BKOJK
BLXMC
CS3
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-2
G-Q
GBLVA
HAMUX
HLZ
HVGLF
HZ~
H~9
IHE
J1W
JJJVA
K-O
KOM
LG9
LY7
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
Q38
R2-
RIG
ROL
RPZ
RXW
SBC
SDF
SDG
SDP
SES
SET
SEW
SPC
SPCBC
SSB
SSD
SST
SSZ
T5K
T9H
TAE
TN5
VH1
WH7
WUQ
X6Y
XFK
XPP
ZMT
~G-
AATTM
AAXKI
AAYWO
AAYXX
ABWVN
ACRPL
ACVFH
ADCNI
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AFXIZ
AGCQF
AGQPQ
AGRNS
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
BNPGV
CITATION
SSH
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c351t-9750223518a83b0fc8cfbb12dca4811337acfea457c5186c11305d5fe69421a43
IEDL.DBID .~1
ISSN 0005-1098
IngestDate Fri Jul 11 08:33:05 EDT 2025
Thu Apr 24 22:58:47 EDT 2025
Tue Jul 01 00:43:39 EDT 2025
Fri Feb 23 02:23:49 EST 2024
IsPeerReviewed true
IsScholarly true
Keywords L2-gain
Multi-agent system
External disturbances
Graphical games
Hamilton–Jacobi–Isaacs equations
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c351t-9750223518a83b0fc8cfbb12dca4811337acfea457c5186c11305d5fe69421a43
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
PQID 1825481103
PQPubID 23500
PageCount 11
ParticipantIDs proquest_miscellaneous_1825481103
crossref_primary_10_1016_j_automatica_2016_02_002
crossref_citationtrail_10_1016_j_automatica_2016_02_002
elsevier_sciencedirect_doi_10_1016_j_automatica_2016_02_002
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate July 2016
2016-07-00
20160701
PublicationDateYYYYMMDD 2016-07-01
PublicationDate_xml – month: 07
  year: 2016
  text: July 2016
PublicationDecade 2010
PublicationTitle Automatica (Oxford)
PublicationYear 2016
Publisher Elsevier Ltd
Publisher_xml – name: Elsevier Ltd
References Liu, Jia (br000105) 2011; 9
Vamvoudakis, Lewis (br000160) 2012; 22
Liu, Jia (br000100) 2010; 83
Li, Duan, Chen (br000075) 2011; 47
Ren, Moore, Chen (br000145) 2007; 129
Li, Wang, Chen (br000085) 2004; 51
Luy, N.T., Thanh, N.T., & Tri, H.M. (2013). Reinforcement learning-based robust adaptive tracking control for multi-wheeled mobile robots synchronization with optimality. In
Wheeler, Narendra (br000185) 1986; 31
Vamvoudakis, Lewis, Hudas (br000165) 2012; 48
Olfati-Saber, Murray (br000120) 2004; 49
Lakshmanan, H., & Farias, D.P. (2006). Decentralized approximate dynamic programming for dynamic networks of agents. In
Abu-Khalaf, Lewis (br000005) 2005; 41
Chang (br000035) 2009; 54
Modares, Lewis, Naghibi-Sistani (br000115) 2014; 28
(pp. 1859–1864).
Wang, Chen (br000175) 2002; 310
Tsitsiklis (br000150) 1984
Lewis, Zhang, Hengster-Movric, Das (br000070) 2014
Li, Duan, Huang (br000080) 2009; 22
Jadbabaie, Lin, Morse (br000050) 2003; 48
Ren, Beard (br000130) 2005; 50
Abu-Khalaf, Lewis, Huang (br000010) 2006; 51
Yang, Wang (br000190) 2013; 11
Hong, Hu, Gao (br000045) 2006; 42
Wen, Duan, Li, Chen (br000180) 2012; 85
Lewis, Vrabie, Syrmos (br000065) 2012
Qu (br000125) 2009
Littman (br000095) 2001; 2
(pp. 74–81).
Ren, Beard (br000135) 2008
Vrancx, Verbeeck, Nowe (br000170) 2008; 38
Basar, Bernhard (br000025) 2008
(pp. 1648–1653).
Vamvoudakis, K.G., Carrillo, L.R.G., & Hespanha, J.P. (2013). Learning consensus in adversarial environments. In
Khoo, Xie, Man (br000055) 2009; 14
Fax, Murray (br000040) 2004; 49
(pp. 87410K–87410K).
Busoniu, Babuska, De Schutter (br000030) 2008; 38
Lin, Jia, Li (br000090) 2008; 57
Aliyu (br000020) 2011
Abu-Khalaf, Lewis, Huang (br000015) 2008; 19
Ren, W., Beard, R.W., & Atkins, E.M. (2005). A survey of consensus problems in multi-agent coordination. In
Olfati-Saber (10.1016/j.automatica.2016.02.002_br000120) 2004; 49
Qu (10.1016/j.automatica.2016.02.002_br000125) 2009
Modares (10.1016/j.automatica.2016.02.002_br000115) 2014; 28
Tsitsiklis (10.1016/j.automatica.2016.02.002_br000150) 1984
Chang (10.1016/j.automatica.2016.02.002_br000035) 2009; 54
Wheeler (10.1016/j.automatica.2016.02.002_br000185) 1986; 31
Li (10.1016/j.automatica.2016.02.002_br000085) 2004; 51
Abu-Khalaf (10.1016/j.automatica.2016.02.002_br000015) 2008; 19
Abu-Khalaf (10.1016/j.automatica.2016.02.002_br000010) 2006; 51
Liu (10.1016/j.automatica.2016.02.002_br000100) 2010; 83
Abu-Khalaf (10.1016/j.automatica.2016.02.002_br000005) 2005; 41
Li (10.1016/j.automatica.2016.02.002_br000075) 2011; 47
10.1016/j.automatica.2016.02.002_br000060
10.1016/j.automatica.2016.02.002_br000140
Ren (10.1016/j.automatica.2016.02.002_br000130) 2005; 50
Wen (10.1016/j.automatica.2016.02.002_br000180) 2012; 85
Wang (10.1016/j.automatica.2016.02.002_br000175) 2002; 310
Basar (10.1016/j.automatica.2016.02.002_br000025) 2008
Vrancx (10.1016/j.automatica.2016.02.002_br000170) 2008; 38
Ren (10.1016/j.automatica.2016.02.002_br000135) 2008
Aliyu (10.1016/j.automatica.2016.02.002_br000020) 2011
Hong (10.1016/j.automatica.2016.02.002_br000045) 2006; 42
Liu (10.1016/j.automatica.2016.02.002_br000105) 2011; 9
Lin (10.1016/j.automatica.2016.02.002_br000090) 2008; 57
Ren (10.1016/j.automatica.2016.02.002_br000145) 2007; 129
Fax (10.1016/j.automatica.2016.02.002_br000040) 2004; 49
10.1016/j.automatica.2016.02.002_br000155
Khoo (10.1016/j.automatica.2016.02.002_br000055) 2009; 14
10.1016/j.automatica.2016.02.002_br000110
Busoniu (10.1016/j.automatica.2016.02.002_br000030) 2008; 38
Lewis (10.1016/j.automatica.2016.02.002_br000070) 2014
Li (10.1016/j.automatica.2016.02.002_br000080) 2009; 22
Jadbabaie (10.1016/j.automatica.2016.02.002_br000050) 2003; 48
Yang (10.1016/j.automatica.2016.02.002_br000190) 2013; 11
Vamvoudakis (10.1016/j.automatica.2016.02.002_br000160) 2012; 22
Littman (10.1016/j.automatica.2016.02.002_br000095) 2001; 2
Lewis (10.1016/j.automatica.2016.02.002_br000065) 2012
Vamvoudakis (10.1016/j.automatica.2016.02.002_br000165) 2012; 48
References_xml – reference: (pp. 74–81).
– year: 2014
  ident: br000070
  article-title: Cooperative control of multi-agent systems: optimal and adaptive design approaches
– year: 2008
  ident: br000025
  article-title: optimal control and related minimax design problems: A dynamic game approach
– reference: (pp. 1859–1864).
– volume: 54
  start-page: 1648
  year: 2009
  end-page: 1653
  ident: br000035
  article-title: Decentralized learning in finite Markov chains: Revisited
  publication-title: IEEE Transactions on Automatic Control
– volume: 49
  start-page: 1465
  year: 2004
  end-page: 1476
  ident: br000040
  article-title: Information flow and cooperative control of vehicle formations
  publication-title: IEEE Transactions on Automatic Control
– volume: 50
  start-page: 655
  year: 2005
  end-page: 661
  ident: br000130
  article-title: Consensus seeking in multi-agent systems under dynamically changing interaction topologies
  publication-title: IEEE Transactions on Automatic Control
– volume: 2
  start-page: 55
  year: 2001
  end-page: 66
  ident: br000095
  article-title: Value-function reinforcement learning in Markov games
  publication-title: Journal of Cognitive Systems Research
– volume: 31
  start-page: 519
  year: 1986
  end-page: 526
  ident: br000185
  article-title: Decentralized learning in finite Markov chains
  publication-title: IEEE Transactions on Automatic Control
– volume: 48
  start-page: 1598
  year: 2012
  end-page: 1611
  ident: br000165
  article-title: Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
  publication-title: Automatica
– reference: Luy, N.T., Thanh, N.T., & Tri, H.M. (2013). Reinforcement learning-based robust adaptive tracking control for multi-wheeled mobile robots synchronization with optimality. In
– volume: 47
  start-page: 797
  year: 2011
  end-page: 803
  ident: br000075
  article-title: On
  publication-title: Automatica
– reference: Ren, W., Beard, R.W., & Atkins, E.M. (2005). A survey of consensus problems in multi-agent coordination. In
– year: 2011
  ident: br000020
  article-title: Nonlinear
– volume: 51
  start-page: 1989
  year: 2006
  end-page: 1995
  ident: br000010
  article-title: Policy iteration on the hamilton–jacobi-isaacs equation for
  publication-title: IEEE Transactions on Automatic Control
– volume: 19
  start-page: 1243
  year: 2008
  end-page: 1252
  ident: br000015
  article-title: Neurodynamic programming and zero-sum games for constrained control systems
  publication-title: IEEE Transactions on Neural Networks
– volume: 129
  start-page: 678
  year: 2007
  end-page: 688
  ident: br000145
  article-title: High-order and model reference consensus algorithms in cooperative control of multivehicle systems
  publication-title: Journal of Dynamic Systems, Measurement, and Control
– volume: 22
  start-page: 35
  year: 2009
  end-page: 48
  ident: br000080
  article-title: control of networked multi-agent systems
  publication-title: Journal of Systems Science and Complexity
– volume: 310
  start-page: 521
  year: 2002
  end-page: 531
  ident: br000175
  article-title: Pinning control of scale-free dynamical networks
  publication-title: Physica A. Statistical Mechanics and its Applications
– volume: 38
  start-page: 976
  year: 2008
  end-page: 981
  ident: br000170
  article-title: Decentralized Learning in Markov Games
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
– reference: (pp. 87410K–87410K).
– volume: 11
  start-page: 666
  year: 2013
  end-page: 674
  ident: br000190
  article-title: Finite-gain
  publication-title: International Journal of Control, Automation and Systems
– year: 1984
  ident: br000150
  article-title: Problems in decentralized decision making and computation
– volume: 22
  start-page: 1460
  year: 2012
  end-page: 1483
  ident: br000160
  article-title: Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
  publication-title: International Journal of Robust and Nonlinear Control
– volume: 42
  start-page: 1177
  year: 2006
  end-page: 1182
  ident: br000045
  article-title: Tracking control for multi-agent consensus with an active leader and variable topology
  publication-title: Automatica
– reference: (pp. 1648–1653).
– volume: 38
  start-page: 156
  year: 2008
  end-page: 172
  ident: br000030
  article-title: A Comprehensive Survey of Multiagent Reinforcement Learning
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
– volume: 83
  start-page: 527
  year: 2010
  end-page: 537
  ident: br000100
  article-title: consensus control of multi-agent systems with switching topology: a dynamic output feedback control
  publication-title: International Journal of Control
– volume: 14
  start-page: 219
  year: 2009
  end-page: 228
  ident: br000055
  article-title: Robust finite-time consensus tracking algorithm for multirobot systems
  publication-title: IEEE Transactions on Mechatronics
– reference: Vamvoudakis, K.G., Carrillo, L.R.G., & Hespanha, J.P. (2013). Learning consensus in adversarial environments. In
– volume: 85
  start-page: 384
  year: 2012
  end-page: 396
  ident: br000180
  article-title: Consensus and its
  publication-title: International Journal of Control
– volume: 49
  start-page: 1520
  year: 2004
  end-page: 1533
  ident: br000120
  article-title: Consensus problems in networks of agents with switching topology and time-delays
  publication-title: IEEE Transactions on Automatic Control
– year: 2008
  ident: br000135
  article-title: Distributed consensus in multi-vehicle cooperative control
– volume: 28
  start-page: 232
  year: 2014
  end-page: 254
  ident: br000115
  article-title: Online solution of nonquadratic two-player zero-sum games arising in the
  publication-title: International Journal of Adaptive Control and Signal Processing
– volume: 9
  start-page: 1086
  year: 2011
  end-page: 1094
  ident: br000105
  article-title: Robust
  publication-title: International Journal of Control, Automation and Systems
– year: 2009
  ident: br000125
  article-title: Cooperative control of dynamical systems: applications to autonomous vehicles
– volume: 48
  start-page: 988
  year: 2003
  end-page: 1001
  ident: br000050
  article-title: Coordination of groups of mobile autonomous agents using nearest neighbor rules
  publication-title: IEEE Transactions on Automatic Control
– volume: 57
  start-page: 643
  year: 2008
  end-page: 653
  ident: br000090
  article-title: Distributed robust
  publication-title: Systems & Control Letters
– volume: 41
  start-page: 779
  year: 2005
  end-page: 791
  ident: br000005
  article-title: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
  publication-title: Automatica
– reference: Lakshmanan, H., & Farias, D.P. (2006). Decentralized approximate dynamic programming for dynamic networks of agents. In
– year: 2012
  ident: br000065
  article-title: Optimal control
– volume: 51
  start-page: 2074
  year: 2004
  end-page: 2087
  ident: br000085
  article-title: Pinning a complex dynamical network to its equilibrium
  publication-title: IEEE Transactions on Circuits and Systems I: Regular Papers
– year: 2008
  ident: 10.1016/j.automatica.2016.02.002_br000135
– volume: 38
  start-page: 156
  issue: 2
  year: 2008
  ident: 10.1016/j.automatica.2016.02.002_br000030
  article-title: A Comprehensive Survey of Multiagent Reinforcement Learning
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
  doi: 10.1109/TSMCC.2007.913919
– volume: 83
  start-page: 527
  issue: 3
  year: 2010
  ident: 10.1016/j.automatica.2016.02.002_br000100
  article-title: H∞ consensus control of multi-agent systems with switching topology: a dynamic output feedback control
  publication-title: International Journal of Control
  doi: 10.1080/00207170903267039
– year: 2008
  ident: 10.1016/j.automatica.2016.02.002_br000025
– volume: 11
  start-page: 666
  issue: 4
  year: 2013
  ident: 10.1016/j.automatica.2016.02.002_br000190
  article-title: Finite-gain Lp Consensus of Multi-agent Systems
  publication-title: International Journal of Control, Automation and Systems
  doi: 10.1007/s12555-012-0495-1
– year: 1984
  ident: 10.1016/j.automatica.2016.02.002_br000150
– volume: 9
  start-page: 1086
  issue: 6
  year: 2011
  ident: 10.1016/j.automatica.2016.02.002_br000105
  article-title: Robust H∞ consensus control of uncertain multi-agent systems with time delays
  publication-title: International Journal of Control, Automation and Systems
  doi: 10.1007/s12555-011-0609-1
– volume: 51
  start-page: 2074
  issue: 10
  year: 2004
  ident: 10.1016/j.automatica.2016.02.002_br000085
  article-title: Pinning a complex dynamical network to its equilibrium
  publication-title: IEEE Transactions on Circuits and Systems I: Regular Papers
  doi: 10.1109/TCSI.2004.835655
– volume: 49
  start-page: 1465
  issue: 9
  year: 2004
  ident: 10.1016/j.automatica.2016.02.002_br000040
  article-title: Information flow and cooperative control of vehicle formations
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.2004.834433
– volume: 310
  start-page: 521
  issue: 3
  year: 2002
  ident: 10.1016/j.automatica.2016.02.002_br000175
  article-title: Pinning control of scale-free dynamical networks
  publication-title: Physica A. Statistical Mechanics and its Applications
  doi: 10.1016/S0378-4371(02)00772-0
– volume: 54
  start-page: 1648
  issue: 7
  year: 2009
  ident: 10.1016/j.automatica.2016.02.002_br000035
  article-title: Decentralized learning in finite Markov chains: Revisited
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.2009.2017977
– volume: 22
  start-page: 1460
  issue: 13
  year: 2012
  ident: 10.1016/j.automatica.2016.02.002_br000160
  article-title: Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
  publication-title: International Journal of Robust and Nonlinear Control
  doi: 10.1002/rnc.1760
– ident: 10.1016/j.automatica.2016.02.002_br000155
– volume: 31
  start-page: 519
  issue: 6
  year: 1986
  ident: 10.1016/j.automatica.2016.02.002_br000185
  article-title: Decentralized learning in finite Markov chains
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.1986.1104342
– volume: 48
  start-page: 988
  issue: 6
  year: 2003
  ident: 10.1016/j.automatica.2016.02.002_br000050
  article-title: Coordination of groups of mobile autonomous agents using nearest neighbor rules
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.2003.812781
– volume: 22
  start-page: 35
  issue: 1
  year: 2009
  ident: 10.1016/j.automatica.2016.02.002_br000080
  article-title: H∞ control of networked multi-agent systems
  publication-title: Journal of Systems Science and Complexity
  doi: 10.1007/s11424-009-9145-y
– volume: 28
  start-page: 232
  year: 2014
  ident: 10.1016/j.automatica.2016.02.002_br000115
  article-title: Online solution of nonquadratic two-player zero-sum games arising in the H∞ control of constrained-input systems
  publication-title: International Journal of Adaptive Control and Signal Processing
  doi: 10.1002/acs.2348
– volume: 42
  start-page: 1177
  issue: 7
  year: 2006
  ident: 10.1016/j.automatica.2016.02.002_br000045
  article-title: Tracking control for multi-agent consensus with an active leader and variable topology
  publication-title: Automatica
  doi: 10.1016/j.automatica.2006.02.013
– ident: 10.1016/j.automatica.2016.02.002_br000060
  doi: 10.1109/ACC.2006.1656455
– ident: 10.1016/j.automatica.2016.02.002_br000110
  doi: 10.1109/RiiSS.2013.6607932
– year: 2009
  ident: 10.1016/j.automatica.2016.02.002_br000125
– volume: 41
  start-page: 779
  issue: 5
  year: 2005
  ident: 10.1016/j.automatica.2016.02.002_br000005
  article-title: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
  publication-title: Automatica
  doi: 10.1016/j.automatica.2004.11.034
– year: 2012
  ident: 10.1016/j.automatica.2016.02.002_br000065
– volume: 47
  start-page: 797
  issue: 4
  year: 2011
  ident: 10.1016/j.automatica.2016.02.002_br000075
  article-title: On H∞ and H2 performance regions of multi-agent systems
  publication-title: Automatica
  doi: 10.1016/j.automatica.2011.01.054
– volume: 50
  start-page: 655
  issue: 5
  year: 2005
  ident: 10.1016/j.automatica.2016.02.002_br000130
  article-title: Consensus seeking in multi-agent systems under dynamically changing interaction topologies
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.2005.846556
– volume: 129
  start-page: 678
  issue: 5
  year: 2007
  ident: 10.1016/j.automatica.2016.02.002_br000145
  article-title: High-order and model reference consensus algorithms in cooperative control of multivehicle systems
  publication-title: Journal of Dynamic Systems, Measurement, and Control
  doi: 10.1115/1.2764508
– volume: 85
  start-page: 384
  issue: 4
  year: 2012
  ident: 10.1016/j.automatica.2016.02.002_br000180
  article-title: Consensus and its L2-gain performance of multi-agent systems with intermittent information transmissions
  publication-title: International Journal of Control
  doi: 10.1080/00207179.2011.654264
– volume: 14
  start-page: 219
  issue: 2
  year: 2009
  ident: 10.1016/j.automatica.2016.02.002_br000055
  article-title: Robust finite-time consensus tracking algorithm for multirobot systems
  publication-title: IEEE Transactions on Mechatronics
  doi: 10.1109/TMECH.2009.2014057
– volume: 49
  start-page: 1520
  issue: 9
  year: 2004
  ident: 10.1016/j.automatica.2016.02.002_br000120
  article-title: Consensus problems in networks of agents with switching topology and time-delays
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.2004.834113
– volume: 38
  start-page: 976
  issue: 4
  year: 2008
  ident: 10.1016/j.automatica.2016.02.002_br000170
  article-title: Decentralized Learning in Markov Games
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
  doi: 10.1109/TSMCB.2008.920998
– volume: 19
  start-page: 1243
  issue: 7
  year: 2008
  ident: 10.1016/j.automatica.2016.02.002_br000015
  article-title: Neurodynamic programming and zero-sum games for constrained control systems
  publication-title: IEEE Transactions on Neural Networks
  doi: 10.1109/TNN.2008.2000204
– volume: 57
  start-page: 643
  issue: 8
  year: 2008
  ident: 10.1016/j.automatica.2016.02.002_br000090
  article-title: Distributed robust H∞ consensus control in directed networks of agents with time-delay
  publication-title: Systems & Control Letters
  doi: 10.1016/j.sysconle.2008.01.002
– volume: 51
  start-page: 1989
  issue: 12
  year: 2006
  ident: 10.1016/j.automatica.2016.02.002_br000010
  article-title: Policy iteration on the hamilton–jacobi-isaacs equation for H∞ state feedback control with input saturation
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/TAC.2006.884959
– year: 2011
  ident: 10.1016/j.automatica.2016.02.002_br000020
– volume: 2
  start-page: 55
  issue: 1
  year: 2001
  ident: 10.1016/j.automatica.2016.02.002_br000095
  article-title: Value-function reinforcement learning in Markov games
  publication-title: Journal of Cognitive Systems Research
  doi: 10.1016/S1389-0417(01)00015-8
– ident: 10.1016/j.automatica.2016.02.002_br000140
– volume: 48
  start-page: 1598
  issue: 8
  year: 2012
  ident: 10.1016/j.automatica.2016.02.002_br000165
  article-title: Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
  publication-title: Automatica
  doi: 10.1016/j.automatica.2012.05.074
– year: 2014
  ident: 10.1016/j.automatica.2016.02.002_br000070
SSID ssj0004182
Score 2.578402
Snippet This paper addresses distributed optimal tracking control of multi-agent linear systems subject to external disturbances. The concept of differential game...
SourceID proquest
crossref
elsevier
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 24
SubjectTerms [formula omitted]-gain
Algorithms
Differential equations
Disturbances
External disturbances
Games
Graphical games
Hamilton–Jacobi–Isaacs equations
Joining
Mathematical analysis
Multi-agent system
Multiagent systems
Online
Title Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control
URI https://dx.doi.org/10.1016/j.automatica.2016.02.002
https://www.proquest.com/docview/1825481103
Volume 69
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8NAEF5KvehBfGJ9sYLX2GyymweeSrFUxZ4s9LZsNhtpKUlp04sHf7szm8SqIBQ8ZrNLwszszGzyzTeE3AY8DTNfpA6k_qHDFedOxDQcXE3sC-16aWjw08DLKBiO-dNETFqk39TCIKyy9v2VT7feuh7p1tLsLqZTrPFFg4ojhqxRPkfabc5DtPK7jw3Mg7OoYgy3jJtxVKN5KoyXWpeFZUZFBiIWVOyd3l8h6pezthFocED269SR9qq3OyQtkx-RvW-EgscksfW0jsJ6KfpuloUDhkabJiiwmefUMlSjZugbAmQpJK0wYQWRJ0EDoEszs-isnE5ze8N2xDIprUHtJ2Q8eHjtD526i4KjfcFKJ4acAHIAwSIV-Ymb6UhnScK8FPnMGRxRQ6Uzo7gINcwJNAy5IhWZCWLuMcX9U9LOi9ycERokIWxgFQgdKO5lWomUmdikLDJa6DjukLARnNQ1xTh2upjLBks2kxuRSxS5dD0JIu8Q9rVyUdFsbLHmvtGN_GEyEqLBFqtvGnVK2FH4m0TlplivJMNDM0jG9c__9YQLsotXFbb3krTL5dpcQQZTJtfWRK_JTu_xeTj6BKjA8y0
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8QwEB50PagH8YlvI3gtNm2StngSUdbXnhS8hTRNZUW6su5e_PXOpKkvEASvedAyM5mZtN98A3CkRJXVqawiTP2zSBghopxbvLi6IpU2TqrM0aeB24Hq34urB_kwA2ddLQzBKoPvb32699Zh5DhI8_hlOKQaXzKoIufEGpUKNQtzxE4lezB3enndH3yWR_K8JQ33pJtFHgA9LczLTCcjT45KJERctQSeyW9R6oe_9kHoYhmWQvbITtsXXIEZ16zC4hdOwTUofUltZKhkir258ShCW2NdHxQ8z8_Mk1STctgjYWQZ5q244BWDT0k2wMbuyQO0GjZs_IRviuUqFnDt63B_cX531o9CI4XIppJPogLTAkwDJM9NnpZxbXNblyVPKqI053hLzYytnREys7hGWRyKZSVrpwqRcCPSDeg1o8ZtAlNlhmfYKGmVEUltjay4K1zFc2elLYotyDrBaRtYxqnZxbPu4GRP-lPkmkSu40SjyLeAf-x8aZk2_rDnpNON_mY1GgPCH3YfdurUeKjoT4lp3Gj6qjndm1Eycbr9ryccwHz_7vZG31wOrndggWZaqO8u9CbjqdvDhGZS7geDfQf19_Xe
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Multi-agent+zero-sum+differential+graphical+games+for+disturbance+rejection+in+distributed+control&rft.jtitle=Automatica+%28Oxford%29&rft.au=Jiao%2C+Qiang&rft.au=Modares%2C+Hamidreza&rft.au=Xu%2C+Shengyuan&rft.au=Lewis%2C+Frank+L.&rft.date=2016-07-01&rft.pub=Elsevier+Ltd&rft.issn=0005-1098&rft.eissn=1873-2836&rft.volume=69&rft.spage=24&rft.epage=34&rft_id=info:doi/10.1016%2Fj.automatica.2016.02.002&rft.externalDocID=S0005109816300346
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0005-1098&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0005-1098&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0005-1098&client=summon