Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control
Published in | Automatica (Oxford) Vol. 69; pp. 24–34 |
---|---|
Main Authors | Jiao, Qiang; Modares, Hamidreza; Xu, Shengyuan; Lewis, Frank L.; Vamvoudakis, Kyriakos G. |
Format | Journal Article |
Language | English |
Published | Elsevier Ltd, 01.07.2016 |
Abstract | This paper addresses distributed optimal tracking control of multi-agent linear systems subject to external disturbances. Differential game theory is used to formulate this distributed control problem as a multi-player zero-sum differential graphical game, which provides a new perspective on distributed tracking of multiple agents influenced by disturbances. In the presented differential graphical game, the dynamics and performance indices for each node depend on local neighbor information and disturbances. It is shown that solving the multi-agent differential graphical game in the presence of disturbances requires the solution to coupled Hamilton–Jacobi–Isaacs (HJI) equations. A multi-agent learning policy iteration (PI) algorithm is provided to find the solution to these coupled HJI equations, and its convergence is proven. It is also shown that L2-bounded synchronization errors can be guaranteed using this technique. An online PI algorithm is given to solve the zero-sum game in real time. A simulation example is provided to show the effectiveness of the online approach. |
---|---|
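The policy iteration idea named in the abstract can be illustrated on a much simpler problem. The sketch below is not the paper's multi-agent algorithm: it assumes a single agent with scalar dynamics x' = a*x + b*u + d*w and cost integrand q*x^2 + r*u^2 - gamma^2*w^2, for which the HJI equation reduces to a scalar Riccati equation. All function names and parameter values are hypothetical and chosen only for illustration.

```python
import math


def zero_sum_policy_iteration(a, b, d, q, r, gamma, iters=20, tol=1e-12):
    """Policy iteration for an assumed scalar zero-sum game.

    Dynamics: x' = a*x + b*u + d*w
    Cost:     integral of (q*x^2 + r*u^2 - gamma^2*w^2) dt
    Policies: u = -k*x (minimizing control), w = l*x (maximizing disturbance)
    """
    k, l = 0.0, 0.0  # initial policies; stabilizing only if a < 0
    p = 0.0
    for _ in range(iters):
        a_cl = a - b * k + d * l  # closed-loop dynamics under current policies
        if a_cl >= 0:
            raise ValueError("closed loop is not stable; policies not admissible")
        # Policy evaluation: solve 2*a_cl*p + q + r*k^2 - gamma^2*l^2 = 0 for p.
        p_new = (q + r * k**2 - gamma**2 * l**2) / (-2.0 * a_cl)
        # Policy improvement for both players from the new value p_new*x^2.
        k = b * p_new / r
        l = d * p_new / gamma**2
        converged = abs(p_new - p) < tol
        p = p_new
        if converged:
            break
    return p, k, l


def riccati_root(a, b, d, q, r, gamma):
    """Positive root of 2*a*p + q - (b^2/r - d^2/gamma^2)*p^2 = 0 (s > 0 assumed)."""
    s = b**2 / r - d**2 / gamma**2
    return (a + math.sqrt(a**2 + s * q)) / s


if __name__ == "__main__":
    p, k, l = zero_sum_policy_iteration(-1.0, 1.0, 1.0, 1.0, 1.0, 2.0)
    print(p, riccati_root(-1.0, 1.0, 1.0, 1.0, 1.0, 2.0))
```

In this scalar setting the simultaneous update of both policies coincides with Newton's method on the Riccati equation, so the iterate can be checked against the closed-form root. The coupled, graph-dependent HJI equations described in the abstract do not reduce this way and need the distributed algorithm from the paper itself.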
Author | Xu, Shengyuan; Vamvoudakis, Kyriakos G.; Lewis, Frank L.; Modares, Hamidreza; Jiao, Qiang |
Author_xml | – sequence: 1 givenname: Qiang surname: Jiao fullname: Jiao, Qiang email: qjiao0312@gmail.com organization: School of Automation, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, PR China – sequence: 2 givenname: Hamidreza surname: Modares fullname: Modares, Hamidreza email: modares@uta.edu organization: University of Texas at Arlington Research Institute, 7300 Jack Newell Blvd. S., Ft. Worth, TX 76118, USA – sequence: 3 givenname: Shengyuan surname: Xu fullname: Xu, Shengyuan email: syxu@njust.edu.cn organization: School of Automation, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, PR China – sequence: 4 givenname: Frank L. surname: Lewis fullname: Lewis, Frank L. email: lewis@uta.edu organization: University of Texas at Arlington Research Institute, 7300 Jack Newell Blvd. S., Ft. Worth, TX 76118, USA – sequence: 5 givenname: Kyriakos G. surname: Vamvoudakis fullname: Vamvoudakis, Kyriakos G. email: kyriakos@ece.ucsb.edu organization: Center for Control, Dynamical-systems and Computation (CCDC), University of California, Santa Barbara, CA 93106-9560, USA |
CitedBy_id | crossref_primary_10_1016_j_cja_2021_08_005 crossref_primary_10_1002_oca_3174 crossref_primary_10_1016_j_neucom_2021_03_021 crossref_primary_10_1631_FITEE_2200010 crossref_primary_10_1109_ACCESS_2024_3400590 crossref_primary_10_1109_TCYB_2021_3090067 crossref_primary_10_1016_j_amc_2024_128979 crossref_primary_10_1109_JIOT_2023_3303448 crossref_primary_10_1109_TNNLS_2020_3044039 crossref_primary_10_1080_00207179_2020_1790663 crossref_primary_10_1109_TAES_2020_3010593 crossref_primary_10_1002_rnc_5263 crossref_primary_10_1109_TFUZZ_2018_2859904 crossref_primary_10_1109_TNNLS_2023_3291542 crossref_primary_10_1007_s11071_018_4228_8 crossref_primary_10_1109_TCYB_2018_2819695 crossref_primary_10_1109_TCYB_2018_2856089 crossref_primary_10_1109_TCNS_2024_3395852 crossref_primary_10_1016_j_jfranklin_2023_02_033 crossref_primary_10_1109_TCSI_2024_3366942 crossref_primary_10_1002_rnc_7710 crossref_primary_10_1109_TFUZZ_2021_3075501 crossref_primary_10_1016_j_neucom_2017_09_020 crossref_primary_10_1109_TAC_2018_2879568 crossref_primary_10_1109_TCYB_2025_3534463 crossref_primary_10_1007_s11063_022_11085_0 crossref_primary_10_1007_s11071_023_08496_6 crossref_primary_10_1002_acs_3945 crossref_primary_10_1109_TCYB_2018_2856510 crossref_primary_10_1016_j_automatica_2019_108656 crossref_primary_10_3390_app12157551 crossref_primary_10_1016_j_ast_2025_110080 crossref_primary_10_29252_joc_12_2_13 crossref_primary_10_1080_00207179_2018_1467044 crossref_primary_10_1109_TAC_2020_3027795 crossref_primary_10_1109_TSMC_2022_3177043 crossref_primary_10_1360_SST_2022_0208 crossref_primary_10_1002_rnc_7189 crossref_primary_10_1109_JSYST_2022_3223715 crossref_primary_10_1109_LCSYS_2020_3001240 crossref_primary_10_1016_j_isatra_2019_01_021 crossref_primary_10_1002_acs_2866 crossref_primary_10_1049_iet_cta_2017_0875 crossref_primary_10_1016_j_jfranklin_2025_107639 crossref_primary_10_1016_j_neunet_2022_08_010 crossref_primary_10_1109_TNSE_2022_3185019 
crossref_primary_10_1016_j_ifacol_2020_12_2180 crossref_primary_10_1016_j_ins_2023_118949 crossref_primary_10_1016_j_neunet_2018_06_007 crossref_primary_10_1109_TSMC_2018_2810117 crossref_primary_10_1016_j_neucom_2021_03_017 crossref_primary_10_1109_TSMC_2019_2944259 crossref_primary_10_1109_TSMC_2018_2861470 crossref_primary_10_1109_TCYB_2024_3419056 crossref_primary_10_1049_iet_cta_2020_0259 crossref_primary_10_1002_rnc_7698 crossref_primary_10_1002_rnc_7574 crossref_primary_10_1109_TITS_2024_3362959 crossref_primary_10_1109_TNSE_2023_3309816 crossref_primary_10_1016_j_automatica_2023_111468 crossref_primary_10_1109_TCYB_2018_2868715 crossref_primary_10_1016_j_nahs_2019_03_003 crossref_primary_10_1109_ACCESS_2023_3239665 crossref_primary_10_1109_TCYB_2017_2788819 crossref_primary_10_1109_TCYB_2024_3372606 crossref_primary_10_1016_j_ast_2021_106568 crossref_primary_10_1109_TAC_2021_3122382 crossref_primary_10_1109_TCYB_2022_3215716 crossref_primary_10_1109_TCYB_2019_2950262 crossref_primary_10_1109_TII_2021_3137816 crossref_primary_10_1080_00207179_2018_1441550 crossref_primary_10_1109_TNNLS_2019_2958948 crossref_primary_10_1002_rnc_4650 crossref_primary_10_1016_j_jfranklin_2022_01_012 crossref_primary_10_1109_TAES_2024_3407735 crossref_primary_10_1016_j_isatra_2018_10_005 crossref_primary_10_1109_TSMC_2017_2693209 crossref_primary_10_1049_iet_cta_2017_0259 crossref_primary_10_1109_TCYB_2016_2611613 crossref_primary_10_1109_TNNLS_2023_3287881 crossref_primary_10_1109_TASE_2023_3327264 crossref_primary_10_3390_math10152728 crossref_primary_10_1109_TCNS_2022_3181550 crossref_primary_10_12677_AAM_2022_111022 crossref_primary_10_1016_j_jfranklin_2023_06_015 crossref_primary_10_1016_j_neunet_2022_09_024 crossref_primary_10_1109_JSYST_2024_3391766 crossref_primary_10_1109_TCYB_2022_3195361 crossref_primary_10_1109_TNNLS_2020_2969249 crossref_primary_10_1109_ACCESS_2020_2970760 crossref_primary_10_1016_j_automatica_2021_110076 crossref_primary_10_1109_TCSI_2024_3377641 
crossref_primary_10_1016_j_ijepes_2023_109704 crossref_primary_10_1109_TNNLS_2020_3042508 crossref_primary_10_1080_00207179_2017_1300685 crossref_primary_10_1016_j_ast_2022_107759 crossref_primary_10_1016_j_jfranklin_2021_08_022 crossref_primary_10_1109_TSMC_2018_2889377 crossref_primary_10_1016_j_jfranklin_2024_107263 crossref_primary_10_1109_TCYB_2025_3530456 crossref_primary_10_1109_TSMC_2019_2914160 crossref_primary_10_1109_JSYST_2023_3318525 crossref_primary_10_1016_j_ins_2024_121234 crossref_primary_10_1049_iet_cta_2018_5832 crossref_primary_10_1002_rnc_7786 crossref_primary_10_1109_MSMC_2023_3282774 crossref_primary_10_1007_s11071_024_10493_2 crossref_primary_10_1016_j_jfranklin_2018_01_016 crossref_primary_10_1109_TNNLS_2020_3023711 crossref_primary_10_1016_j_isatra_2022_09_012 crossref_primary_10_1587_transfun_E99_A_1721 crossref_primary_10_1002_rnc_4538 crossref_primary_10_1016_j_neucom_2020_06_106 crossref_primary_10_1109_TAC_2022_3226142 crossref_primary_10_1177_09544100241277236 crossref_primary_10_1016_j_isatra_2020_10_043 crossref_primary_10_1109_TIE_2017_2782245 crossref_primary_10_1109_LCSYS_2022_3175665 crossref_primary_10_1080_00207179_2024_2447570 crossref_primary_10_1109_TCNS_2019_2959163 crossref_primary_10_1109_TCNS_2022_3203896 crossref_primary_10_1007_s40435_022_00983_9 crossref_primary_10_1016_j_ast_2020_105894 crossref_primary_10_1016_j_ast_2025_110052 crossref_primary_10_1016_j_neucom_2017_01_047 crossref_primary_10_1016_j_neucom_2021_01_063 crossref_primary_10_1002_rnc_7378 crossref_primary_10_1109_TICPS_2024_3373715 crossref_primary_10_1016_j_neunet_2024_106566 |
Cites_doi | 10.1109/TSMCC.2007.913919 10.1080/00207170903267039 10.1007/s12555-012-0495-1 10.1007/s12555-011-0609-1 10.1109/TCSI.2004.835655 10.1109/TAC.2004.834433 10.1016/S0378-4371(02)00772-0 10.1109/TAC.2009.2017977 10.1002/rnc.1760 10.1109/TAC.1986.1104342 10.1109/TAC.2003.812781 10.1007/s11424-009-9145-y 10.1002/acs.2348 10.1016/j.automatica.2006.02.013 10.1109/ACC.2006.1656455 10.1109/RiiSS.2013.6607932 10.1016/j.automatica.2004.11.034 10.1016/j.automatica.2011.01.054 10.1109/TAC.2005.846556 10.1115/1.2764508 10.1080/00207179.2011.654264 10.1109/TMECH.2009.2014057 10.1109/TAC.2004.834113 10.1109/TSMCB.2008.920998 10.1109/TNN.2008.2000204 10.1016/j.sysconle.2008.01.002 10.1109/TAC.2006.884959 10.1016/S1389-0417(01)00015-8 10.1016/j.automatica.2012.05.074 |
ContentType | Journal Article |
Copyright | 2016 Elsevier Ltd |
Copyright_xml | – notice: 2016 Elsevier Ltd |
DOI | 10.1016/j.automatica.2016.02.002 |
DatabaseName | CrossRef; Computer and Information Systems Abstracts; Electronics & Communications Abstracts; Technology Research Database; ProQuest Computer Science Collection; Advanced Technologies Database with Aerospace; Computer and Information Systems Abstracts Academic; Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef; Technology Research Database; Computer and Information Systems Abstracts – Academic; Electronics & Communications Abstracts; ProQuest Computer Science Collection; Computer and Information Systems Abstracts; Advanced Technologies Database with Aerospace; Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Technology Research Database |
Discipline | Engineering |
EISSN | 1873-2836 |
EndPage | 34 |
ExternalDocumentID | 10_1016_j_automatica_2016_02_002 S0005109816300346 |
ISSN | 0005-1098 |
IsPeerReviewed | true |
IsScholarly | true |
Keywords | L2-gain; Multi-agent system; External disturbances; Graphical games; Hamilton–Jacobi–Isaacs equations |
Language | English |
PageCount | 11 |
PublicationCentury | 2000 |
PublicationDate | July 2016 2016-07-00 20160701 |
PublicationDateYYYYMMDD | 2016-07-01 |
PublicationDate_xml | – month: 07 year: 2016 text: July 2016 |
PublicationDecade | 2010 |
PublicationTitle | Automatica (Oxford) |
PublicationYear | 2016 |
Publisher | Elsevier Ltd |
Publisher_xml | – name: Elsevier Ltd |
References_xml | – reference: (pp. 74–81). – year: 2014 ident: br000070 article-title: Cooperative control of multi-agent systems: optimal and adaptive design approaches – year: 2008 ident: br000025 article-title: optimal control and related minimax design problems: A dynamic game approach – reference: (pp. 1859–1864). – volume: 54 start-page: 1648 year: 2009 end-page: 1653 ident: br000035 article-title: Decentralized learning in finite Markov chains: Revisited publication-title: IEEE Transactions on Automatic Control – volume: 49 start-page: 1465 year: 2004 end-page: 1476 ident: br000040 article-title: Information flow and cooperative control of vehicle formations publication-title: IEEE Transactions on Automatic Control – volume: 50 start-page: 655 year: 2005 end-page: 661 ident: br000130 article-title: Consensus seeking in multi-agent systems under dynamically changing interaction topologies publication-title: IEEE Transactions on Automatic Control – volume: 2 start-page: 55 year: 2001 end-page: 66 ident: br000095 article-title: Value-function reinforcement learning in Markov games publication-title: Journal of Cognitive Systems Research – volume: 31 start-page: 519 year: 1986 end-page: 526 ident: br000185 article-title: Decentralized learning in finite Markov chains publication-title: IEEE Transactions on Automatic Control – volume: 48 start-page: 1598 year: 2012 end-page: 1611 ident: br000165 article-title: Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality publication-title: Automatica – reference: Luy, N.T., Thanh, N.T., & Tri, H.M. (2013). Reinforcement learning-based robust adaptive tracking control for multi-wheeled mobile robots synchronization with optimality. In – volume: 47 start-page: 797 year: 2011 end-page: 803 ident: br000075 article-title: On publication-title: Automatica – reference: Ren, W., Beard, R.W., & Atkins, E.M. (2005). 
A survey of consensus problems in multi-agent coordination. In – year: 2011 ident: br000020 article-title: Nonlinear – volume: 51 start-page: 1989 year: 2006 end-page: 1995 ident: br000010 article-title: Policy iteration on the hamilton–jacobi-isaacs equation for publication-title: IEEE Transactions on Automatic Control – volume: 19 start-page: 1243 year: 2008 end-page: 1252 ident: br000015 article-title: Neurodynamic programming and zero-sum games for constrained control systems publication-title: IEEE Transactions on Neural Networks – volume: 129 start-page: 678 year: 2007 end-page: 688 ident: br000145 article-title: High-order and model reference consensus algorithms in cooperative control of multivehicle systems publication-title: Journal of Dynamic Systems, Measurement, and Control – volume: 22 start-page: 35 year: 2009 end-page: 48 ident: br000080 article-title: control of networked multi-agent systems publication-title: Journal of Systems Science and Complexity – volume: 310 start-page: 521 year: 2002 end-page: 531 ident: br000175 article-title: Pinning control of scale-free dynamical networks publication-title: Physica A. Statistical Mechanics and its Applications – volume: 38 start-page: 976 year: 2008 end-page: 981 ident: br000170 article-title: Decentralized Learning in Markov Games publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics – reference: (pp. 87410K–87410K). 
– volume: 11 start-page: 666 year: 2013 end-page: 674 ident: br000190 article-title: Finite-gain publication-title: International Journal of Control, Automation and Systems – year: 1984 ident: br000150 article-title: Problems in decentralized decision making and computation – volume: 22 start-page: 1460 year: 2012 end-page: 1483 ident: br000160 article-title: Online solution of nonlinear two-player zero-sum games using synchronous policy iteration publication-title: International Journal of Robust and Nonlinear Control – volume: 42 start-page: 1177 year: 2006 end-page: 1182 ident: br000045 article-title: Tracking control for multi-agent consensus with an active leader and variable topology publication-title: Automatica – reference: (pp. 1648–1653). – volume: 38 start-page: 156 year: 2008 end-page: 172 ident: br000030 article-title: A Comprehensive Survey of Multiagent Reinforcement Learning publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews – volume: 83 start-page: 527 year: 2010 end-page: 537 ident: br000100 article-title: consensus control of multi-agent systems with switching topology: a dynamic output feedback control publication-title: International Journal of Control – volume: 14 start-page: 219 year: 2009 end-page: 228 ident: br000055 article-title: Robust finite-time consensus tracking algorithm for multirobot systems publication-title: IEEE Transactions on Mechatronics – reference: Vamvoudakis, K.G., Carrillo, L.R.G., & Hespanha, J.P. (2013). Learning consensus in adversarial environments. 
In – volume: 85 start-page: 384 year: 2012 end-page: 396 ident: br000180 article-title: Consensus and its publication-title: International Journal of Control – volume: 49 start-page: 1520 year: 2004 end-page: 1533 ident: br000120 article-title: Consensus problems in networks of agents with switching topology and time-delays publication-title: IEEE Transactions on Automatic Control – year: 2008 ident: br000135 article-title: Distributed consensus in multi-vehicle cooperative control – volume: 28 start-page: 232 year: 2014 end-page: 254 ident: br000115 article-title: Online solution of nonquadratic two-player zero-sum games arising in the publication-title: International Journal of Adaptive Control and Signal Processing – volume: 9 start-page: 1086 year: 2011 end-page: 1094 ident: br000105 article-title: Robust publication-title: International Journal of Control, Automation and Systems – year: 2009 ident: br000125 article-title: Cooperative control of dynamical systems: applications to autonomous vehicles – volume: 48 start-page: 988 year: 2003 end-page: 1001 ident: br000050 article-title: Coordination of groups of mobile autonomous agents using nearest neighbor rules publication-title: IEEE Transactions on Automatic Control – volume: 57 start-page: 643 year: 2008 end-page: 653 ident: br000090 article-title: Distributed robust publication-title: Systems & Control Letters – volume: 41 start-page: 779 year: 2005 end-page: 791 ident: br000005 article-title: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach publication-title: Automatica – reference: Lakshmanan, H., & Farias, D.P. (2006). Decentralized approximate dynamic programming for dynamic networks of agents. 
In – year: 2012 ident: br000065 article-title: Optimal control – volume: 51 start-page: 2074 year: 2004 end-page: 2087 ident: br000085 article-title: Pinning a complex dynamical network to its equilibrium publication-title: IEEE Transactions on Circuits and Systems I: Regular Papers – year: 2008 ident: 10.1016/j.automatica.2016.02.002_br000135 – volume: 38 start-page: 156 issue: 2 year: 2008 ident: 10.1016/j.automatica.2016.02.002_br000030 article-title: A Comprehensive Survey of Multiagent Reinforcement Learning publication-title: IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews doi: 10.1109/TSMCC.2007.913919 – volume: 83 start-page: 527 issue: 3 year: 2010 ident: 10.1016/j.automatica.2016.02.002_br000100 article-title: H∞ consensus control of multi-agent systems with switching topology: a dynamic output feedback control publication-title: International Journal of Control doi: 10.1080/00207170903267039 – year: 2008 ident: 10.1016/j.automatica.2016.02.002_br000025 – volume: 11 start-page: 666 issue: 4 year: 2013 ident: 10.1016/j.automatica.2016.02.002_br000190 article-title: Finite-gain Lp Consensus of Multi-agent Systems publication-title: International Journal of Control, Automation and Systems doi: 10.1007/s12555-012-0495-1 – year: 1984 ident: 10.1016/j.automatica.2016.02.002_br000150 – volume: 9 start-page: 1086 issue: 6 year: 2011 ident: 10.1016/j.automatica.2016.02.002_br000105 article-title: Robust H∞ consensus control of uncertain multi-agent systems with time delays publication-title: International Journal of Control, Automation and Systems doi: 10.1007/s12555-011-0609-1 – volume: 51 start-page: 2074 issue: 10 year: 2004 ident: 10.1016/j.automatica.2016.02.002_br000085 article-title: Pinning a complex dynamical network to its equilibrium publication-title: IEEE Transactions on Circuits and Systems I: Regular Papers doi: 10.1109/TCSI.2004.835655 – volume: 49 start-page: 1465 issue: 9 year: 2004 ident: 
|
StartPage | 24 |
SubjectTerms | [formula omitted]-gain; Algorithms; Differential equations; Disturbances; External disturbances; Games; Graphical games; Hamilton–Jacobi–Isaacs equations; Joining; Mathematical analysis; Multi-agent system; Multiagent systems; Online |
Title | Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control |
URI | https://dx.doi.org/10.1016/j.automatica.2016.02.002 https://www.proquest.com/docview/1825481103 |
Volume | 69 |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Multi-agent+zero-sum+differential+graphical+games+for+disturbance+rejection+in+distributed+control&rft.jtitle=Automatica+%28Oxford%29&rft.au=Jiao%2C+Qiang&rft.au=Modares%2C+Hamidreza&rft.au=Xu%2C+Shengyuan&rft.au=Lewis%2C+Frank+L.&rft.date=2016-07-01&rft.pub=Elsevier+Ltd&rft.issn=0005-1098&rft.eissn=1873-2836&rft.volume=69&rft.spage=24&rft.epage=34&rft_id=info:doi/10.1016%2Fj.automatica.2016.02.002&rft.externalDocID=S0005109816300346 |