×

Approximately adaptive neural cooperative control for nonlinear multiagent systems with performance guarantee. (English) Zbl 1362.93078

Summary: This paper studies the cooperative control problem for a class of multiagent dynamical systems with partially unknown nonlinear system dynamics. In particular, the control objective is to solve the state consensus problem for multiagent systems based on the minimisation of certain cost functions for individual agents. Under the assumption that there exist admissible cooperative controls for such class of multiagent systems, the formulated problem is solved through finding the optimal cooperative control using the approximate dynamic programming and reinforcement learning approach. With the aid of neural network parameterisation and online adaptive learning, our method renders a practically implementable approximately adaptive neural cooperative control for multiagent systems. Specifically, based on the Bellman’s principle of optimality, the Hamilton-Jacobi-Bellman (HJB) equation for multiagent systems is first derived. We then propose an approximately adaptive policy iteration algorithm for multiagent cooperative control based on neural network approximation of the value functions. The convergence of the proposed algorithm is rigorously proved using the contraction mapping method. The simulation results are included to validate the effectiveness of the proposed algorithm.

MSC:

93C40 Adaptive control/observation systems
93A14 Decentralized systems
93C10 Nonlinear systems in control theory
68T42 Agent technology and artificial intelligence
92B20 Neural networks for/in biological studies, artificial life and related topics
93C15 Control/observation systems governed by ordinary differential equations
Full Text: DOI

References:

[1] Astrom K.J., Adaptive control (1995) · Zbl 0434.92027
[2] DOI: 10.1016/S0005-1098(97)00128-3 · Zbl 0949.93022 · doi:10.1016/S0005-1098(97)00128-3
[3] DOI: 10.1515/9781400874651 · doi:10.1515/9781400874651
[4] DOI: 10.1080/00207721.2010.520096 · Zbl 1307.93025 · doi:10.1080/00207721.2010.520096
[5] DOI: 10.1109/TAC.2004.834433 · Zbl 1365.90056 · doi:10.1109/TAC.2004.834433
[6] DOI: 10.1109/TNN.2002.804306 · doi:10.1109/TNN.2002.804306
[7] Ge S.S., Stable adaptive neural network control (2001)
[8] DOI: 10.1109/TAC.2003.812781 · Zbl 1364.93514 · doi:10.1109/TAC.2003.812781
[9] Khalil H., Nonlinear systems, 3. ed. (2003)
[10] DOI: 10.1109/72.363477 · doi:10.1109/72.363477
[11] Lewis F.L., IEEE Control Systems Magazine, 32 pp 77– (2012)
[12] DOI: 10.1109/TAC.2004.825639 · Zbl 1365.93208 · doi:10.1109/TAC.2004.825639
[13] DOI: 10.1137/050626405 · Zbl 1141.93032 · doi:10.1137/050626405
[14] Luenberger D.G., Introduction to dynamic systems (1979) · Zbl 0458.93001
[15] DOI: 10.1109/TAC.2004.841888 · Zbl 1365.93268 · doi:10.1109/TAC.2004.841888
[16] DOI: 10.1109/TAC.2010.2044274 · Zbl 1368.34067 · doi:10.1109/TAC.2010.2044274
[17] DOI: 10.1002/9780470182963 · Zbl 1156.90021 · doi:10.1002/9780470182963
[18] Qu Z, Robust control of nonlinear uncertain systems (1998)
[19] Qu Z, Cooperative control of dynamical systems (2009)
[20] DOI: 10.1109/TAC.2008.920232 · Zbl 1367.93076 · doi:10.1109/TAC.2008.920232
[21] DOI: 10.1109/TAC.2005.846556 · Zbl 1365.93302 · doi:10.1109/TAC.2005.846556
[22] DOI: 10.1007/978-1-84800-015-5 · doi:10.1007/978-1-84800-015-5
[23] DOI: 10.1109/JPROC.2006.887293 · Zbl 1376.68138 · doi:10.1109/JPROC.2006.887293
[24] DOI: 10.1109/TAC.2004.834113 · Zbl 1365.93301 · doi:10.1109/TAC.2004.834113
[25] DOI: 10.1109/72.165588 · doi:10.1109/72.165588
[26] DOI: 10.1109/TSMC.1979.4310171 · Zbl 0398.49001 · doi:10.1109/TSMC.1979.4310171
[27] DOI: 10.1109/72.914523 · doi:10.1109/72.914523
[28] DOI: 10.1007/978-1-4615-3618-5 · doi:10.1007/978-1-4615-3618-5
[29] DOI: 10.1109/TAC.2007.895948 · Zbl 1366.93414 · doi:10.1109/TAC.2007.895948
[30] DOI: 10.1016/j.automatica.2010.02.018 · Zbl 1191.49038 · doi:10.1016/j.automatica.2010.02.018
[31] DOI: 10.1016/j.automatica.2012.05.074 · Zbl 1267.93190 · doi:10.1016/j.automatica.2012.05.074
[32] DOI: 10.1016/j.neunet.2009.03.008 · Zbl 1335.93068 · doi:10.1016/j.neunet.2009.03.008
[33] DOI: 10.1016/j.automatica.2008.08.017 · Zbl 1158.93354 · doi:10.1016/j.automatica.2008.08.017
[34] DOI: 10.1049/iet-cta.2008.0163 · doi:10.1049/iet-cta.2008.0163
[35] DOI: 10.1007/s10514-006-5942-5 · doi:10.1007/s10514-006-5942-5
[36] Wang L.X, Adaptive fuzzy systems and control: Design and analysis (1994)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.