×

An improvement of single-network adaptive critic design for nonlinear systems with asymmetry constraints. (English) Zbl 1423.93167

Summary: Traditional approximate/adaptive dynamic programming (ADP) methods can handle a very special class of systems subject to symmetry constraints. In this study, I extend the exiting ADP to a broader class of nonlinear dynamic systems with asymmetry constraints. Firstly, I propose a novel nonquadratic cost function, based on which the developed optimal controller by solving Hamilton-Jacobi-Bellman equation can limit its value to arbitrarily prescribed bound. Then, to avoid “curse of dimensionality”, I approximately implement the addressed controller via single-network adaptive critic design. Fuzzy hyperbolic model is introduced to construct the single critic network by approximating optimal cost function, from which I further derive the optimal control law. The potential advantages are that the control structure is simple and the computational load is low. Lyapunov synthesis proves the ultimately uniformly bounded stability of closed-loop control system. Finally, numerical simulation results verify the efficiency and superiority of the proposed approach.

MSC:

93C40 Adaptive control/observation systems
93C10 Nonlinear systems in control theory
93C42 Fuzzy control/observation systems
93C20 Control/observation systems governed by partial differential equations
Full Text: DOI

References:

[1] Xiao, G.; Zhang, H.; Luo, Y., Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming, Neurocomputing, 165, 163-170 (2015)
[2] Dai, M.; Xiao, F.; Wei, B., Event-triggered and quantized self-triggered control for multi-agent systems based on relative state measurements, J. Frankl. Inst., 356, 6, 3711-3732 (2019) · Zbl 1411.93114
[3] Tang, L.; Liu, Y.; Chen, C., Adaptive critic design for pure-feedback discrete-time mimo systems preceded by unknown backlashlike hysteresis, IEEE Trans. Neural Netw. Learn. Syst., 29, 11, 5681-5690 (2018)
[4] Zhang, J.; Zhang, H.; Luo, Y.; Feng, T., Model-free optimal control design for a class of linear discrete-time systems with multiple delays using adaptive dynamic programming, Neurocomputing, 135, 163-170 (2014)
[5] Wang, D.; Liu, D., Learning and guaranteed cost control with event-based adaptive critic implementation, IEEE Trans. Neural Netw. Learn. Syst., 29, 12, 6004-6014 (2018)
[6] Gao, W.; Jiang, Y.; Jiang, Z.; Chai, T., Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming, Automatica, 72, 37-45 (2016) · Zbl 1344.93060
[7] Bian, T.; Jiang, Z., Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design, Automatica, 71, 348-360 (2016) · Zbl 1343.93095
[8] Qu, Q.; Zhang, H.; Feng, T.; Jiang, H., Decentralized adaptive tracking control scheme for nonlinear large-scale interconnected systems via adaptive dynamic programming, Neurocomputing, 225, 1-10 (2017)
[9] Wang, D.; Liu, D.; Zhang, Y.; Li, H., Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems, Neural Netw., 97, 11-18 (2018) · Zbl 1441.93066
[10] Yang, X.; Liu, D.; Wei, Q.; Wang, D., Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming, Neurocomputing, 198, 80-90 (2016)
[11] Yang, X.; He, H., Adaptive critic designs for optimal control of uncertain nonlinear systems with unmatched interconnections, Neural Netw., 105, 142-153 (2018) · Zbl 1441.93223
[12] Jiang, H.; Zhang, H.; Han, J.; Zhang, K., Iterative adaptive dynamic programming methods with neural network implementation for multi-player zero-sum games, Neurocomputing, 307, 54-60 (2018)
[13] Padhi, R.; Unnikrishnan, N.; Wang, X.; Balakrishnan, S., A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems, Neural Netw., 19, 1648-1660 (2006) · Zbl 1120.90065
[14] Bu, X.; Lei, H.; Gao, Y., Robust tracking control of hypersonic flight vehicles: a continuous model-free control approach, Acta Astronaut., 161, 234-240 (2019)
[15] Bu, X.; Wu, X.; He, G.; Huang, J., Novel adaptive neural control design for a constrained flexible air-breathing hypersonic vehicle based on actuator compensation, Acta Astronaut., 120, 75-86 (2016)
[16] Bu, X.; Wu, X.; Wei, D.; Huang, J., Neural-approximation-based robust adaptive control of flexible air-breathing hypersonic vehicles with parametric uncertainties and control input constraints, Inf. Sci. (N.Y.), 346-347, 29-43 (2016) · Zbl 1398.93261
[17] Abu-Khalaf, M.; Lewis, F., Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach, Automatica, 41, 779-791 (2005) · Zbl 1087.49022
[18] Huang, Y.; Wang, D.; Liu, D., Bounded robust control design for uncertain nonlinear systems using single-network adaptive dynamic programming, Neurocomputing, 266, 128-140 (2017)
[19] Sun, J.; Liu, C., Backstepping-based adaptive dynamic programming for missile-target guidance systems with state and input constraints, J. Frankl. Inst., 355, 8412-8440 (2018) · Zbl 1402.93158
[20] Bertsekas, D.; Tsitsiklis, J., Neuro-Dynamic Programming (1996), Athena Scientific: Athena Scientific Belmont, MA · Zbl 0924.68163
[21] Zhang, H.; Quan, Y., Modeling, identification, and control of a class of nonlinear systems, IEEE Trans. Fuzzy Syst., 9, 2, 349-354 (2001)
[22] Hagan, M.; Demuth, H.; Beale, M., Neural Network Design (1996), PWS Publishing: PWS Publishing Boston, MA
[23] Zhang, J.; Zhang, H.; Luo, Y.; Liang, H., Nearly optimal control scheme using adaptive dynamic programming based on generalized fuzzy hyperbolic model, Acta Autom. Sin., 39, 2, 142-149 (2013) · Zbl 1289.93056
[24] Khalil, H., Nonlinear Systems (2001), Prentice Hall: Prentice Hall New Jersey
[25] Bu, X.; Wu, X.; Huang, J.; Wei, D., A guaranteed transient performance-based adaptive neural control scheme with low-complexity computation for flexible air-breathing hypersonic vehicles, Nonlinear Dyn., 84, 2175-2194 (2016) · Zbl 1355.93126
[26] Gao, T.; Liu, Y.; Liu, L.; Li, D., Adaptive neural network-based control for a class of nonlinear pure-feedback systems with time-varying full state constraints, IEEE/CAA J. Autom. Sin., 5, 5, 923-933 (2018)
[27] Liu, L.; Liu, Y.; Tong, S., Fuzzy based multi-error constraint control for switched nonlinear systems and its applications, IEEE Trans. Fuzzy Syst., 27, 8, 1519-1531 (2019)
[28] Li, D.; Li, D., Adaptive neural tracking control for an uncertain state constrained robotic manipulator with time-varying delays, IEEE Trans. Syst. Man Cybern.: Syst., 48, 12, 2219-2228 (2018)
[29] Bu, X., Actor-critic reinforcement learning control of non-strict feedback nonaffine dynamic systems, IEEE Access., 7, 65569-65578 (2019)
[30] Bu, X.; Wu, X.; Zhang, R.; Ma, Z., A neural approximation-based novel back-stepping control scheme for air-breathing hypersonic vehicles with uncertain parameters, Proc. Inst. Mech. Eng. I: J. Syst. Control Eng., 230, 3, 231-243 (2016)
[31] Bu, X.; Lei, H., A fuzzy wavelet neural network based approach to hypersonic flight vehicle direct nonaffine hybrid control, Nonlinear Dyn., 94, 1657-1668 (2018) · Zbl 1422.93099
[32] Luo, B.; Wu, H.; Huang, T.; Liu, D., Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design, Automatica, 50, 12, 3281-3290 (2014) · Zbl 1309.93188
[33] Zhang, J., Researches on Fuzzy and Model-Free Optimal Control Based on Single-Network Adaptive Dynamic Programming (2014), Northeastern University: Northeastern University Shenyang
[34] Zhang, J.; Liang, H.; Feng, T., Optimal control for nonlinear continuous systems by adaptive dynamic programming based on fuzzy basis functions, Appl. Math. Model., 40, 6766-6774 (2016) · Zbl 1465.49020
[35] Cheng, B.; Li, Z., Consensus disturbance rejection with event-triggered communications, J. Frankl. Inst., 356, 2, 956-974 (2019) · Zbl 1406.93202
[36] Wang, Y.; Zheng, W.; Zhang, H., Dynamic event-based control of nonlinear stochastic systems, IEEE Trans. Autom. Control, 62, 12, 6544-6551 (2017) · Zbl 1390.93847
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.