×

Backstepping-based adaptive dynamic programming for missile-target guidance systems with state and input constraints. (English) Zbl 1402.93158

Summary: In this paper, a novel backstepping-based adaptive dynamic programming (ADP) method is developed to solve the problem of intercepting a maneuver target in the presence of full-state and input constraints. To address state constraints, a barrier Lyapunov function is introduced to every backstepping procedure. An auxiliary design system is employed to compensate the input constraints. Then, an adaptive backstepping feedforward control strategy is designed, by which the tracking problem for strict-feedback systems can be reduced to an equivalence optimal regulation problem for affine nonlinear systems. Secondly, an adaptive optimal controller is developed by using ADP technique, in which a critic network is constructed to approximate the solution of the associated Hamilton-Jacobi-Bellman (HJB) equation. Therefore, the whole control scheme consists of an adaptive feedforward controller and an optimal feedback controller. By utilizing Lyapunov’s direct method, all signals in the closed-loop system are guaranteed to be uniformly ultimately bounded (UUB). Finally, the effectiveness of the proposed strategy is demonstrated by using a simple nonlinear system and a nonlinear two-dimensional missile-target interception system.

MSC:

93C40 Adaptive control/observation systems
93C10 Nonlinear systems in control theory
93B52 Feedback control
90C39 Dynamic programming
93C95 Application models in control theory
93C41 Control/observation systems with incomplete information
Full Text: DOI

References:

[1] Shaferman, V.; Shima, T., Linear quadratic guidance laws for imposing a terminal intercept angle, J. Guidance Control Dyn., 31, 1400-1412, (2008)
[2] Bardhan, R.; Ghose, D., Nonlinear differential games-based impact-angle-constrained guidance law, J. Guidance Control Dyn., 38, 384-402, (2015)
[3] Cho, H.; Ryoo, C.-K.; Tsourdos, A.; White, B., Optimal impact angle control guidance law based on linearization about collision triangle, J. Guidance Control Dyn., 37, 958-964, (2014)
[4] Vamvoudakis, K. G.; Lewis, F. L., Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem, Automatica, 46, 878-888, (2010) · Zbl 1191.49038
[5] Vamvoudakis, K. G.; Miranda, M. F.; Hespanha, J. P., Asymptotically stable adaptive-optimal control algorithm with saturating actuators and relaxed persistence of excitation, IEEE Trans. Neural Netw. Learn. Syst., 27, 2386-2398, (2016)
[6] Wang, D.; Liu, D.; Li, H.; Ma, H., Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming, Inf. Sci., 282, 167-179, (2014) · Zbl 1354.93045
[7] Zhang, H.; Cui, L.; Luo, Y., Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP, IEEE Trans. Cybern., 43, 206-216, (2013)
[8] Zhang, H.; Jiang, H.; Luo, Y.; Xiao, G., Data-driven optimal consensus control for discrete-time multi-agent systems with unknown dynamics using reinforcement learning method, IEEE Trans. Indust. Electron., 64, 4091-4100, (2017)
[9] Zhang, H.; Jiang, H.; Luo, C.; Xiao, G., Discrete-time nonzero-sum games for multiplayer using policy-iteration-based adaptive dynamic programming algorithms, IEEE Trans. Cybern., 47, 3331-3340, (2017)
[10] Sun, J.; Liu, C.; Ye, Q., Robust differential game guidance laws design for uncertain interceptor-target engagement via adaptive dynamic programming, Int. J. Control, 90, 990-1004, (2017) · Zbl 1366.93142
[11] Zargarzadeh, H.; Dierks, T.; Jagannathan, S., Optimal control of nonlinear continuous-time systems in strict-feedback form, IEEE Trans. Neural Netw. Learn. Syst., 26, 2535-2549, (2015)
[12] Zargarzadeh, H.; Dierks, T.; Jagannathan, S., Adaptive neural network-based optimal control of nonlinear continuous-time systems in strict-feedback form, Int. J. Adapt. Control Signal Process., 28, 305-324, (2014) · Zbl 1331.93119
[13] Sun, K. K.; Li, Y. M.; Tong, S. C., Fuzzy adaptive output feedback optimal control design for strict-feedback nonlinear systems, IEEE Trans. Syst. Man Cybern.: Syst., 47, 33-44, (2017)
[14] Sun, K.; Sui, S.; Tong, S., Optimal adaptive fuzzy FTC design for strict-feedback nonlinear uncertain systems with actuator faults, Fuzzy Sets Syst., 316, 20-34, (2017) · Zbl 1392.93027
[15] Li, Y.; Sun, K.; Tong, S., Observer-based adaptive fuzzy fault-tolerant optimal control for SISO nonlinear systems, IEEE Trans. Cybern., 1-13, (2018)
[16] Tong, S.; Sun, K.; Sui, S., Observer-based adaptive fuzzy decentralized optimal control design for strict-feedback nonlinear large-scale systems, IEEE Trans. Fuzzy Syst., 26, 569-584, (2018)
[17] Li, Y.; Sun, K.; Tong, S., Adaptive fuzzy robust fault-tolerant optimal control for nonlinear large-scale systems, IEEE Trans. Fuzzy Syst., 1-14, (2017)
[18] Sun, K.; Sui, S.; Tong, S., Fuzzy adaptive decentralized optimal control for strict feedback nonlinear large-scale systems, IEEE Trans. Cybern., 48, 1326-1339, (2018)
[19] Zhou, D.; Xu, B., Adaptive dynamic surface guidance law with input saturation constraint and autopilot dynamics, J. Guidance Control Dyn., 39, 1155-1162, (2016)
[20] He, S. M.; Wang, W.; Wang, J., Adaptive backstepping impact angle control with autopilot dynamics and acceleration saturation consideration, Int. J. Robust Nonlinear Control, 27, 3777-3793, (2017) · Zbl 1386.93160
[21] Tang, Z. L.; Ge, S. S.; Tee, K. P.; He, W., Robust adaptive neural tracking control for a class of perturbed uncertain nonlinear systems with state constraints, IEEE Trans. Syst. Man Cybern.: Syst., 46, 1618-1629, (2016)
[22] Liu, Y.-J.; Tong, S., Barrier Lyapunov functions-based adaptive control for a class of nonlinear pure-feedback systems with full state constraints, Automatica, 64, 70-75, (2016) · Zbl 1329.93088
[23] Liu, Y.-J.; Tong, S., Barrier Lyapunov functions for nussbaum gain adaptive control of full state constrained nonlinear systems, Automatica, 76, 143-152, (2017) · Zbl 1352.93062
[24] Li, Y.; Tong, S., Adaptive fuzzy output constrained control design for multi-input multioutput stochastic nonstrict-feedback nonlinear systems, IEEE Trans. Cybern., 47, 4086-4095, (2017)
[25] Chen, Z.; Li, Z.; Chen, C. L.P., Adaptive neural control of uncertain MIMO nonlinear systems with state and input constraints, IEEE Trans. Neural Netw. Learn. Syst., 28, 1318-1330, (2017)
[26] Zhou, Q.; Wang, L.; Wu, C.; Li, H.; Du, H., Adaptive fuzzy control for nonstrict-feedback systems with input saturation and output constraint, IEEE Trans. Syst. Man Cybern.: Syst., 47, 1-12, (2017)
[27] Fan, Q.-Y.; Yang, G.-H., Adaptive nearly optimal control for a class of continuous-time nonaffine nonlinear systems with inequality constraints, ISA Trans., 66, 122-133, (2017)
[28] Fan, Q. Y.; Yang, G. H.; Ye, D., Quantization-based adaptive actor-critic tracking control with tracking error constraints, IEEE Trans. Neural Netw. Learn. Syst., 29, 970-980, (2018)
[29] Chen, M.; Tao, G.; Jiang, B., Dynamic surface control using neural networks for a class of uncertain nonlinear systems with input saturation, IEEE Trans. Neural Netw. Learn. Syst., 26, 2086-2097, (2015)
[30] Liu, Y. J.; Li, J.; Tong, S.; Chen, C. L.P., Neural network control-based adaptive learning design for nonlinear systems with full-state constraints, IEEE Trans. Neural Netw. Learn. Syst., 27, 1562-1571, (2016)
[31] Tee, K. P.; Ge, S. S.; Tay, E. H., Barrier Lyapunov functions for the control of output-constrained nonlinear systems, Automatica, 45, 918-927, (2009) · Zbl 1162.93346
[32] Wang, C.; Wu, Y.; Yu, J., Barrier Lyapunov functions-based dynamic surface control for pure-feedback systems with full state constraints, IET Control Theory Appl., 11, 524-530, (2017)
[33] Ren, B.; Ge, S. S.; Tee, K. P.; Lee, T. H., Adaptive neural control for output feedback nonlinear systems using a barrier Lyapunov function, IEEE Trans. Neural Netw., 21, 1339-1345, (2010)
[34] Sutton, R. S.; Barto, A. G., Reinforcement learning: an introduction, (1998), MIT Press Cambridge
[35] Liu, D.; Yang, X.; Wang, D.; Wei, Q., Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints, IEEE Trans. Cybern., 45, 1372-1385, (2015)
[36] T. Dierks, S. Jagannathan, Optimal control of affine nonlinear continuous-time systems, in: Proceedings of the American Control Conference (ACC), 2010, Marriott Waterfront, Baltimore, MD, USA, 2010, pp. 1568-1573.; T. Dierks, S. Jagannathan, Optimal control of affine nonlinear continuous-time systems, in: Proceedings of the American Control Conference (ACC), 2010, Marriott Waterfront, Baltimore, MD, USA, 2010, pp. 1568-1573.
[37] Wang, D.; He, H.; Zhao, B.; Liu, D., Adaptive near-optimal controllers for non-linear decentralised feedback stabilisation problems, IET Control Theory Appl., 11, 799-806, (2017)
[38] Sun, J.; Liu, C., Disturbance observer-based robust missile autopilot design with full-state constraints via adaptive dynamic programming, J. Frankl. Inst. B, 355, 2344-2368, (2018) · Zbl 1393.93068
[39] Yang, X.; Liu, D.; Wang, D., Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints, Int. J. Control, 87, 553-566, (2014) · Zbl 1317.93158
[40] Modares, H.; Lewis, F. L.; Naghibi-Sistani, M.-B., Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems, Automatica, 50, 193-202, (2014) · Zbl 1298.49042
[41] Tee, K. P.; Ge, S. S., Control of nonlinear systems with partial state constraints using a barrier Lyapunov function, Int. J. Control, 84, 2008-2023, (2011) · Zbl 1236.93099
[42] Li, R.; Chen, M.; Wu, Q., Adaptive neural tracking control for uncertain nonlinear systems with input and output constraints using disturbance observer, Neurocomputing, 235, 27-37, (2017)
[43] Wang, D.; Mu, C., Adaptive-critic-based robust trajectory tracking of uncertain dynamics and its application to a spring-mass-damper system, IEEE Trans. Indust. Electron., 65, 654-663, (2018)
[44] Zhang, H.; Qu, Q.; Xiao, G.; Cui, Y., Optimal guaranteed cost sliding mode control for constrained-input nonlinear systems with matched and unmatched disturbances, IEEE Trans. Neural Netw. Learn. Syst., 29, 2112-2126, (2018)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.