电光与控制, 2018, 25 (9): 17, 网络出版: 2018-09-15  

基于自适应动态规划的未知模型非线性系统H2/H∞控制

H2/H∞ Control of an Unknown Model Nonlinear System Based on Adaptive Dynamic Programming
作者单位
火箭军工程大学,西安 710025
摘要
提出了一种在线的自适应动态规划算法, 近似求解耦合的哈密尔顿雅可比(Hamilton-Jacobi-Isaacs,HJI)方程, 获得非线性系统混合H2/H∞控制的纳什均衡策略。通过在控制策略和干扰策略中加入已知噪声, 从而不依赖系统的模型信息, 得到一个求解混合H2/H∞控制问题的未知模型的近似动态规划算法。分别使用2个评价神经网络和2个执行神经网络, 同步在线更新2个值函数、控制策略和干扰策略, 神经网络未知参数通过最小二乘法进行估计。仿真结果验证了算法的可行性。
Abstract
An online adaptive dynamic programming algorithm is proposed for getting the approximate solution of the coupled Hamilton-Jacobi-Isaacs Equations (HJIE), and obtaining the Nash equilibrium strategy of mixed H2/H∞ control of nonlinear system.By adding the detection signal to the control strategy and the interference strategy, an approximate dynamic programming algorithm is acquired for solving mixed H2/H∞ control problems with unknown model without depending on model information of the system.Two critic neural networks and two executive neural networks are used to synchronously update two value functions, control strategies and interference strategies online.The unknown parameters of the neural network are estimated by generalized least squares.The simulation results verify the feasibility of the algorithm.
参考文献

[1] BERNSTEIN D S, HADDAD W M.LQG control with an H∞ performance bound: a riccati equation approach[C]// American Control Conference, IEEE, 2009:796-802.

[2] 潘伟,王学勇,井元伟.基于遗传算法的混合H2/H∞状态反馈控制器[J].控制与决策,2005,20(2):132-136.

[3] 叶思隽,王新民,张清江,等.不确定系统混合H2/H∞鲁棒控制的直接迭代LMI方法[J].控制理论与应用,2011,28(2):247-255.

[4] 马清亮,杨海燕,吴旭光.多项式模糊系统混合H2/H∞控制[J].电光与控制,2017,24(7):1-6.

[5] 孙景亮,刘春生.基于自适应动态规划的导弹制导律研究综述[J].自动化学报,2017,43(7):1101-1113.

[6] 张化光,张欣,罗艳红,等.自适应动态规划综述[J].自动化学报,2013,39(4):303-311.

[7] LIU D R, YANG X, WANG D, et al.Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints[J].IEEE Transactions on Cybernetics, 2015, 45(7):1372-1385.

[8] ZHANG H G, QIN C B, LUO Y H.Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming[J].IEEE Transactions on Automation Science and Engineering, 2014, 11(3): 839-849.

[9] VAMVOUDAKIS K G, LEWIS F L.Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi-Equations[J].Automatica, 2011, 47(8):1556-1569.

[10] ALIYU M D S.An iterative computational scheme for solving the coupled Hamilton-Jacobi-Isaacs equations in nonzero-sum differential games of affine nonlinear systems[J].Decisions in Economics & Finance, 2017(40):1-30.

[11] ZHAO D B, XIA Z P, WANG D.Model-free optimal control for affine nonlinear systems with convergence analysis[J].IEEE Transaction on Automation Science and Engineering,2015, 12(4):1461-1468.

[12] TAPIA R A.The Kantorovich theorem for Newtons method[J].American Mathematical Monthly, 1971, 78(4):389-392.

[13] ZHANG Q, ZHAO D, ZHU Y.Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs[J].Neurocomputing, 2017(238):377-386.

蒲俊, 马清亮, 顾凡. 基于自适应动态规划的未知模型非线性系统H2/H∞控制[J]. 电光与控制, 2018, 25(9): 17. PU Jun, MA Qing-liang, GU Fan. H2/H∞ Control of an Unknown Model Nonlinear System Based on Adaptive Dynamic Programming[J]. Electronics Optics & Control, 2018, 25(9): 17.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!