带饱和执行器的非线性离散时滞系统的最优控制

doi:10.12068/j.issn.1005-3026.2014.04.002

东北大学学报:自然科学版 ›› 2014, Vol. 35 ›› Issue (4): 461-464.DOI: 10.12068/j.issn.1005-3026.2014.04.002

带饱和执行器的非线性离散时滞系统的最优控制

王涛，罗艳红

(东北大学信息科学与工程学院，辽宁沈阳110819)

收稿日期:2013-01-14 修回日期:2013-01-14 出版日期:2014-04-15 发布日期:2013-11-22
通讯作者: 王涛
作者简介:王涛(1979-)，男，辽宁丹东人，东北大学博士研究生，沈阳师范大学讲师.
基金资助:
国家自然科学基金资助项目(50977008，61034005);国家基础研究计划项目(2009CB320601)；辽宁省自然科学基金资助项目(201202201).

Optimal Control for Nonlinear DiscreteTime Time Delay Systems with Saturating Actuators

WANG Tao， LUO Yanhong

School of Information Science ＆ Engineering， Northeastern University， Shenyang 110819， China.

Received:2013-01-14 Revised:2013-01-14 Online:2014-04-15 Published:2013-11-22
Contact: WANG Tao
About author:-
Supported by:
-

摘要/Abstract

摘要： 主要针对带有饱和执行器的时滞非线性离散时间系统更加一般的形式，通过启发式动态规划(HDP)算法求解无限时间最优控制策略问题，并在值函数中引入折扣因子.首先通过迭代HDP算法给出值函数序列和相应的控制序列，并给出了收敛性证明，即值函数序列收敛到值函数的最优值，以及控制序列收敛到最优控制；其次为了实现HDP算法，引入3个神经网络:模型网络、评判网络、控制作用网络.模型网络用来近似系统模型，评判网络用来近似值函数，控制作用网络用来近似控制；最后通过一个仿真例子说明上述方法的可行性.

关键词: 近似动态规划, 启发式动态规划, 值函数, 神经网络, 最优控制

Abstract: For the more general form of nonlinear discretetime time delays systems with saturating actuators， an infinitetime optimal control scheme was developed by heuristic dynamic programming(HDP) algorithm. In the proposed scheme， the discount factor was added in the value function. Firstly， value function series and control series were given through iterative HDP algorithm， and the convergence analysis was presented to prove that value function series and control series reach the optimal value simultaneously. Secondly， three neural networks(NN)which are model NN， critic NN， action NN were introduced to carry out the HDP algorithm. Model NN was used to approximate system model， critic NN to approximate value function， action NN to approximate control policy. Lastly， the validity of HDP algorithm was illustrated by one simulation example.

Key words: approximate dynamic programming, heuristic dynamic programming, value function, neural networks, optimal control

中图分类号:

TP273.1

王涛，罗艳红. 带饱和执行器的非线性离散时滞系统的最优控制[J]. 东北大学学报:自然科学版, 2014, 35(4): 461-464.

WANG Tao， LUO Yanhong. Optimal Control for Nonlinear DiscreteTime Time Delay Systems with Saturating Actuators[J]. Journal of Northeastern University Natural Science, 2014, 35(4): 461-464.

参考文献

[1] Lewis F L，Syrmos V L.Optiaml control［M］.2nd ed.Hoboken:Wiley，1995.
[2] Murray J J，Cox C J，Lendaris G G，et al.Adaptive dynamic programming［J］.IEEE Transactions on Systems，Man and Cybernetics:C，2002，32(1):140/153.
[3] Tamimi A，Lewis F L，AbuKhalaf M.Discretetime nonlinear HJB solution using approximate dynamic programming:convergence proof［J］.IEEE Transactions on Systems，Man and Cybernetics:B，2008，38(4):943/949.
[4] Werbos P J.Approximate dynamic programming for realtime control and neural modeling［M］.New York:Van Nostrand Reinhold，1992.
[5] Sussmann H J，Sontag E D，Yang Y D.A general result on the stabilization of linear systems using bounded controls ［J］.IEEE Transactions on Automatic Control，1994，39(12):2411/2425.
[6] Saberi A，Lin Z L，Teel A R.Control of linear systems with saturating actuators［J］.IEEE Transaction on Automatic Control，1996，41(3):368/378.
[7] Luo Y H，Zhang H G.Approximate optimal control for a class of nonlinear discretetime systems with saturating actuators［J］.Progress in Natural Science，2008，18:1023/1029.
[8] Wei Q L，Zhang H G，Liu D R，et al.An optimal control scheme for a class of discretetime nonlinear systems with time delays using adaptive dynamic programming［J］.Acta Automatica Sinica，2010，36(1):121/129.
[9] Song R Z，Zhang H G，Luo Y H，et al.Optimal control laws for timedelays systems with saturating actuators based on heuristic dynamic programming［J］.Neurocomputing，2010，73:3020/3027.
[10] Wang F Y，Jing N，Liu D R，et al.Adaptive dynamic programming for finitehorizon optimal control of discretetime nonlinear systems with εerror bound［J］.IEEE Transaction on Neural Networks，2011，22(1):24/36.

[1]	刘洋，闫冬梅，孟范伟. 基于Transformer改进的两分支行人重识别算法[J]. 东北大学学报（自然科学版）, 2023, 44(1): 26-32.
[2]	张春雷，李鹤，董茂林，张圣杰. 燃料电池空气供应系统自适应神经网络滑模控制[J]. 东北大学学报（自然科学版）, 2022, 43(9): 1270-1276.
[3]	马源源，刘晏泽，刘呈隆，张甜洁. 中国投资者多角度舆情分析及其在股市预测中的作用[J]. 东北大学学报（自然科学版）, 2022, 43(8): 1201-1209.
[4]	张禹，何楷文，李清书，巩亚东. 面向STEP-NC自由曲面特征的加工操作方法智能决策[J]. 东北大学学报（自然科学版）, 2022, 43(7): 981-987.
[5]	季策，张晓. 基于GSA-BP神经网络的OFDM系统信道估计算法[J]. 东北大学学报（自然科学版）, 2022, 43(6): 769-775.
[6]	杨博文，霍军周，张伟，张占葛. 服役结构超前载荷实时预测方法的研究[J]. 东北大学学报（自然科学版）, 2022, 43(4): 541-550.
[7]	范纯龙，李彦达，夏秀峰，乔建忠. 基于随机梯度上升和球面投影的通用对抗攻击方法[J]. 东北大学学报（自然科学版）, 2022, 43(2): 168-175.
[8]	陈兵，韩烬阳，唐晓垒，夏搏然. 基于机器学习的拉矫延伸率预测模型及数值分析[J]. 东北大学学报（自然科学版）, 2022, 43(2): 236-242.
[9]	井元伟，谢海修，白云. TCP/AWM网络系统的自适应有限时间漏斗拥塞控制[J]. 东北大学学报（自然科学版）, 2022, 43(10): 1369-1375.
[10]	王璐，王帅，张国峰，徐礼胜. 基于语义分割注意力与可见区域预测的行人检测方法[J]. 东北大学学报（自然科学版）, 2021, 42(9): 1261-1267.
[11]	郑艳，姜源祥. 基于特征融合的说话人聚类算法[J]. 东北大学学报（自然科学版）, 2021, 42(7): 952-959.
[12]	于洪亮，王旭，杨丹，李维军. 基于电流观测器的链式STATCOM反步控制方法[J]. 东北大学学报（自然科学版）, 2021, 42(6): 761-767.
[13]	张涛，刘天威，杜文丽. 一种基于卷积神经网络的区域调光技术[J]. 东北大学学报（自然科学版）, 2021, 42(5): 624-632.
[14]	廖志伟，陈琳韬，黄杰栋，庄竞. 基于特征空间变换与LSTM的中短期电煤价格预测[J]. 东北大学学报（自然科学版）, 2021, 42(4): 483-493.
[15]	张永超，李琦，任朝晖，周世华. 基于域适应与分类器差异的滚动轴承跨域故障诊断[J]. 东北大学学报（自然科学版）, 2021, 42(3): 367-372.

带饱和执行器的非线性离散时滞系统的最优控制

Optimal Control for Nonlinear DiscreteTime Time Delay Systems with Saturating Actuators

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价