您的位置: 专家智库 > >

国家自然科学基金(61374105)

作品数:7 被引量:34H指数:3
发文基金:国家自然科学基金北京市自然科学基金中国博士后科学基金更多>>
相关领域:理学自动化与计算机技术电气工程更多>>

文献类型

  • 7篇中文期刊文章

领域

  • 4篇理学
  • 2篇自动化与计算...
  • 1篇电气工程

主题

  • 3篇CHAOTI...
  • 2篇最优跟踪控制
  • 2篇混沌系统
  • 2篇跟踪控制
  • 2篇OPTIMA...
  • 2篇REINFO...
  • 2篇CONTIN...
  • 1篇迭代
  • 1篇迭代控制
  • 1篇迭代算法
  • 1篇动态规划
  • 1篇动态规划方法
  • 1篇一致最终有界
  • 1篇有界
  • 1篇离散混沌系统
  • 1篇混沌
  • 1篇NEURAL...
  • 1篇PDP
  • 1篇RESIDE...
  • 1篇SOLAR_...

传媒

  • 4篇Chines...
  • 3篇IEEE/C...

年份

  • 1篇2018
  • 3篇2017
  • 2篇2015
  • 1篇2014
7 条 记 录,以下是 1-7
排序方式:
Chaotic system optimal tracking using data-based synchronous method with unknown dynamics and disturbances
2017年
We develop an optimal tracking control method for chaotic system with unknown dynamics and disturbances. The method allows the optimal cost function and the corresponding tracking control to update synchronously. According to the tracking error and the reference dynamics, the augmented system is constructed. Then the optimal tracking control problem is defined. The policy iteration(PI) is introduced to solve the min-max optimization problem. The off-policy adaptive dynamic programming(ADP) algorithm is then proposed to find the solution of the tracking Hamilton–Jacobi–Isaacs(HJI) equation online only using measured data and without any knowledge about the system dynamics. Critic neural network(CNN), action neural network(ANN), and disturbance neural network(DNN) are used to approximate the cost function, control, and disturbance. The weights of these networks compose the augmented weight matrix, and the uniformly ultimately bounded(UUB) of which is proven. The convergence of the tracking error system is also proven. Two examples are given to show the effectiveness of the proposed synchronous solution method for the chaotic system tracking problem.
宋睿卓魏庆来
关键词:ZERO-SUM
PDP: Parallel Dynamic Programming被引量:14
2017年
Deep reinforcement learning is a focus research area in artificial intelligence. The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods. The principle of adaptive dynamic programming(ADP)is first presented instead of direct dynamic programming(DP),and the inherent relationship between ADP and deep reinforcement learning is developed. Next, analytics intelligence, as the necessary requirement, for the real reinforcement learning, is discussed. Finally, the principle of the parallel dynamic programming, which integrates dynamic programming and analytics intelligence, is presented as the future computational intelligence.
Fei-Yue WangJie ZhangQinglai WeiXinhu ZhengLi Li
Off-policy integral reinforcement learning optimal tracking control for continuous-time chaotic systems
2015年
This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids the identification of system dynamics. In this paper, the performance index function is first given based on the system tracking error and control error. For solving the Hamilton–Jacobi–Bellman(HJB) equation, an off-policy IRL algorithm is proposed.It is proven that the iterative control makes the tracking error system asymptotically stable, and the iterative performance index function is convergent. Simulation study demonstrates the effectiveness of the developed tracking control method.
魏庆来宋睿卓孙秋野肖文栋
关键词:最优跟踪控制HJB方程迭代控制
Policy iteration optimal tracking control for chaotic systems by using an adaptive dynamic programming approach被引量:1
2015年
A policy iteration algorithm of adaptive dynamic programming(ADP) is developed to solve the optimal tracking control for a class of discrete-time chaotic systems. By system transformations, the optimal tracking problem is transformed into an optimal regulation one. The policy iteration algorithm for discrete-time chaotic systems is first described. Then,the convergence and admissibility properties of the developed policy iteration algorithm are presented, which show that the transformed chaotic system can be stabilized under an arbitrary iterative control law and the iterative performance index function simultaneously converges to the optimum. By implementing the policy iteration algorithm via neural networks,the developed optimal tracking control scheme for chaotic systems is verified by a simulation.
魏庆来刘德荣徐延才
关键词:离散混沌系统最优跟踪控制动态规划方法策略迭代迭代算法
A new approach of optimal control for a class of continuous-time chaotic systems by an online ADP algorithm
2014年
We develop an online adaptive dynamic programming(ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the performance index function reach an optimum. The expression of the performance index function for the chaotic system is first presented.The online ADP algorithm is presented to achieve optimal control. In the ADP structure, neural networks are used to construct a critic network and an action network, which can obtain an approximate performance index function and the control input, respectively. It is proven that the critic parameter error dynamics and the closed-loop chaotic systems are uniformly ultimately bounded exponentially. Our simulation results illustrate the performance of the established optimal control method.
宋睿卓肖文栋魏庆来
关键词:混沌系统一致最终有界
Optimal Constrained Self-learning Battery Sequential Management in Microgrid Via Adaptive Dynamic Programming被引量:13
2017年
This paper concerns a novel optimal self-learning battery sequential control scheme for smart home energy systems.The main idea is to use the adaptive dynamic programming(ADP) technique to obtain the optimal battery sequential control iteratively. First, the battery energy management system model is established, where the power efficiency of the battery is considered. Next, considering the power constraints of the battery, a new non-quadratic form performance index function is established, which guarantees that the value of the iterative control law cannot exceed the maximum charging/discharging power of the battery to extend the service life of the battery.Then, the convergence properties of the iterative ADP algorithm are analyzed, which guarantees that the iterative value function and the iterative control law both reach the optimums. Finally,simulation and comparison results are given to illustrate the performance of the presented method.
Qinglai WeiDerong LiuYu LiuRuizhuo Song
Residential Energy Scheduling for Variable Weather Solar Energy Based on Adaptive Dynamic Programming被引量:13
2018年
The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable energy resources, are combined together as a nonlinear, time-varying, indefinite and complex system, which is difficult to manage or optimize. Many nations have already applied the residential real-time pricing to balance the burden on their grid. In order to enhance electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming(ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First,the weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions.Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved load balancing process. The comparison with the particle swarm optimization algorithm further proves that the present method has a promising effect on energy management to save cost.
Derong LiuYancai XuQinglai WeiXinliang Liu
共1页<1>
聚类工具0