Refine search results

Document type

  • 747 journal articles
  • 208 conference papers
  • 23 theses and dissertations
  • 1 book

Collection scope

  • 979 electronic documents
  • 1 print holding

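Subject classification

  • 746 records: Engineering
    • 307 records: Computer Science and Technology...
    • 271 records: Electrical Engineering
    • 249 records: Control Science and Engineering
    • 86 records: Transportation Engineering
    • 50 records: Mechanical Engineering
    • 42 records: Oil and Natural Gas Engineering
    • 40 records: Civil Engineering
    • 38 records: Software Engineering
    • 31 records: Information and Communication Engineering
    • 26 records: Chemical Engineering and Technology
    • 25 records: Power Engineering and Engineering Thermo...
    • 16 records: Instrument Science and Technology
    • 8 records: Environmental Science and Engineering (degree conferrable...
    • 4 records: Mechanics (degree conferrable in Engineering, ...
    • 4 records: Electronic Science and Technology (degree conferrable...
    • 4 records: Architecture
  • 356 records: Management
    • 339 records: Management Science and Engineering (degree conferrable...
    • 52 records: Business Administration
    • 6 records: Public Administration
  • 231 records: Science
    • 196 records: Mathematics
    • 65 records: Systems Science
    • 11 records: Statistics (degree conferrable in Science, ...
    • 9 records: Physics
    • 7 records: Biology
    • 4 records: Ecology
  • 79 records: Economics
    • 55 records: Applied Economics
    • 25 records: Theoretical Economics
  • 18 records: Medicine
    • 11 records: Basic Medicine (degree conferrable in Medicine...
    • 10 records: Clinical Medicine
    • 7 records: Public Health and Preventive Medi...
  • 8 records: Military Science
  • 7 records: Agriculture
  • 3 records: Law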

Topic

  • 979 records: approximate dyna...
  • 142 records: reinforcement le...
  • 141 records: optimal control
  • 83 records: adaptive dynamic...
  • 77 records: neural networks
  • 64 records: adaptive critic ...
  • 62 records: markov decision ...
  • 59 records: dynamic programm...
  • 50 records: markov decision ...
  • 36 records: nonlinear system...
  • 29 records: adaptive dynamic...
  • 22 records: uncertainty
  • 22 records: adaptive control
  • 21 records: neural network
  • 21 records: policy iteration
  • 20 records: neuro-dynamic pr...
  • 19 records: linear programmi...
  • 18 records: value function a...
  • 17 records: value iteration
  • 17 records: optimization

Institution

  • 63 records: chinese acad sci...
  • 33 records: univ sci & techn...
  • 18 records: princeton univ d...
  • 12 records: georgia inst tec...
  • 11 records: tsinghua univ de...
  • 10 records: school of automa...
  • 9 records: northeastern uni...
  • 9 records: cornell univ sch...
  • 9 records: univ rhode isl d...
  • 8 records: air force instit...
  • 7 records: the state key la...
  • 7 records: south china univ...
  • 7 records: univ illinois de...
  • 6 records: univ chicago boo...
  • 6 records: tsinghua univ sc...
  • 6 records: univ chinese aca...
  • 6 records: chinese acad sci...
  • 6 records: univ chinese aca...
  • 5 records: natl univ singap...
  • 5 records: univ illinois de...

Author

  • 65 records: wei qinglai
  • 58 records: liu derong
  • 29 records: song ruizhuo
  • 22 records: powell warren b.
  • 21 records: wang ding
  • 16 records: lee jay h.
  • 15 records: ulmer marlin w.
  • 13 records: lee jong min
  • 12 records: lewis frank l.
  • 12 records: zhang huaguang
  • 11 records: li hongliang
  • 10 records: robbins matthew ...
  • 9 records: lygeros john
  • 9 records: derong liu
  • 8 records: xu xin
  • 8 records: lunday brian j.
  • 8 records: topaloglu huseyi...
  • 8 records: thomas barrett w...
  • 8 records: huang zhijian
  • 8 records: mattfeld dirk c.

Language

  • 923 records: English
  • 49 records: Other
  • 4 records: Chinese
  • 2 records: Spanish
  • 1 record: German
  • 1 record: French
  • 1 record: Russian

Search query: "Subject term = Approximate Dynamic Programming"
979 records; showing records 631-640

Approximate dynamic programming with a fuzzy parameterization
Automatica, 2010, vol. 46, no. 5, pp. 804-814
Authors: Busoniu, Lucian; Ernst, Damien; De Schutter, Bart; Babuska, Robert
Affiliations: Delft Univ Technol, Delft Ctr Syst & Control, NL-2628 CD Delft, Netherlands; Univ Liege, Inst Montefiore, FNRS, B-4000 Liege, Belgium
Dynamic programming (DP) is a powerful paradigm for general, nonlinear optimal control. Computing exact DP solutions is in general only possible when the process states and the control actions take values in a small d...

Spacecraft Autonomy Modeled via Markov Decision Process and Associative Rule-based Machine Learning
4th IEEE International Workshop on Metrology for AeroSpace
Authors: D'Angelo, Gianni; Tipaldi, Massimo; Glielmo, Luigi; Rampone, Salvatore
Affiliations: Univ Sannio, Dept Sci & Technol, Benevento, Italy; OHB Italia SpA, Via Gallarate 150, I-20151 Milan, Italy; Univ Sannio, Dept Engn, Benevento, Italy
Spacecraft on-board autonomy is an important topic in currently developed and future space missions. In this study, we present a robust approach to the optimal policy of autonomous space systems modeled via Markov Dec...

Relations between Model Predictive Control and Reinforcement Learning
20th World Congress of the International Federation of Automatic Control (IFAC)
Author: Goerges, Daniel
Affiliation: Univ Kaiserslautern, Electromobil, Erwin Schrodinger Str 12, D-67663 Kaiserslautern, Germany
In this paper, relations between model predictive control and reinforcement learning are studied for discrete-time linear time-invariant systems with state and input constraints and a quadratic value function. The prin...

Four Nonlinear Multi-input Multi-output ADHDP Constructions and Algorithms Based on Topology Principle
4th International Conference on Systems and Informatics (ICSAI)
Authors: Huang, Zhijian; Zhang, Cheng; Zheng, Huan; Wang, Shengtang; Liu, Yihua; Zhang, Guichen; Huang, Xing
Affiliation: Shanghai Maritime Univ, Lab Intelligent Control & Computat, Shanghai, Peoples R China
In this paper, four action-dependent heuristic dynamic programming control methods are presented for nonlinear multi-input multi-output systems with different characteristics, based on the topology principle. These four meth...

Active Fault Diagnosis for Jump Markov Nonlinear Systems
IFAC-PapersOnLine, 2017, vol. 50, no. 1, pp. 7308-7313
Authors: Škach J.; Punčochář I.; Straka O.
Affiliation: European Centre of Excellence - NTIS, Faculty of Applied Sciences, University of West Bohemia, Pilsen 306 14, Czech Republic
In this paper, the problem of active fault diagnosis for jump Markov nonlinear systems with non-Gaussian noises is considered. The imperfect state information formulation is transformed using sufficient statistics to a ...

Stochastic Zero-Sum Nash Games for Uncertain Nonlinear Markovian Jump Systems
56th Annual IEEE Conference on Decision and Control (CDC)
Authors: Vamvoudakis, Kyriakos G.; Safaei, Farshad R. Pour
Affiliations: Virginia Tech, Dept Aerosp & Ocean Engn, Blacksburg, VA 24061, USA; Bosch Res & Technol Ctr North Amer RTC, Palo Alto, CA 94304, USA
In this paper, a novel adaptive learning technique is proposed to solve a stochastic zero-sum Nash game with partially unknown nonlinear systems for which the lengths of time intervals that the system spends in each m...

Discrete-time Optimal Zero-sum Games for Nonlinear Systems via Adaptive Dynamic Programming
2017 IEEE 6th Data Driven Control and Learning Systems Conference (DDCLS’17)
Authors: Qinglai Wei; Ruizhuo Song; Yancai Xu; Derong Liu; Qiao Lin
Affiliations: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences; School of Automation and Electrical Engineering, University of Science and Technology Beijing
In this paper, a novel discrete-time iterative zero-sum adaptive dynamic programming (ADP) algorithm is developed for solving the optimal control problems of nonlinear systems. Two iteration processes, which are lower ...

Neural Network Adaptive Critic Control With Disturbance Rejection
29th Chinese Control and Decision Conference
Authors: Ding Wang; Chaoxu Mu; Derong Liu
Affiliations: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences; School of Computer and Control Engineering, University of Chinese Academy of Sciences; School of Electrical and Information Engineering, Tianjin University; School of Automation and Electrical Engineering, University of Science and Technology Beijing
A neural-network-based adaptive critic control method is established for continuous-time input-affine uncertain nonlinear systems to achieve disturbance rejection. The present problem can be formulated as a two-player zero-sum d...

Local Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
14th International Symposium on Neural Networks (ISNN)
Authors: Wei, Qinglai; Xu, Yancai; Lin, Qiao; Liu, Derong; Song, Ruizhuo
Affiliations: Univ Chinese Acad Sci, Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China; Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
Adaptive dynamic programming is a hot research topic nowadays. Therefore, the paper concerns a new local policy adaptive iterative dynamic programming (ADP) algorithm. Moreover, this algorithm is designed for the disc...

A Data-driven Online ADP of Exponential Convergence Based on k-nearest-neighbor Averager, Stable Term and Persistence Excitation
4th International Conference on Systems and Informatics (ICSAI)
Authors: Huang, Zhijian; Wang, Shengtang; Zheng, Huan; Zhang, Cheng; Zhang, Guichen; Wu, Qili; Tan, Qinmin; Yang, Zhiyuan
Affiliation: Shanghai Maritime Univ, Lab Intelligent Control & Computat, Shanghai, Peoples R China
With the development of marine science, aeronautics and astronautics, energy, chemical industry, biomedicine and management science, many complex systems face the problem of optimization and control. Approximate dynam...