Search criteria: "Subject = Approximate dynamic Programming"
983 records; showing 851-860
A New Self-Learning Optimal Control Scheme for Discrete-Time Nonlinear Systems Using Policy Iterative Adaptive Dynamic Programming
IFAC Proceedings Volumes, 2013, vol. 46, no. 20, pp. 580-585
Authors: Qinglai Wei, Derong Liu
Affiliation: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
In this paper, a new self-learning method using policy iterative adaptive dynamic programming (ADP) is developed to obtain the optimal control scheme of discrete-time nonlinear systems. The iterative ADP algorithm per...
Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, vol. 24, no. 1, pp. 145-157
Authors: Heydari, Ali; Balakrishnan, Sivasubramanya N.
Affiliation: Missouri Univ Sci & Technol, Dept Mech & Aerosp Engn, Rolla, MO 65401, USA
To synthesize fixed-final-time control-constrained optimal controllers for discrete-time nonlinear control-affine systems, a single neural network (NN)-based controller called the Finite-horizon Single Network Adaptiv...
On the Convergence of Simulation-based Iterative Methods for Solving Singular Linear Systems
Stochastic Systems, 2013, vol. 3, no. 1, pp. 1-321
Authors: Mengdi Wang, Dimitri P. Bertsekas
We consider the simulation-based solution of linear systems of equations, Ax = b, of various types frequently arising in large-scale applications, where A is singular. We show that the convergence properties of ...
Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems
IEEE TRANSACTIONS ON CYBERNETICS, 2013, vol. 43, no. 2, pp. 779-789
Authors: Liu, Derong; Wei, Qinglai
Affiliation: Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite-horizon discrete-time nonlinear systems with finite approximation errors. The ide...
Performance Guarantee of a Sub-Optimal Policy for a Robotic Surveillance Application
IFAC Proceedings Volumes, 2013, vol. 46, no. 30, pp. 283-290
Authors: Myoungkuk Park, Krishnamoorthy Kalyanam, Swaroop Darbha, P.P. Khargonekar, P.R. Chandler, M. Pachter
Affiliations: Department of Mechanical Engineering, Texas A&M University, College Station, TX 77843, USA; Infoscitex Corporation, Dayton, OH 45431, USA; Department of Electrical Engineering, University of Florida, Gainesville, FL 32525; Autonomous Control Branch, Air Force Research Laboratory, Wright-Patterson A.F.B., OH 45433; Department of Electrical Engineering, Air Force Institute of Technology, Wright-Patterson A.F.B., OH 45433
This paper focuses on the development and analysis of sub-optimal decision algorithms for a collection of robots that assist a remotely located operator in perimeter surveillance. The operator is tasked with the class...
Online Partially Model-Free Solution of Two-Player Zero Sum Differential Games
IFAC Proceedings Volumes, 2013, vol. 46, no. 32, pp. 696-701
Authors: P Praveen, Shubhendu Bhasin
Affiliation: Department of Electrical Engineering, Indian Institute of Technology Delhi, India
An online adaptive dynamic programming based iterative algorithm is proposed for a two-player zero sum linear differential game problem arising in the control of process systems affected by disturbances. The objective...
A Data-driven Model for Large Wildfire Behaviour Prediction in Europe
Procedia Computer Science, 2013, vol. 18, pp. 1861-1870
Authors: Dario Rodriguez-Aseretto, Daniele de Rigo, Margherita Di Leo, Ana Cortés, Jesús San-Miguel-Ayanz
Affiliations: European Commission, Joint Research Centre, Institute for Environment and Sustainability, Via E. Fermi 2749, I-21027 Ispra (VA), Italy; Politecnico di Milano, Dipartimento di Elettronica e Informazione, Via Ponzio 34/5, I-20133 Milano, Italy; Universitat Autonoma de Barcelona, Computer Architecture and Operating Systems, Campus Bellaterra, Cerdanyola 08193, Spain
The European Forest Fire Information System (EFFIS) has been established by the Joint Research Centre (JRC) and the Directorate General for Environment (DG ENV) of the European Commission (EC) in close collaboration w...
A UNIFIED FRAMEWORK FOR LINEAR FUNCTION APPROXIMATION OF VALUE FUNCTIONS IN STOCHASTIC CONTROL
European Signal Processing Conference
Authors: Matilde Sanchez-Fernandez, Sergio Valcarcel, Santiago Zazoy
Affiliations: Universidad Carlos III de Madrid, Signal Theory & Communications Dept.; Universidad Politecnica de Madrid, Signals Systems & Radiocommunications Dept., Av. Complutense
This paper contributes with a unified formulation that merges previous analysis on the prediction of the performance (value function) of certain sequence of actions (policy) when an agent operates a Markov decision pr...
On Integral Value Iteration for Continuous-Time Linear Systems
American Control Conference
Authors: Jae Young Lee, Jin Bae Park, Yoon Ho Choi
Affiliations: Department of Electrical and Electronic Engineering, Yonsei University, Shinchon-Dong, Seodaemun-Gu, Seoul 120-749, Korea; Department of Electronic Engineering, Kyonggi University, Suwon, Kyonggi-Do 443-760, Korea
This paper investigates the properties of integral value iteration (I-VI) which is one of the reinforcement learning (RL) technique for solving online the continuous-time (CT) optimal control problems without using th...
Lagrangian relaxation and constraint generation for allocation and advanced scheduling
COMPUTERS & OPERATIONS RESEARCH, 2012, vol. 39, no. 10, pp. 2323-2336
Authors: Gocgun, Yasin; Ghate, Archis
Affiliations: Univ Washington, Seattle, WA 98195, USA; Univ British Columbia, Sauder Sch Business, Vancouver, BC V5Z 1M9, Canada
Diverse applications in manufacturing, logistics, health care, telecommunications, and computing require that renewable resources be dynamically scheduled to handle distinct classes of job service requests arriving ra...