Search query: subject term = "Approximate Dynamic Programming"
979 records; showing 801-810

Approximate Linear Programming for Average Cost MDPs
MATHEMATICS OF OPERATIONS RESEARCH, 2013, vol. 38, no. 3, pp. 535-544
Author: Veatch, Michael H.
Affiliation: Gordon Coll, Dept Math, Wenham, MA 01984, USA
We consider the linear programming approach to approximate dynamic programming with an average cost objective and a finite state space. Using a Lagrangian form of the linear program (LP), the average cost error is sho...

Adaptive Traffic Signal Control for Multi-intersection Based on Microscopic Model
International Conference on Tools with Artificial Intelligence
Authors: Biao Yin; Mahjoub Dridi; Abdellah El Moudni
Affiliation: Laboratoire IRTES-SeT, Université de Technologie de Belfort-Montbéliard (UTBM), Belfort, France
In this paper, we mainly propose an online learning method for adaptive traffic signal control in a multi-intersection system. The method uses approximate dynamic programming (ADP) to achieve a near-optimal solution o...

Approximate dynamic programming for link scheduling in wireless mesh networks
COMPUTERS & OPERATIONS RESEARCH, 2008, vol. 35, no. 12, pp. 3848-3859
Authors: Papadaki, Katerina; Friderikos, Vasilis
Affiliations: London Sch Econ, Dept Operat Res, London WC2A 2AE, England; Kings Coll London, Ctr Telecommun Res, London WC2R 2LS, England
In this paper a novel interference-based formulation and solution methodology for the problem of link scheduling in wireless mesh networks is proposed. Traditionally, this problem has been formulated as a deterministi...

Approximate modified policy iteration and its application to the game of Tetris
The Journal of Machine Learning Research, 2015, vol. 16, no. 1
Authors: Bruno Scherrer; Mohammad Ghavamzadeh; Victor Gabillon; Boris Lesner; Matthieu Geist
Affiliations: INRIA Nancy-Grand Est, Team Maia, Vandœuvre-lès-Nancy, France; Adobe Research & INRIA Lille, San Jose, CA; INRIA Lille-Nord Europe, Team SequeL, Villeneuve d'Ascq, France; CentraleSupélec, IMS-MaLIS Research Group & UMI (GeorgiaTech-CNRS), Metz, France
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebrated policy and value iteration methods. Despite its generality, MPI has not been thoroughly studied, especially its ...

Finite-Approximation-Error-Based Discrete-Time Iterative Adaptive Dynamic Programming
IEEE TRANSACTIONS ON CYBERNETICS, 2014, vol. 44, no. 12, pp. 2820-2833
Authors: Wei, Qinglai; Wang, Fei-Yue; Liu, Derong; Yang, Xiong
Affiliation: Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite horizon discrete-time nonlinear systems with finite approximation errors. First, ...

Reinforcement Learning Output Feedback NN Control Using Deterministic Learning Technique
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, vol. 25, no. 3, pp. 635-641
Authors: Xu, Bin; Yang, Chenguang; Shi, Zhongke
Affiliations: Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China; Univ Plymouth, Sch Comp & Math, Plymouth PL4 8AA, Devon, England; Beijing Inst Technol, Sch Automat, Beijing 100086, Peoples R China
In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic N...

A particle-based policy for the optimal control of Markov decision processes
IFAC Proceedings Volumes, 2014, vol. 47, no. 3, pp. 10518-10523
Authors: M. Pirotta; G. Manganini; L. Piroddi; M. Prandini; M. Restelli
Affiliation: Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20133 Milano, Italy
When the state dimension is large, classical approximate dynamic programming techniques may become computationally unfeasible, since the complexity of the algorithm grows exponentially with the state space size (curse...

Nearly Optimal Control Scheme for Discrete-Time Nonlinear Systems With Finite Approximation Errors Using Generalized Value Iteration Algorithm
IFAC Proceedings Volumes, 2014, vol. 47, no. 3, pp. 4134-4139
Authors: Qinglai Wei; Derong Liu
Affiliation: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China (Tel: +86-10-82544761; Fax: +86-10-82544799)
In this paper, a new generalized value iteration algorithm is developed to solve infinite horizon optimal control problems for discrete-time nonlinear systems. The idea is to use iterative adaptive dynamic programming...

Dynamic Planning of System of Systems Architecture Evolution
Procedia Computer Science, 2014, vol. 28, pp. 449-456
Authors: Zhemei Fang; Daniel DeLaurentis
Affiliation: Purdue University, 701 W. Stadium Avenue, West Lafayette, IN 47907, USA
The dynamic planning and development of a large collection of systems or a ‘System of Systems’ (SoS) pose significant programmatic challenges due to the complex interactions that exist between its constituent system...

Online Optimal Switching of Single Phase DC/AC Inverters using Partial Information
American Control Conference
Authors: Kyriakos G. Vamvoudakis; Joao P. Hespanha
Affiliation: Center for Control, Dynamical-systems and Computation (CCDC), University of California, Santa Barbara, CA 93106-9560, USA
This paper proposes an online optimal tracking algorithm to provide the desired voltage magnitude and frequency at the load. This eventually will work as a DC/AC inverter that with appropriate switching of semiconduct...