Search query: subject term = "Approximate dynamic Programming"
983 records; showing 941-950
Natural actor-critic algorithms
AUTOMATICA, 2009, Vol. 45, No. 11, pp. 2471-2482
Authors: Bhatnagar, Shalabh; Sutton, Richard S.; Ghavamzadeh, Mohammad; Lee, Mark. Affiliations: Indian Inst Sci Dept Comp Sci & Automat Bangalore 560012 Karnataka India; Univ Alberta Dept Comp Sci RLAI Lab Edmonton AB T6G 2E8 Canada; INRIA Lille Nord Europe Team SequeL Lille France
We present four new reinforcement learning algorithms based on actor-critic, natural-gradient and function-approximation ideas, and we provide their convergence proofs. Actor-critic reinforcement learning methods are ...
Swarm-based approximate dynamic optimization process for discrete particle swarm optimization system
INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2009, Vol. 1, No. 1-2, pp. 61-70
Authors: Kang, Qi; Wang, Lei; Wu, Qidi. Affiliation: Tongji Univ Dept Control Sci & Engn Shanghai 201804 Peoples R China
This paper presents a convergence analysis of particle swarm optimization system by treating it as a discrete-time linear time-variant system firstly. And then, based on the results of system convergence conditions, d...
Coding and control for communication networks
QUEUEING SYSTEMS, 2009, Vol. 63, No. 1-4, pp. 195-216
Authors: Chen, Wei; Traskov, Danail; Heindlmaier, Michael; Medard, Muriel; Meyn, Sean; Ozdaglar, Asuman. Affiliations: Univ Illinois Dept Elect & Comp Engn Urbana IL 61801 USA; Univ Illinois Coordinated Sci Lab Urbana IL 61801 USA; Tech Univ Munich Inst Comm Engn Munich Germany; MIT Dept Elect Engn & Comp Sci Cambridge MA 02139 USA
The purpose of this paper is to survey techniques for constructing effective policies for controlling complex networks, and to extend these techniques to capture special features of wireless communication networks und...
Controlled exploration of state space in off-line ADP and its application to stochastic shortest path problems
COMPUTERS & CHEMICAL ENGINEERING, 2009, Vol. 33, No. 12, pp. 2111-2122
Authors: Pratikakis, Nikolaos E.; Realff, Matthew J.; Lee, Jay H. Affiliation: Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
This paper addresses the problem of finding a control policy that drives a generic discrete event stochastic system from an initial state to a set of goal states with a specified probability. The control policy is ite...
Neural-Network-Based Near-Optimal Control for a Class of Discrete-Time Affine Nonlinear Systems With Control Constraints
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, Vol. 20, No. 9, pp. 1490-1503
Authors: Zhang, Huaguang; Luo, Yanhong; Liu, Derong. Affiliations: Northeastern Univ Sch Informat Sci & Engn Shenyang 110004 Liaoning Peoples R China; Chinese Acad Sci Key Lab Complex Syst & Intelligence Sci Inst Automat Beijing 100190 Peoples R China
In this paper, the near-optimal control problem for a class of nonlinear discrete-time systems with control constraints is solved by iterative adaptive dynamic programming algorithm. First, a novel nonquadratic perfor...
Intelligence in the brain: A theory of how it works and how to build it
NEURAL NETWORKS, 2009, Vol. 22, No. 3, pp. 200-212
Author: Werbos, Paul J. Affiliation: Natl Sci Fdn ECCS Div Arlington VA 22230 USA
This paper presents a theory of how general-purpose learning-based intelligence is achieved in the mammal brain, and how we can replicate it. It reviews four generations of ever more powerful general-purpose learning ...
APPROXIMATE DYNAMIC PROGRAMMING STRATEGY FOR DUAL ADAPTIVE CONTROL
IFAC Proceedings Volumes, 2005, Vol. 38, No. 1, pp. 459-464
Authors: Jong Min Lee; Jay H. Lee. Affiliation: School of Chemical and Biomolecular Engineering Georgia Institute of Technology Atlanta GA 30332 USA
An approximate dynamic programming (ADP) strategy for a dual adaptive control problem is presented. An optimal control policy of a dual adaptive control problem can be derived by solving a stochastic dynamic programmi...
Separable approximations for joint capacity control and overbooking decisions in network revenue management
JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2009, Vol. 8, No. 1, pp. 3-20
Authors: Erdelyi, Alexander; Topaloglu, Huseyin. Affiliation: Cornell Univ Sch Operat Res & Informat Engn Ithaca NY 14853 USA
We develop a network revenue management model to jointly make capacity control and overbooking decisions. Our approach is based on the observation that if the penalty cost of denying boarding to the reservations at th...
Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration
6th International Symposium on Neural Networks
Authors: Zhang, Pengchen; Xu, Xin; Liu, Chunming; Yuan, Qiping. Affiliation: Natl Univ Def Technol Inst Automat Changsha 410073 Hunan Peoples R China
Machine learning for mobile robots has attracted lots of research interests in recent years. However, there are still many challenges to apply learning techniques in real mobile robots, e.g., generalization in Contin...
Dynamic Portfolio Optimization for Utility-Based Models
International Conference on Information and Financial Engineering
Author: Fulga, Cristinca. Affiliation: INCREST Bucharest Dept Math R-79622 Bucharest Romania
Portfolio management deals with the allocation of wealth among different investment opportunities, considering investor's preferences on risk. In this paper we consider a multiperiod model where the investor rebal...