咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是761-770 订阅
排序:
A price-directed heuristic for the economic lot scheduling problem
收藏 引用
IIE TRANSACTIONS 2014年 第12期46卷 1343-1356页
作者: Adelman, Daniel Barz, Christiane Univ Chicago Booth Sch Business Chicago IL 60637 USA Univ Calif Los Angeles Anderson Sch Management Los Angeles CA 90095 USA
The article formulates the well-known economic lot scheduling problem (ELSP) with sequence-dependent setup times and costs as a semi-Markov decision process. Using an affine approximation of the bias function, a semi-... 详细信息
来源: 评论
A Novel Iterative θ-Adaptive dynamic programming for Discrete-Time Nonlinear Systems
收藏 引用
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2014年 第4期11卷 1176-1190页
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
This paper is concerned with a new iterative theta-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative AD... 详细信息
来源: 评论
Effective Load Carrying Capability Evaluation for High Penetration Renewable Energy Integration
Effective Load Carrying Capability Evaluation for High Penet...
收藏 引用
IEEE Power and Energy Society General Meeting
作者: Zhi Chen Lei Wu Department of Electrical Engineering Arkansas Tech University Department of Electrical and Computer Engineering Clarkson University
This paper proposes an approximate dynamic programming (ADP) based approach to evaluate the effective load carrying capability (ELCC) of high penetration renewable resources by solving the long-term security-constrain... 详细信息
来源: 评论
Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming
收藏 引用
INFORMATION SCIENCES 2014年 282卷 167-179页
作者: Wang, Ding Liu, Derong Li, Hongliang Ma, Hongwen Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, the neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming approach is investigated. First, the robust controller of the original ... 详细信息
来源: 评论
The dynamic fleet management problem with uncertain demand and customer chosen service level
收藏 引用
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS 2014年 148卷 110-121页
作者: Shi, Ning Song, Haiqing Powell, Warren B. Sun Yat Sen Univ Sch Business Guangzhou 510275 Guangdong Peoples R China Sun Yat Sen Univ Lingnan Coll Guangzhou 510275 Guangdong Peoples R China Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA
In this paper, we study a dynamic fleet management problem with uncertain demands and customer chosen service levels. We first show that the problem can be transformed into a dynamic network with partially dependent r... 详细信息
来源: 评论
Optimal Patrol to Uncover Threats in Time When Detection is Imperfect
收藏 引用
NAVAL RESEARCH LOGISTICS 2014年 第8期61卷 557-576页
作者: Lin, Kyle Y. Atkinson, Michael P. Glazebrook, Kevin D. Naval Postgrad Sch Dept Operat Res Monterey CA 93943 USA Univ Lancaster Dept Management Sci Sch Management Lancaster LA1 4YX England
Consider a patrol problem, where a patroller traverses a graph through edges to detect potential attacks at nodes. An attack takes a random amount of time to complete. The patroller takes one time unit to move to and ... 详细信息
来源: 评论
Policy oscillation is overshooting
收藏 引用
NEURAL NETWORKS 2014年 第0期52卷 43-61页
作者: Wagner, Paul Aalto Univ Dept Informat & Comp Sci FI-00076 Aalto Finland
A majority of approximate dynamic programming approaches to the reinforcement learning problem can be categorized into greedy value function methods and value-based policy gradient methods. The former approach, althou... 详细信息
来源: 评论
Reinforcement learning algorithms with function approximation: Recent advances and applications
收藏 引用
INFORMATION SCIENCES 2014年 261卷 1-31页
作者: Xu, Xin Zuo, Lei Huang, Zhenhua Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China
In recent years, the research on reinforcement learning (RL) has focused on function approximation in learning prediction and control of Markov decision processes (MDPs). The usage of function approximation techniques... 详细信息
来源: 评论
Rollout Event-Triggered Control: Beyond Periodic Control Performance
收藏 引用
IEEE TRANSACTIONS ON AUTOMATIC CONTROL 2014年 第12期59卷 3296-3311页
作者: Antunes, D. Heemels, W. P. M. H. Eindhoven Univ Technol Dept Mech Engn Control Syst Technol Grp NL-5600 MB Eindhoven Netherlands
Cyber-Physical Systems (CPSs) resulting from the interconnection of computational, communication, and control (cyber) devices with physical processes are wide spreading in our society. In several CPS applications it i... 详细信息
来源: 评论
Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown dynamics
收藏 引用
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2014年 第3期11卷 706-714页
作者: Li, Hongliang Liu, Derong Wang, Ding Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, we develop an integral reinforcement learning algorithm based on policy iteration to learn online the Nash equilibrium solution for a two-player zero-sum differential game with completely unknown linear... 详细信息
来源: 评论