咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是561-570 订阅
排序:
A novel triggering condition of event-triggered control based on heuristic dynamic programming for discrete-time systems
收藏 引用
OPTIMAL CONTROL APPLICATIONS & METHODS 2018年 第4期39卷 1467-1478页
作者: Wang, Ziyang Wei, Qinglai Liu, Derong Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Guangdong Univ Technol Sch Automat Guangzhou Guangdong Peoples R China
In this paper, an event-triggered heuristic dynamic programming algorithm for discrete-time nonlinear systems with a novel triggering condition is studied. Different from traditional heuristic dynamic programming algo... 详细信息
来源: 评论
Spatial Resource Allocation for Emerging Epidemics: A Comparison of Greedy, Myopic, and dynamic Policies
收藏 引用
M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT 2018年 第2期20卷 181-198页
作者: Long, Elisa F. Nohdurft, Eike Spinler, Stefan Univ Calif Los Angeles Anderson Sch Management Los Angeles CA 90095 USA Kuhne Inst Logist Management WHU Otto Beisheim Sch Management D-56179 Vallendar Germany
Rapidly evolving infectious disease epidemics, such as the 2014 West African Ebola outbreak, pose significant health threats and present challenges to the global health community because of their heterogeneous geograp... 详细信息
来源: 评论
Boundary Control of 2-D Burgers' PDE: An Adaptive dynamic programming Approach
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018年 第8期29卷 3669-3681页
作者: Talaei, Behzad Jagannathan, Sarangapani Singler, John Missouri Univ Sci & Technol Dept Elect & Comp Engn Rolla MO 65401 USA Missouri Univ Sci & Technol Dept Math & Stat Rolla MO 65401 USA
In this paper, an adaptive dynamic programming-based near optimal boundary controller is developed for partial differential equations (PDEs) modeled by the uncertain Burgers' equation under Neumann boundary condit... 详细信息
来源: 评论
Robust Scheduling of EV Charging Load With Uncertain Wind Power Integration
收藏 引用
IEEE TRANSACTIONS ON SMART GRID 2018年 第2期9卷 1043-1054页
作者: Huang, Qilong Jia, Qing-Shan Guan, Xiaohong Tsinghua Univ Dept Automat Ctr Intelligent & Networked Syst Beijing 100084 Peoples R China Xi An Jiao Tong Univ MOE KLINNS Lab Xian 710049 Peoples R China
In some micro grids, the charging of electric vehicles (EVs) and the generation of wind power may partially cancel each other. This is an effective way to reduce the variation of the wind power to the state grid. Due ... 详细信息
来源: 评论
dynamic bus substitution strategy for bunching intervention
收藏 引用
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL 2018年 115卷 1-16页
作者: Petit, Antoine Ouyang, Yanfeng Lei, Chao Univ Illinois Dept Civil & Environm Engn Urbana IL 61801 USA
Bus headways are typically susceptible to external disturbances (e.g., due to traffic congestion, clustered passenger arrivals, and special passenger needs), which create gaps in the system that grow eventually into b... 详细信息
来源: 评论
Adaptive dynamic programming for Discrete-Time Zero-Sum Games
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018年 第4期29卷 957-969页
作者: Wei, Qinglai Liu, Derong Lin, Qiao Song, Ruizhuo Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Chinese Acad Sci Beijing 100049 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a novel adaptive dynamic programming (ADP) algorithm, called "iterative zero-sum ADP algorithm," is developed to solve infinite-horizon discrete-time two-player zero-sum games of nonlinear sys... 详细信息
来源: 评论
Relationship between least squares Monte Carlo and approximate linear programming
收藏 引用
OPERATIONS RESEARCH LETTERS 2017年 第5期45卷 409-414页
作者: Nadarajah, Selvaprabu Secomandi, Nicola Univ Illinois Coll Business Adm 601 South Morgan St Chicago IL 60607 USA Carnegie Mellon Univ Tepper Sch Business 5000 Forbes Ave Pittsburgh PA 15213 USA
Least squares Monte Carlo (LSM) is commonly used to manage and value early or multiple exercise financial or real options. Recent research in this area has started applying approximate linear programming (ALP) and its... 详细信息
来源: 评论
Adaptive Virtual Resource Allocation in 5G Network Slicing Using Constrained Markov Decision Process
收藏 引用
IEEE ACCESS 2018年 6卷 61184-61195页
作者: Tang, Lun Tan, Qi Shi, Yingjie Wang, Chenmeng Chen, Qianbin Chongqing Univ Posts & Telecommun Key Lab Mobile Commun Sch Commun & Informat Engn Chongqing 400065 Peoples R China
Network virtualization technology is generally envisaged as a promising technology to consequently satisfy various types of service requirements. On the other hand, non-orthogonal multiple access (NOMA) technology has... 详细信息
来源: 评论
Iterative ADP learning algorithms for discrete-time multi-player games
收藏 引用
ARTIFICIAL INTELLIGENCE REVIEW 2018年 第1期50卷 75-91页
作者: Jiang, He Zhang, Huaguang Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Liaoning Peoples R China
Adaptive dynamic programming (ADP) is an important branch of reinforcement learning to solve various optimal control issues. Most practical nonlinear systems are controlled by more than one controller. Each controller... 详细信息
来源: 评论
Manifold Regularized Reinforcement Learning
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018年 第4期29卷 932-943页
作者: Li, Hongliang Liu, Derong Wang, Ding Tencent Inc AI Platform Dept Shenzhen 518057 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
This paper introduces a novel manifold regularized reinforcement learning scheme for continuous Markov decision processes. Smooth feature representations for value function approximation can be automatically learned u... 详细信息
来源: 评论