咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是841-850 订阅
排序:
Using reinforcement learning for City Site Selection in the Turn-Based Strategy Game Civilization IV
Using Reinforcement Learning for City Site Selection in the ...
收藏 引用
ieee symposium on Computational Intelligence and Games
作者: Wender, Stefan Watson, Ian Univ Auckland Dept Comp Sci Auckland 1 New Zealand
This paper describes the design and implementation of a reinforcement learner based on Q-learning. This adaptive agent is applied to the city placement selection task in the commercial computer game Civilization IV. T... 详细信息
来源: 评论
adaptive critic-based neurofuzzy controller for the steam generator water level
Adaptive critic-based neurofuzzy controller for the steam ge...
收藏 引用
15th International Workshop on Room-Temperature Semiconductor X- and Gamma-Ray Detectors/ 2006 ieee Nuclear Science symposium
作者: Fakhrazari, Amin Boroushaki, Mehrdad Sharif Univ Technol Dept Mech Engn Tehran Iran
In this paper, an adaptive critic-based neurofuzzy controller is presented for water level regulation of nuclear steam generators. The problem has been of great concern for many years as the steam generator is a highl... 详细信息
来源: 评论
dynamic Pricing by Multiagent reinforcement learning
Dynamic Pricing by Multiagent Reinforcement Learning
收藏 引用
International symposium on Electronic Commerce and Security
作者: Han, Wei Liu, Lingbo Zheng, Huaili Nanjing Univ Finance & Econ Informat Engn Coll Nanjing 210046 Peoples R China
dynamic pricing in electronic marketplaces is a basic problem in electronic commercial. In multiagent environments, the optimal pricing policy of agent depends on the pricing policies of other agents. This makes the l... 详细信息
来源: 评论
A Biologically-Inspired Computational Model for Transformation Invariant Target Recognition
A Biologically-Inspired Computational Model for Transformati...
收藏 引用
International Joint Conference on Neural Networks
作者: Iftekharuddin, Khan M. Li, Yaqin Univ Memphis Dept Elect & Comp Engn Intelligence Syst & Image Proc Lab Memphis TN 38152 USA
Transformation invariant image recognition has been an active research area due to its widespread applications in a variety of fields such as military operations, robotics' medical practices, geographic scene anal... 详细信息
来源: 评论
adaptive dynamic programming for multi-intersections traffic signal intelligent control
Adaptive dynamic programming for multi-intersections traffic...
收藏 引用
11th International ieee Conference on Intelligent Transportation Systems, ITSC 2008
作者: Li, Tao Zhao, Dongbin Yi, Jianqiang Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy of Sciences 95 Zhongguancun East Road Haidian District Beijing 100080 China University of Arizona United States
This paper aims at developing near optimal traffic signal control for multi-intersections in city. As a new optimization technique, adaptive dynamic programming (ADP) combines concepts of reinforcement learning and dy... 详细信息
来源: 评论
Foreword - ADP: The Key Direction for Future Research in Intelligent Control and Understanding Brain Intelligence
收藏 引用
ieee Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 2008年 第4期38卷 898-900页
作者: Paul J. Werbos National Science Foundation Arlington VA USA
This forward to the special issue on adaptive dynamic programming (ADP) and reinforcement learning in feedback control is written by Paul Werbos, the founder of ADP.
来源: 评论
reinforcement learning of adaptive longitudinal vehicle control for dynamic collaborative driving
Reinforcement learning of adaptive longitudinal vehicle cont...
收藏 引用
ieee symposium on Intelligent Vehicle
作者: Luke Ng Christopher M. Clark Jan P. Huissoon Department Mechanical and Mechatronics Engineering University of Waterloo ONT Canada Computer Science Department California Polytechnic State University San Louis Obispo CA USA Department of Mechanical and Mechatronics Engineering University of Waterloo ONT Canada
dynamic collaborative driving involves the motion coordination of multiple vehicles using shared information from vehicles instrumented to perceive their surroundings in order to improve road usage and safety. A basic... 详细信息
来源: 评论
Using reinforcement learning for city site selection in the turn-based strategy game Civilization IV
Using reinforcement learning for city site selection in the ...
收藏 引用
ieee symposium on Computational Intelligence and Games, CIG
作者: Stefan Wender Ian Watson Department of Computer Science University of Auckland Auckland New Zealand
This paper describes the design and implementation of a reinforcement learner based on Q-learning. This adaptive agent is applied to the city placement selection task in the commercial computer game Civilization IV. T... 详细信息
来源: 评论
RL-Based Scheduling Strategies in Actual Grid Environments
RL-Based Scheduling Strategies in Actual Grid Environments
收藏 引用
International symposium on Parallel and Distributed Processing with Applications, ISPA
作者: Bernardo Costa Inês Dutra Marta Mattoso COPPE Sistemas UFRJ Rio de Janeiro Brazil DCC University of Porto Porto Portugal
In this work, we study the behaviour of different resource scheduling strategies when doing job orchestration in grid environments. We empirically demonstrate that scheduling strategies based on reinforcement learning... 详细信息
来源: 评论
adaptive dynamic programming for Multi-intersections Traffic Signal Intelligent Control
Adaptive Dynamic Programming for Multi-intersections Traffic...
收藏 引用
International Conference on Intelligent Transportation
作者: Tao Li Dongbin Zhao Jianqiang Yi Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China China Scholarship Council University of Arizona Tucson USA
This paper aims at developing near optimal traffic signal control for multi-intersections in city. As a new optimization technique, adaptive dynamic programming (ADP) combines concepts of reinforcement learning and dy... 详细信息
来源: 评论