
Refine search results

Document type

  • 751 conference papers
  • 272 journal articles
  • 4 books

Collection scope

  • 1,027 electronic documents
  • 1 print holding

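Date distribution

Subject classification

  • 719 papers: Engineering
    • 523 papers: Computer Science and Technology...
    • 385 papers: Electrical Engineering
    • 284 papers: Control Science and Engineering
    • 153 papers: Software Engineering
    • 83 papers: Information and Communication Engineering
    • 41 papers: Transportation Engineering
    • 24 papers: Instrument Science and Technology
    • 21 papers: Mechanical Engineering
    • 9 papers: Electronic Science and Technology (may...
    • 9 papers: Bioengineering
    • 7 papers: Mechanics (may confer Engineering, ...
    • 7 papers: Civil Engineering
    • 7 papers: Petroleum and Natural Gas Engineering
    • 6 papers: Power Engineering and Engineering Thermo...
    • 4 papers: Materials Science and Engineering (may...
    • 4 papers: Biomedical Engineering (may confer...
    • 4 papers: Safety Science and Engineering
    • 3 papers: Chemical Engineering and Technology
    • 3 papers: Aeronautical and Astronautical Science and Tech...
  • 120 papers: Science
    • 98 papers: Mathematics
    • 31 papers: Systems Science
    • 22 papers: Statistics (may confer Science, ...
    • 10 papers: Biology
    • 9 papers: Physics
    • 5 papers: Chemistry
  • 68 papers: Management
    • 65 papers: Management Science and Engineering (may...
    • 14 papers: Business Administration
    • 7 papers: Library, Information and Archives Manage...
  • 5 papers: Economics
    • 4 papers: Applied Economics
  • 3 papers: Law
    • 3 papers: Sociology
  • 2 papers: Medicine
  • 1 paper: Education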

Topics

  • 315 papers: reinforcement le...
  • 216 papers: dynamic programm...
  • 206 papers: optimal control
  • 110 papers: adaptive dynamic...
  • 105 papers: adaptive dynamic...
  • 97 papers: learning
  • 88 papers: neural networks
  • 79 papers: heuristic algori...
  • 67 papers: reinforcement le...
  • 58 papers: learning (artifi...
  • 54 papers: nonlinear system...
  • 52 papers: convergence
  • 52 papers: control systems
  • 51 papers: mathematical mod...
  • 48 papers: approximate dyna...
  • 44 papers: approximation al...
  • 43 papers: equations
  • 42 papers: adaptive control
  • 41 papers: cost function
  • 40 papers: artificial neura...

Institutions

  • 41 papers: chinese acad sci...
  • 27 papers: univ rhode isl d...
  • 17 papers: tianjin univ sch...
  • 16 papers: northeastern uni...
  • 16 papers: univ sci & techn...
  • 16 papers: univ illinois de...
  • 14 papers: beijing normal u...
  • 13 papers: northeastern uni...
  • 13 papers: guangdong univ t...
  • 12 papers: northeastern uni...
  • 9 papers: natl univ def te...
  • 8 papers: ieee
  • 8 papers: univ chinese aca...
  • 7 papers: univ chinese aca...
  • 7 papers: cent south univ ...
  • 7 papers: southern univ sc...
  • 7 papers: beijing univ tec...
  • 6 papers: chinese acad sci...
  • 6 papers: missouri univ sc...
  • 5 papers: nanjing univ pos...

Authors

  • 55 papers: liu derong
  • 37 papers: wei qinglai
  • 29 papers: he haibo
  • 22 papers: wang ding
  • 21 papers: xu xin
  • 19 papers: jiang zhong-ping
  • 17 papers: lewis frank l.
  • 17 papers: yang xiong
  • 17 papers: zhang huaguang
  • 17 papers: ni zhen
  • 16 papers: zhao bo
  • 16 papers: gao weinan
  • 14 papers: zhao dongbin
  • 13 papers: zhong xiangnan
  • 12 papers: si jennie
  • 12 papers: derong liu
  • 11 papers: song ruizhuo
  • 10 papers: jagannathan s.
  • 10 papers: dongbin zhao
  • 9 papers: abouheaf mohamme...

Language

  • 970 papers: English
  • 51 papers: Other
  • 6 papers: Chinese
Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,027 records; showing 681-690
Delayed insertion and rule effect moderation of domain knowledge for reinforcement learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Teck-Hou Teng; Ah-Hwee Tan; School of Computer Engineering Center for Computational Intelligence School of Computer Engineering Nanyang Technological University
Though not a fundamental pre-requisite to efficient machine learning, insertion of domain knowledge into adaptive virtual agent is nonetheless known to improve learning efficiency and reduce model complexity. Conventi...
Reinforcement learning to train Ms. Pac-Man using higher-order action-relative inputs
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Luuk Bom; Ruud Henken; Marco Wiering; Faculty of Mathematics and Natural Sciences, University of Groningen, The Netherlands
Reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been develo...
Analyzing collective behavior in evolutionary swarm robotic systems based on an ethological approach
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Toshiyuki Yasuda; Nanami Wada; Kazuhiro Ohkura; Yoshiyuki Matsumura; Graduate School of Engineering, Hiroshima University, Higashi-Hiroshima, JAPAN; Faculty of Textile Science and Technology, Shinshu University, Ueda, Nagano, JAPAN
Swarm robotic systems are a type of multi-robot systems which generally consist of many homogeneous autonomous robots without any type of global controllers. Swarm robotics aims at designing desired collective behavio...
Robust Adaptive Dynamic Programming With an Application to Power Systems
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, Vol. 24, No. 7, pp. 1150-1156
Authors: Jiang, Yu; Jiang, Zhong-Ping; NYU Polytech Inst, Dept Elect & Comp Engn, Brooklyn, NY 11201, USA
This brief presents a novel framework of robust adaptive dynamic programming (robust-ADP) aimed at computing globally stabilizing and suboptimal control policies in the presence of dynamic uncertainties. A key strateg...
Cooperative off-policy prediction of Markov decision processes in adaptive networks
2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Authors: Macua, Sergio Valcarcel; Chen, Jianshu; Zazo, Santiago; Sayed, Ali H.; Escuela Técnica Superior de Ingenieros de Telecomunicación, Universidad Politécnica de Madrid, Madrid 28040, Spain; Department of Electrical Engineering, University of California, Los Angeles, CA 90095, United States
We apply diffusion strategies to propose a cooperative reinforcement learning algorithm, in which agents in a network communicate with their neighbors to improve predictions about their environment. The algorithm is s...
Optimal tracking control scheme for discrete-time nonlinear systems with approximation errors
10th International Symposium on Neural Networks, ISNN 2013
Authors: Wei, Qinglai; Liu, Derong; State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
In this paper, we aim to solve an infinite-time optimal tracking control problem for a class of discrete-time nonlinear systems using iterative adaptive dynamic programming (ADP) algorithm. When the iterative tracking...
COOPERATIVE OFF-POLICY PREDICTION OF MARKOV DECISION PROCESSES IN ADAPTIVE NETWORKS
IEEE International Conference on Acoustics, Speech, and Signal Processing
Authors: Sergio Valcarcel Macua; Jianshu Chen; Santiago Zazo; Ali H. Sayed; Escuela Tecnica Superior de Ingenieros de Telecomunicacion, Universidad Politecnica de Madrid, Madrid 28040, Spain; Department of Electrical Engineering, University of California, Los Angeles, CA 90095, USA
We apply diffusion strategies to propose a cooperative reinforcement learning algorithm, in which agents in a network communicate with their neighbors to improve predictions about their environment. The algorithm is s...
Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, Vol. 24, No. 1, pp. 145-157
Authors: Heydari, Ali; Balakrishnan, Sivasubramanya N.; Missouri Univ Sci & Technol, Dept Mech & Aerosp Engn, Rolla, MO 65401, USA
To synthesize fixed-final-time control-constrained optimal controllers for discrete-time nonlinear control-affine systems, a single neural network (NN)-based controller called the Finite-horizon Single Network Adaptiv...
Hierarchical dynamic power management using model-free reinforcement learning
IEEE International Symposium on Quality Electronic Design
Authors: Yanzhi Wang; Maryam Triki; Xue Lin; Ahmed C. Ammari; Massoud Pedram; Department of Electrical Engineering, University of Southern California, Los Angeles, CA, USA; National Institute of the Applied Sciences and Technology (INSAT), Carthage University, Tunisia; National Institute of the Applied Sciences and Technology (INSAT), Carthage University, Tunisia; Department of Elec. & Computer Engineering, King Abdulaziz University, Jeddah, Saudi Arabia
Model-free reinforcement learning (RL) has become a promising technique for designing a robust dynamic power management (DPM) framework that can cope with variations and uncertainties that emanate from hardware and ap...
The Divergence of Reinforcement Learning Algorithms with Value-Iteration and Function Approximation
IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)/International Joint Conference on Neural Networks (IJCNN)/IEEE Congress on Evolutionary Computation (IEEE-CEC)/IEEE World Congress on Computational Intelligence (IEEE-WCCI)
Authors: Fairbank, Michael; Alonso, Eduardo; City Univ London, Sch Informat, Dept Comp, London EC1V 0HB, England
This paper gives specific divergence examples of value-iteration for several major reinforcement learning and adaptive dynamic programming algorithms, when using a function approximator for the value function. These d...