咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是401-410 订阅
adaptive Slope Locomotion with Deep reinforcement learning
Adaptive Slope Locomotion with Deep Reinforcement Learning
收藏 引用
ieee/SICE International symposium on System Integration
作者: William Jones Tamir Blum Kazuya Yoshida Space Robotics Laboratory of the Department of Aerospace Engineering Graduate School of Engineering Tohoku University Sendai Japan
In this paper we present a model free Deep reinforcement learning based approach to the motion planning problem of a quadruped moving from a flat to an inclined plane. In our implementation, we do not provide any prio... 详细信息
来源: 评论
Longitudinal dynamic versus Kinematic Models for Car-Following Control Using Deep reinforcement learning
Longitudinal Dynamic versus Kinematic Models for Car-Followi...
收藏 引用
ieee Intelligent Transportation Systems Conference (ieee-ITSC)
作者: Lin, Yuan McPhee, John Azad, Nasser L. Univ Waterloo Syst Design Engn Dept Waterloo ON N2L 3G1 Canada
The majority of current studies on autonomous vehicle control via deep reinforcement learning (DRL) utilize point-mass kinematic models, neglecting vehicle dynamics which includes acceleration delay and acceleration c... 详细信息
来源: 评论
Neural Network Tracking Control of Unknown Servo System with Approximate dynamic programming  38
Neural Network Tracking Control of Unknown Servo System with...
收藏 引用
38th Chinese Control Conference (CCC)
作者: Lv, Yongfeng Ren, Xuemei Zeng, Tianyi Li, Linwei Na, Jing Beijing Inst Technol Sch Automat Beijing 100081 Peoples R China Kunming Univ Sci & Technol Fac Mech & Elect Engn Kunming 650500 Yunnan Peoples R China
Although the adaptive dynamic programming (ADP) scheme has been widely researched on the optimal problem in recent years, which has not been applied to the servo system. In this paper, a simplified reinforcement learn... 详细信息
来源: 评论
reinforcement learning for Vision-Based Lateral Control of a Self-Driving Car  15
Reinforcement Learning for Vision-Based Lateral Control of a...
收藏 引用
ieee 15th International Conference on Control and Automation (ICCA)
作者: Huang, Mengzhe Zhao, Mingyu Parikh, Parthiv Wang, Yebin Ozbay, Kaan Jiang, Zhong-Ping NYU Tandon Sch Engn Dept Elect & Comp Engn Brooklyn NY 11201 USA Mitsubishi Elect Res Labs Cambridge MA 02139 USA NYU C2SMART Ctr Tandon Sch Engn Brooklyn NY 11201 USA
Lateral control design is one of the fundamental components for self-driving cars. In this paper, we propose a learning-based control strategy that enables a mobile car equipped with a camera to perfectly perform lane... 详细信息
来源: 评论
Geometric deep reinforcement learning for dynamic DAG scheduling
Geometric deep reinforcement learning for dynamic DAG schedu...
收藏 引用
ieee symposium Series on Computational Intelligence (SSCI)
作者: Nathan Grinsztajn Olivier Beaumont Emmanuel Jeannot Philippe Preux UMR 9189 CRIStAL Univ. Lille CNRS Inria Lille France Hiepacs team Inria Bordeaux Bordeaux France TADaaM team Inria Bordeaux Bordeaux France
In practice, it is quite common to face combinatorial optimization problems which contain uncertainty along with non determinism and dynamicity. These three properties call for appropriate algorithms; reinforcement le... 详细信息
来源: 评论
adaptive Assist-as-needed Control Based on Actor-Critic reinforcement learning
Adaptive Assist-as-needed Control Based on Actor-Critic Rein...
收藏 引用
ieee/RSJ International Conference on Intelligent Robots and Systems (IROS)
作者: Zhang, Yufeng Li, Shuai Nolan, Karen J. Zanotto, Damiano Stevens Inst Technol Wearable Robot Syst WRS Lab Hoboken NJ 07030 USA Kessler Fdn Human Performance & Engn Res West Orange NJ 07052 USA Rutgers NJMS Newark NJ 07103 USA
In robot-assisted rehabilitation, assist-as-needed (AAN) controllers have been proposed to promote subjects' active participation, which is thought to lead to better training outcomes. Most of these AAN controller... 详细信息
来源: 评论
IntelliNoC: A Holistic Design Framework for Energy-Efficient and Reliable On-Chip Communication for Manycores  19
IntelliNoC: A Holistic Design Framework for Energy-Efficient...
收藏 引用
46th International symposium on Computer Architecture (ISCA) / Workshop on Computer Architecture Education (WCAE)
作者: Wang, Ke Louri, Ahmed Karanth, Avinash Bunescu, Razvan George Washington Univ Dept Elect & Comp Engn Washington DC 20037 USA Ohio Univ Sch Elect Engn & Comp Sci Athens OH 45701 USA
As technology scales, Network-on-Chips (NoCs), currently being used for on-chip communication in manycore architectures, face several problems including high network latency, excessive power consumption, and low relia... 详细信息
来源: 评论
Toward Packet Routing with Fully-distributed Multi-agent Deep reinforcement learning  17
Toward Packet Routing with Fully-distributed Multi-agent Dee...
收藏 引用
17th International symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)
作者: You, Xinyu Li, Xuanjie Xu, Yuedong Feng, Hui Zhao, Jin Fudan Univ Sch Informat Sci & Technol Res Ctr Smart Networks & Syst Shanghai Peoples R China Fudan Univ Sch Comp Sci Shanghai Peoples R China
Packet routing is one of the fundamental problems in computer networks in which a router determines the next-hop of each packet in the queue to get it as quickly as possible to its destination. reinforcement learning ... 详细信息
来源: 评论
An Enhanced reinforcement learning Approach for dynamic Placement of Virtual Network Functions
An Enhanced Reinforcement Learning Approach for Dynamic Plac...
收藏 引用
ieee International symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Omar Houidi Oussama Soualah Wajdi Louati Djamal Zeghlache Telecom SudParis Samovar-UMR 5157 CNRS Institut Polytechnique de Paris France ReDCAD Lab University of Sfax Tunisia
This paper addresses Virtualized Network Function Forwarding Graph (VNF-FG) embedding with the objective of realizing long term reward compared to placement algorithms that aim at instantaneous optimal placement. The ... 详细信息
来源: 评论
reinforcement learning Control of Power Systems with Unknown Network Model under Ambient and Forced Oscillations
Reinforcement Learning Control of Power Systems with Unknown...
收藏 引用
Control Technology and Applications (CCTA),
作者: Sayak Mukherjee He Bai Aranya Chakrabortty North Carolina State University Raleigh NC USA School of Mechanical and Aerospace Engineering Oklahoma State University Stillwater USA
We present a model-free optimal control design for electric power systems with unknown transmission network and load models to improve its dynamic performance using techniques from reinforcement learning (RL) and adap... 详细信息
来源: 评论