咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是11-20 订阅
排序:
Node Fault Prediction Assisted Small-World IoT Networks Using ML Frameworks: Towards Performance Improvement  18
Node Fault Prediction Assisted Small-World IoT Networks Usin...
收藏 引用
18th ieee international Conference on Advanced Networks and Telecommunications Systems, ANTS 2024
作者: Sharma, Neha Gupta, Aryaman Deepak, Sivala Pandey, Om Jee Indian Institute of Technology BHU Department of Electronics Engineering Varanasi India
The rapid growth of the Internet of Things (IoT) networks has led to the deployment of large-scale networks, enabling seamless connectivity and data exchange among various devices. To manage the complexity and ensure ... 详细信息
来源: 评论
An Online Model-Free reinforcement learning Approach for 6-DOF Robot Manipulators
An Online Model-Free Reinforcement Learning Approach for 6-D...
收藏 引用
2023 ieee international symposium on Robotic and Sensors Environments, ROSE 2023
作者: Hosny, Zeyad Nassar, Abdullah Aboelyazeed, Ahmed Mohamed, Mahmoud Abouheaf, Mohammed Gueaieb, Wail University of Ottawa School of Electrical Engineering and Computer Science OttawaONK1N6N5 Canada Bowling Green State University Robotics Engineering Bowling GreenOH43402 United States
Controlling 6 Degrees-of-Freedom (DoF) robotic manipulators in an online, model-free manner poses significant challenges due to their complex coupling, non-linearities, and the need to account for unmodeled dynamics. ... 详细信息
来源: 评论
Solving PBQP-Based Register Allocation using Deep reinforcement learning  22
Solving PBQP-Based Register Allocation using Deep Reinforcem...
收藏 引用
20th ieee/ACM international symposium on Code Generation and Optimization (CGO)
作者: Kim, Minsu Park, Jeong-Keun Moon, Soo-Mook Seoul Natl Univ Dept Elect & Comp Engn Seoul South Korea
Irregularly structured registers are hard to abstract and allocate. Partitioned Boolean quadratic programming (PBQP) is a useful abstraction to represent complex register constraints, even those in highly irregular pr... 详细信息
来源: 评论
The Research of Quadrotor Flight Control Based on reinforcement learning and ADP  8
The Research of Quadrotor Flight Control Based on Reinforcem...
收藏 引用
8th Annual international Conference on Network and Information Systems for Computers, ICNISC 2022
作者: Li, Xueyuan Xie, Wentao Zhan, Wentao Xi'an Aeronautics Computing Technique Research Institute Avic Xi'an China
This paper studies the application of Lookup-Table reinforcement learning method into the continuous state space control of quadrotor simulator and designs a attitude controller for the quadrotor simulator based on Q-... 详细信息
来源: 评论
A learning Approach to Multi-robot Task Allocation with Priority Constraints and Uncertainty
A Learning Approach to Multi-robot Task Allocation with Prio...
收藏 引用
2022 ieee international Conference on Industrial Technology, ICIT 2022
作者: Deng, Fuqin Huang, Huanzhao Fu, Lanhui Yue, Hongwei Zhang, Jianmin Wu, Zexiao Lam, Tin Lun Wuyi University School of Intelligent Manufacturing Guangdong Jiangmen529020 China The Shenzhen Institute of Artificial Intelligence and Robotics for Society Guangdong Shenzhen518000 China The 3irobotix Co. Ltd Guangdong Shenzhen518000 China Guangdong University of Education School of Physics and Information Engineering Guangdong Guangzhou510303 China The Chinese University of HongKong School of Science and Engineering Guangdong Shenzhen518000 China
Multi-robot task allocation has an important impact on the efficiency of multi-robot collaboration. For single-shot allocation without complicated constraints, some exact algorithms and heuristic algorithms can find t... 详细信息
来源: 评论
A reinforcement learning Solution to the Nonlinear Spacecraft Pursuit-Evasion Game Problem
A Reinforcement Learning Solution to the Nonlinear Spacecraf...
收藏 引用
ieee international Conference on Cyber Technology in Automation, Control, and Intelligent Systems
作者: Haoqi Huang Guangtao Ran Yueyong Lyu Guangfu Ma Department of Control Science and Engineering Harbin Institute of Technology Harbin China
The pursuit-evasion game of non-cooperative spacecrafts under nonlinear dynamics is currently a hot topic in orbital gaming. We describe the above pursuit-evasion game model using differential game theory, transformin... 详细信息
来源: 评论
A Budget-aware Incentive Mechanism for Vehicle-to-Grid via reinforcement learning  31
A Budget-aware Incentive Mechanism for Vehicle-to-Grid via R...
收藏 引用
31st ieee/ACM international symposium on Quality of Service, IWQoS 2023
作者: Zhu, Tianxiang Zhang, Xiaoxi Duan, Jingpu Zhou, Zhi Chen, Xu Sun Yat-sen University Guangzhou China Southern University of Science and Technology Shenzhen China Pengcheng Laboratory Shenzhen China
With the increasing penetration of renewable energy and electric vehicles (EVs), the behavior of EVs' charging and discharging has shown great impact on the Micro Grid power load, motivating the development of Veh... 详细信息
来源: 评论
2022 ieee 33rd Annual international symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2022
2022 IEEE 33rd Annual International Symposium on Personal, I...
收藏 引用
33rd ieee Annual international symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2022
The proceedings contain 237 papers. The topics discussed include: robust device position and pose detection using visible light without model knowledge: a branch-structured residual learning method;access point cluste...
来源: 评论
learning Intrusion Prevention Policies through Optimal Stopping  17
Learning Intrusion Prevention Policies through Optimal Stopp...
收藏 引用
17th international Conference on Network and Service Management (CNSM) - Smart Management for Future Networks and Services
作者: Hammar, Kim Stadler, Rolf KTH Royal Inst Technol Div Network & Syst Engn Stockholm Sweden KTH Ctr Cyber Def & Informat Secur Stockholm Sweden
We study automated intrusion prevention using reinforcement learning. In a novel approach, we formulate the problem of intrusion prevention as an optimal stopping problem. This formulation allows us insight into the s... 详细信息
来源: 评论
reinforcement learning of LQR control policy by a double inverted-pendulum biomechanical model
Reinforcement learning of LQR control policy by a double inv...
收藏 引用
ieee international Conference on Industrial Technology (ICIT)
作者: Kamran Iqbal Muhammad Haras University of Arkansas at Little Rock Little Rock Arkansas
Optimal LQR feedback gains can be learned using reinforcement learning (RL) framework for systems with unknown dynamics using policy iteration methods. However, policy iteration in the case of inherently unstable syst...
来源: 评论