Search query: "Any field = IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 records; showing 301-310

Multi-agent Deep Reinforcement Learning based Information-Energy Collaboration in Vehicle Edge Computing Networks
IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
Authors: Yaoyu Feng Biling Zhang Jung-Lang Yu School of Network Education Beijing University of Posts and Telecommunications P. R. China Department of Electrical Engineering Fu Jen Catholic University New Taipei City Taiwan
In the vehicle edge computing network (VECN), how to deal with the computation resources and energy resources shortage problem the roadside units (RSUs) encounter when they are performing delay sensitive computation t...

From Reward to Histone: Combining Temporal-Difference Learning and Epigenetic Inheritance for Swarm's Coevolving Decision Making
IEEE International Conference on Development and Learning, ICDL
Authors: Faqihza Mukhlish John Page Michael Bain School of Mechanical and Manufacturing Engineering University of New South Wales Sydney Australia School of Computer Science and Engineering University of New South Wales Sydney Australia
Applying intelligence to a group of simple robots known as swarm robots has become an exciting technology in assisting or replacing humans to fulfil complex, dangerous and harsh missions. However, building a strategy ...

An Online Model-Free Reinforcement Learning Approach for 6-DOF Robot Manipulators
International Workshop on Robot Sensing (ROSE)
Authors: Zeyad Hosny Abdullah Nassar Ahmed AboElyazeed Mahmoud Mohamed Mohammed Abouheaf Wail Gueaieb School of Electrical Engineering and Computer Science University of Ottawa Ottawa ON K1N6N5 Canada Robotics Engineering Bowling Green State University Bowling Green 43402 OH USA
Controlling 6 Degrees-of-Freedom (DoF) robotic manipulators in an online, model-free manner poses significant challenges due to their complex coupling, non-linearities, and the need to account for unmodeled dynamics. ...

PhD Forum Abstract: Diffusion-based Task Scheduling for Efficient AI-Generated Content in Edge Networks
International Symposium on Information Processing in Sensor Networks (IPSN)
Authors: Changfu Xu Hong Kong and BNU-HKBU United International College Hong Kong Baptist University Zhuhai China
The Artificial Intelligence-Generated Content (AIGC) technique has gained significant popularity in creating diverse content. However, the current deployment of AIGC services is a centralized framework, thus leading t...

Virtual Network Function Embedding under Nodal Outage using Reinforcement Learning
International Symposium on Advanced Networks and Telecommunication Systems (ANTS)
Authors: Swarna Bindu Chetty Hamed Ahmadi Avishek Nag School of Electrical and Electronic Engineering University College Dublin Dublin Ireland University of York United Kingdom
With the emergence of various types of applications such as delay-sensitive applications, future communication networks are expected to be increasingly complex and dynamic. Network Function Virtualization (NFV) provid...

Discrete-Time Generalized Policy Iteration ADP Algorithm With Approximation Errors
IEEE Symposium Series on Computational Intelligence
Authors: Qinglai Wei Benkai Li Ruizhuo Song The State Key Laboratory of Management and Control for Complex Systems Chinese Academy of Sciences Beijing China School of Automation and Electrical Engineering University of Science and Technology Beijing Beijing China
This paper concerns with a novel generalized policy iteration (GPI) algorithm with approximation errors. Approximation errors are explicitly considered in the GPI algorithm. The properties of the stable GPI algorithm ...

Learning Recovery Strategies for Dynamic Self-Healing in Reactive Systems
SEAMS International Workshop on Software Engineering for Adaptive and Self-Managing Systems, ICSE
Authors: Mateo Sanabria Ivana Dusparic Nicolás Cardozo Universidad de los Andes Colombia Trinity College Dublin Ireland
Self-healing systems depend on following a set of predefined instructions to recover from a known failure state. Failure states are generally detected based on domain specific specialized metrics. Failure fixes are ap...