咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是291-300 订阅
排序:
Multigrid Methods for Policy Evaluation and reinforcement learning
Multigrid Methods for Policy Evaluation and Reinforcement Le...
收藏 引用
ieee international symposium on Intelligent Control (ISIC)
作者: O. Ziv N. Shimkin Department of Electrical Engineering Technion University Haifa Israel
We introduce a new class of multigrid temporal-difference learning algorithms for speeding up the estimation of the value function related to a stationary policy, within the context of discounted cost Markov decision ... 详细信息
来源: 评论
On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains
On using discretized Cohen-Grossberg node dynamics for model...
收藏 引用
ieee international symposium on Computational Intelligence in Robotics and Automation (CIRA)
作者: E. Mizutani S.E. Dreyfus Department of Computer Science National Tsing Hua University Hsinchu Taiwan Department of JEOR University of California Berkeley Berkeley CA USA
We describe how multi-stage non-Markovian decision problems can be solved using actor-critic reinforcement learning by assuming that a discrete version of Cohen-Grossberg node dynamics describes the node-activation co... 详细信息
来源: 评论
A biologically-inspired computational model for transformation invariant target recognition
A biologically-inspired computational model for transformati...
收藏 引用
international Joint Conference on Neural Networks (IJCNN)
作者: Khan M. Iftekharuddin Yaqin Li Intelligence System and Image Processing Lab Department of Electrical and Computer Engineering University of Memphis Memphis TN USA
Transformation invariant image recognition has been an active research area due to its widespread applications in a variety of fields such as military operations, robotics, medical practices, geographic scene analysis... 详细信息
来源: 评论
Deep reinforcement learning for Perishable Inventory Optimization Problem
Deep Reinforcement Learning for Perishable Inventory Optimiz...
收藏 引用
ieee international Conference on Industrial Engineering and Engineering Management
作者: Yusuke Nomura Ziang Liu Tatsushi Nishi Graduate School of Environmental Life Natural Science and Technology Okayama University Okayama City Okayama Japan
While global attention on reducing food waste has increased, the demand for perishable commodities such as food and pharmaceuticals is growing. This emphasizes the need for effective perishable inventory management, w...
来源: 评论
Resource Provisioning in Fog Computing through Deep reinforcement learning
Resource Provisioning in Fog Computing through Deep Reinforc...
收藏 引用
IFIP/ieee international symposium on Integrated Network Management
作者: José Santos Tim Wauters Bruno Volckaert Filip De Turck IDLab Ghent University - imec Gent Belgium
The massive growth of connected devices has made traditional cloud systems inadequate to sustain the scalability, mobility, and heterogeneous nature of the Internet of Things (oT). Distributed clouds have become a pot... 详细信息
来源: 评论
An Enhanced reinforcement learning Approach for dynamic Placement of Virtual Network Functions
An Enhanced Reinforcement Learning Approach for Dynamic Plac...
收藏 引用
ieee international symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Omar Houidi Oussama Soualah Wajdi Louati Djamal Zeghlache Telecom SudParis Samovar-UMR 5157 CNRS Institut Polytechnique de Paris France ReDCAD Lab University of Sfax Tunisia
This paper addresses Virtualized Network Function Forwarding Graph (VNF-FG) embedding with the objective of realizing long term reward compared to placement algorithms that aim at instantaneous optimal placement. The ... 详细信息
来源: 评论
ATM: approximate Task Memoization in the Runtime System
ATM: Approximate Task Memoization in the Runtime System
收藏 引用
international symposium on Parallel and Distributed Processing (IPDPS)
作者: Iulian Brumar Marc Casas Miquel Moreto Mateo Valero Gurindar S. Sohi Barcelona Supercomputing Center (BSC) Barcelona Spain University of Wisconsin-Madison USA
Redundant computations appear during the execution of real programs. Multiple factors contribute to these unnecessary computations, such as repetitive inputs and patterns, calling functions with the same parameters or... 详细信息
来源: 评论
Cooperative learning and planning for multiple robots
Cooperative learning and planning for multiple robots
收藏 引用
ieee international symposium on Intelligent Control (ISIC)
作者: S. van der Zwaan J.A.A. Moreira P.U. Lima Inst. de Sistema e Robotica Inst. Superior Tecnico Lisbon Portugal IInstituto de Sistemas e Robótica Instituto Superior Técnico Lisboa Portugal Instituto de Sistemas e Robótica Instituto Superior Técnico Lisboa Portugal
The paper deals with the the subject of learning and planning for real mobile robots, using Sutton's (1991) Dyna algorithm. The Dyna algorithm integrates reinforcement learning, planning and reactive execution. We... 详细信息
来源: 评论
Mobile-Aware Online Task Offloading Based on Deep reinforcement learning in Mobile Edge Computing Networks
Mobile-Aware Online Task Offloading Based on Deep Reinforcem...
收藏 引用
ieee international symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Yuting Li Yitong Liu Xingcheng Liu Qiang Tu Yi Xie School of Electronics and Information Technology Sun Yat-sen University Guangzhou China School of Computer Science and Engineering Sun Yat-sen University Guangzhou China Jiangsu Viscore Technologies Co. Ltd. Suzhou China
Mobile Edge Computing (MEC) is one of the key enabling technologies for future 6G wireless networks that can provide lower latency service and more efficient resource utilization for future intelligent applications an...
来源: 评论
A Budget-aware Incentive Mechanism for Vehicle-to-Grid via reinforcement learning
A Budget-aware Incentive Mechanism for Vehicle-to-Grid via R...
收藏 引用
international Workshop on Quality of Service
作者: Tianxiang Zhu Xiaoxi Zhang Jingpu Duan Zhi Zhou Xu Chen Sun Yat-sen University Guangzhou China Southern University of Science and Technology Shenzhen China Pengcheng Laboratory Shenzhen China
With the increasing penetration of renewable energy and electric vehicles (EVs), the behavior of EVs' charging and discharging has shown great impact on the Micro Grid power load, motivating the development of Veh...
来源: 评论