咨询与建议

限定检索结果

文献类型

  • 81 篇 期刊文献
  • 28 篇 会议
  • 2 篇 学位论文

馆藏范围

  • 111 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 87 篇 工学
    • 53 篇 计算机科学与技术...
    • 36 篇 电气工程
    • 30 篇 控制科学与工程
    • 8 篇 交通运输工程
    • 7 篇 石油与天然气工程
    • 5 篇 软件工程
    • 4 篇 信息与通信工程
    • 3 篇 动力工程及工程热...
    • 2 篇 仪器科学与技术
    • 2 篇 土木工程
    • 1 篇 电子科学与技术(可...
    • 1 篇 化学工程与技术
    • 1 篇 船舶与海洋工程
    • 1 篇 环境科学与工程(可...
  • 28 篇 管理学
    • 28 篇 管理科学与工程(可...
    • 3 篇 工商管理
  • 24 篇 理学
    • 22 篇 数学
    • 4 篇 系统科学
    • 1 篇 物理学
    • 1 篇 统计学(可授理学、...
  • 11 篇 经济学
    • 7 篇 理论经济学
    • 3 篇 应用经济学
  • 3 篇 医学
    • 3 篇 临床医学
    • 2 篇 基础医学(可授医学...

主题

  • 111 篇 value function a...
  • 37 篇 reinforcement le...
  • 18 篇 approximate dyna...
  • 12 篇 dynamic programm...
  • 7 篇 dynamic vehicle ...
  • 7 篇 temporal differe...
  • 6 篇 q-learning
  • 5 篇 function approxi...
  • 5 篇 markov decision ...
  • 4 篇 markov decision ...
  • 4 篇 neural networks
  • 4 篇 optimal control
  • 4 篇 policy iteration
  • 3 篇 rate of converge...
  • 3 篇 actor-critic
  • 3 篇 policy evaluatio...
  • 3 篇 polynomial basis...
  • 3 篇 reinforcement le...
  • 3 篇 energy managemen...
  • 3 篇 off-policy learn...

机构

  • 2 篇 beijing univ che...
  • 2 篇 hefei univ techn...
  • 2 篇 missouri univ sc...
  • 2 篇 univ massachuset...
  • 2 篇 tokyo inst techn...
  • 2 篇 northeastern uni...
  • 2 篇 univ sci & techn...
  • 2 篇 tech univ carolo...
  • 2 篇 natl univ def te...
  • 2 篇 georgia inst tec...
  • 2 篇 chinese acad sci...
  • 2 篇 otto von guerick...
  • 2 篇 rice univ dept e...
  • 1 篇 polish acad sci ...
  • 1 篇 shanghai engn re...
  • 1 篇 tsinghua univ de...
  • 1 篇 univ sydney sch ...
  • 1 篇 inria nancy gran...
  • 1 篇 univ southern ca...
  • 1 篇 univ twente ind ...

作者

  • 6 篇 ulmer marlin w.
  • 5 篇 song tianheng
  • 5 篇 li dazi
  • 4 篇 xu xin
  • 4 篇 mattfeld dirk c.
  • 3 篇 soeffker ninja
  • 3 篇 hachiya hirotaka
  • 2 篇 tutsoy onder
  • 2 篇 huang zhenhua
  • 2 篇 savelsbergh mart...
  • 2 篇 montoya juan m.
  • 2 篇 lewis frank l.
  • 2 篇 pietquin olivier
  • 2 篇 jin qibing
  • 2 篇 sickles robin c.
  • 2 篇 geist matthieu
  • 2 篇 li ping
  • 2 篇 chapman archie c...
  • 2 篇 zuo lei
  • 2 篇 cervellera crist...

语言

  • 109 篇 英文
  • 2 篇 其他
检索条件"主题词=Value function approximation"
111 条 记 录,以下是11-20 订阅
排序:
Algorithmic Survey of Parametric value function approximation
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2013年 第6期24卷 845-867页
作者: Geist, Matthieu Pietquin, Olivier Supelec IMS MaLIS Res Grp F-57070 Metz France
Reinforcement learning (RL) is a machine learning answer to the optimal control problem. It consists of learning an optimal control policy through interactions with the system to be controlled, the quality of this pol... 详细信息
来源: 评论
Local and soft feature selection for value function approximation in batch reinforcement learning for robot navigation
收藏 引用
JOURNAL OF SUPERCOMPUTING 2024年 第8期80卷 10720-10745页
作者: Fathinezhad, Fatemeh Adibi, Peyman Shoushtarian, Bijan Chanussot, Jocelyn Univ Isfahan Fac Comp Engn Artificial Intelligence Dept Esfahan Iran Univ Grenoble Alpes Grenoble INP GIPSA Lab CNRS Grenoble France
This paper proposes a novel method for robot navigation in high-dimensional environments that reduce the dimension of the state space using local and soft feature selection. The algorithm selects relevant features bas... 详细信息
来源: 评论
Controller design and value function approximation for nonlinear dynamical systems
收藏 引用
AUTOMATICA 2016年 67卷 54-66页
作者: Korda, Milan Henrion, Didier Jones, Colin N. Ecole Polytech Fed Lausanne Lab Automat Stn 9 CH-1015 Lausanne Switzerland CNRS LAAS 7 Ave Colonel Roche F-31400 Toulouse France Univ Toulouse LAAS F-31400 Toulouse France Czech Tech Univ Fac Elect Engn CZ-16626 Prague Czech Republic
This work considers the infinite-time discounted optimal control problem for continuous time input-affine polynomial dynamical systems subject to polynomial state and box input constraints. We propose a sequence of su... 详细信息
来源: 评论
Least Absolute Policy Iteration-A Robust Approach to value function approximation
收藏 引用
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS 2010年 第9期E93D卷 2555-2565页
作者: Sugiyama, Masashi Hachiya, Hirotaka Kashima, Hisashi Morimura, Tetsuro Tokyo Inst Technol Dept Comp Sci Tokyo 1528552 Japan Japan Sci & Technol Agcy PRESTO Tokyo 1528552 Japan Univ Tokyo Dept Math Informat Tokyo 1138656 Japan IBM Res Tokyo Yamato 2428502 Japan
Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers in observed rewards. In this paper, we propose an... 详细信息
来源: 评论
Leveraging Statistical Multi-Agent Online Planning with Emergent value function approximation  17
Leveraging Statistical Multi-Agent Online Planning with Emer...
收藏 引用
17th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS)
作者: Phan, Thomy Belzner, Lenz Gabor, Thomas Schmid, Kyrill Ludwig Maximilians Univ Munchen Inst Informat Munich Germany
Making decisions is a great challenge in distributed autonomous environments due to enormous state spaces and uncertainty. Many online planning algorithms rely on statistical sampling to avoid searching the whole stat... 详细信息
来源: 评论
Improving value function approximation in Factored POMDPs by Exploiting Model Structure  14
Improving Value Function Approximation in Factored POMDPs by...
收藏 引用
14th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
作者: Veiga, Tiago S. Spaan, Matthijs T. J. Lima, Pedro U. Univ Lisbon Inst Super Tecn Inst Syst Robot Lisbon Portugal Delft Univ Technol Delft Netherlands
Linear value function approximation in Markov decision processes (MDPs) has been studied extensively, but there are several challenges when applying such techniques to partially observable MDPs (POMDPs). Furthermore, ... 详细信息
来源: 评论
Power System Maintenance Planning Using value function approximation
Power System Maintenance Planning Using Value Function Appro...
收藏 引用
International Conference on Probabilistic Methods Applied to Power Systems (PMAPS)
作者: Abeygunawardane, Saranga K. Jirutitijaroen, Panida Xu, Huan Univ Moratuwa Dept Elect Engn Moratuwa Sri Lanka Natl Univ Singapore Dept Elect & Comp Engn Singapore 117548 Singapore Natl Univ Singapore Dept Mech Engn Singapore 117548 Singapore
Power system maintenance planning is vital for conducting maintenance of power system equipment in an optimal manner. A maintenance model of a system with number of equipment has several states. Solving such a system ... 详细信息
来源: 评论
Feature Selection for value function approximation
Feature Selection for Value Function Approximation
收藏 引用
作者: Taylor, Gavin Duke University
学位级别:Ph.D.
The field of reinforcement learning concerns the question of automated action selection given past experiences. As an agent moves through the state space, it must recognize which state choices are best in terms of all... 详细信息
来源: 评论
Integrating Symmetry of Environment by Designing Special Basis functions for value function approximation in Reinforcement Learning  14
Integrating Symmetry of Environment by Designing Special Bas...
收藏 引用
14th International Conference on Control, Automation, Robotics and Vision (ICARCV)
作者: Wang, Guo-fang Fang, Zhou Li, Bo Li, Ping Zhejiang Univ ZJU Sch Aeronaut & Astronaut Hangzhou Zhejiang Peoples R China Zhejiang Univ ZJU Dept Control Sci & Engn Hangzhou Zhejiang Peoples R China
Reinforcement learning (RL) is usually regarded as tabula rasa learning, and the agent needs to randomly explore the environment, so the time consuming and data inefficiency will hinder RL from the real application. I... 详细信息
来源: 评论
Combining value function approximation and multiple scenario approach for the effective management of ride-hailing services
收藏 引用
EURO JOURNAL ON TRANSPORTATION AND LOGISTICS 2023年 12卷
作者: Heitmann, R. Julius O. Soeffker, Ninja Ulmer, Marlin W. Mattfeld, Dirk C. Tech Univ Carolo Wilhelmina Braunschweig Decis Support Grp Braunschweig Germany Univ Wien Dept Business Decis & Analyt Vienna Austria Otto von Guericke Univ Chair Management Sci Magdeburg Germany
The availability of various services for individual mobility is increasing, especially in urban areas. Dynamic ride-hailing services address these aspects and are gaining market share with providers such as MOIA, Uber... 详细信息
来源: 评论