咨询与建议

限定检索结果

文献类型

  • 140 篇 会议
  • 7 篇 期刊文献

馆藏范围

  • 147 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 71 篇 工学
    • 66 篇 计算机科学与技术...
    • 15 篇 软件工程
    • 11 篇 电气工程
    • 10 篇 控制科学与工程
    • 2 篇 仪器科学与技术
    • 2 篇 信息与通信工程
    • 1 篇 力学(可授工学、理...
    • 1 篇 机械工程
    • 1 篇 建筑学
  • 11 篇 理学
    • 10 篇 数学
    • 2 篇 系统科学
    • 2 篇 统计学(可授理学、...
  • 7 篇 管理学
    • 6 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 1 篇 图书情报与档案管...
  • 3 篇 经济学
    • 3 篇 应用经济学

主题

  • 76 篇 dynamic programm...
  • 39 篇 learning
  • 26 篇 optimal control
  • 25 篇 reinforcement le...
  • 15 篇 function approxi...
  • 15 篇 control systems
  • 14 篇 approximation al...
  • 14 篇 equations
  • 13 篇 neural networks
  • 13 篇 stochastic proce...
  • 12 篇 convergence
  • 10 篇 state-space meth...
  • 10 篇 cost function
  • 9 篇 mathematical mod...
  • 8 篇 trajectory
  • 8 篇 approximation me...
  • 7 篇 approximate dyna...
  • 7 篇 algorithm design...
  • 7 篇 adaptive control
  • 7 篇 heuristic algori...

机构

  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 northeastern uni...
  • 3 篇 univ texas autom...
  • 3 篇 arizona state un...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 2 篇 princeton univ d...
  • 2 篇 national science...
  • 2 篇 college of mecha...
  • 2 篇 key laboratory o...
  • 2 篇 univ utrecht dep...
  • 2 篇 department of op...
  • 1 篇 inria
  • 1 篇 computational le...
  • 1 篇 school of automa...
  • 1 篇 univ cincinnati ...
  • 1 篇 toyota technol c...
  • 1 篇 neuroinformatics...

作者

  • 5 篇 liu derong
  • 4 篇 xu xin
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 marco a. wiering
  • 4 篇 zhang huaguang
  • 4 篇 si jennie
  • 4 篇 derong liu
  • 3 篇 hado van hasselt
  • 3 篇 lewis frank l.
  • 3 篇 dongbin zhao
  • 3 篇 powell warren b.
  • 3 篇 warren b. powell
  • 3 篇 riedmiller marti...
  • 2 篇 manuel loth
  • 2 篇 van hasselt hado
  • 2 篇 preux philippe
  • 2 篇 hu dewen
  • 2 篇 jennie si
  • 2 篇 philippe preux

语言

  • 142 篇 英文
  • 5 篇 其他
检索条件"任意字段=2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning, ADPRL 2007"
147 条 记 录,以下是141-150 订阅
排序:
Continuous-Time ADP for Linear Systems with Partially Unknown dynamics
Continuous-Time ADP for Linear Systems with Partially Unknow...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (adprl)
作者: Draguna Vrabie Murad Abu-Khalaf Frank L. Lewis Youyi Wang Automation and Robotics Research Institute University of Texas Arlington Fort Worth TX USA School of Electrical and Electronic Engineering Nanyang Technological University Singapore
approximate dynamic programming has been formulated and applied mainly to discrete-time systems. Expressing the ADP concept for continuous-time systems raises difficult issues related to sampling time and system model... 详细信息
来源: 评论
Hybrid Ant Colony Optimization Using Memetic Algorithm for Traveling Salesman Problem
Hybrid Ant Colony Optimization Using Memetic Algorithm for T...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (adprl)
作者: Haibin Duan Xiufen Yu School of Automation Science and Electrical Engineering Beihang University Beijing China Center for Space Science and Applied Research Chinese Academy and Sciences Beijing China
Ant colony optimization was originally presented under the inspiration during collective behavior study results on real ant system, and it has strong robustness and easy to combine with other methods in optimization. ... 详细信息
来源: 评论
DHP Adaptive Critic Motion Control of Autonomous Wheeled Mobile Robot
DHP Adaptive Critic Motion Control of Autonomous Wheeled Mob...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (adprl)
作者: Wei-Song Lin Ping-Chieh Yang Department and Institute of Electrical Engineering National Taiwan University Taipei Taiwan
Autonomous drive of wheeled mobile robot (WMR) needs implementing velocity and path tracking control subject to complex dynamical constraints. Conventionally, this control design is obtained by analysis and synthesis ... 详细信息
来源: 评论
Call admission control in wireless DS-CDMA systems using actor-critic reinforcement learning
Call admission control in wireless DS-CDMA systems using act...
收藏 引用
2nd International symposium on Wireless Pervasive Computing
作者: Chanloha, Pitipong Usaha, Wipawee Suranaree Univ Technol Sch Telecommun Engn Nakhon Ratchasima 30000 Thailand
This paper addresses the call admission control (CAC) problem for multiple services in the uplink of a cellular system using direct sequential code division multiple access (DS-CDMA) when taking into account the physi... 详细信息
来源: 评论
reinforcement-learning-based Magneto-hydrodynamic Control of Hypersonic Flows
Reinforcement-Learning-based Magneto-hydrodynamic Control of...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (adprl)
作者: Nilesh V. Kulkarni Minh Q. Phan NASA Ames Research Center QSS Group Inc. Moffett Field CA USA Dartmouth College Hanover NH USA
In this work, we design a policy-iteration-based Q-learning approach for on-line optimal control of ionized hypersonic flow at the inlet of a scramjet engine. Magneto-hydrodynamics (MHD) has been recently proposed as ... 详细信息
来源: 评论
On reinforcement learning in Genetic Regulatory Networks
On Reinforcement Learning in Genetic Regulatory Networks
收藏 引用
ieee/SP Workshop on Statistical Signal Processing (SSP)
作者: Babak Faryabi Aniruddha Datta Edward R. Dougherty Department of Electrical and Computer Engineering Texas A and M University College Station TX USA Computational Biology Division Translational Genomics Research Institute Phoenix Phoenix AZ USA
The control of probabilistic Boolean networks as a model of genetic regulatory networks is formulated as an optimal stochastic control problem and has been solved using dynamic programming; however, the proposed metho... 详细信息
来源: 评论
Optimal Control Applied to Wheeled Mobile Vehicles
Optimal Control Applied to Wheeled Mobile Vehicles
收藏 引用
ieee International symposium on Intelligent Signal Processing
作者: M. Gomez T. Martinez S. Sanchez D. Meziat Departamento de Automática Universidad de Alcalá Spain Departamento de Física Ingeniería de Sistemas y Teoría de la Señal Universidad de Alcalá Spain
The goal of the work described in this paper is to develop a particular optimal control technique based on a Cell-Mapping technique in combination with the Q-learning reinforcement learning method to control wheeled m... 详细信息
来源: 评论