Autonomous driving of a wheeled mobile robot (WMR) requires implementing velocity and path-tracking control subject to complex dynamical constraints. Conventionally, this control design is obtained by analysis and synthesis of the WMR system. This paper presents a dual heuristic programming (DHP) adaptive critic design of the motion control system that enables the WMR to achieve the control objective simply by trial-and-error learning. The design consists of an adaptive critic velocity neuro-control loop and a posture neuro-control loop. The neural weights in the velocity neuro-controller (VNC) are corrected with the DHP adaptive critic method. The designer simply expresses the control objective with a utility function, and the VNC learns by sequential optimization to satisfy it. The posture neuro-controller (PNC) approximates the inverse velocity model of the WMR so as to map planned positions to desired velocities. Supervised driving of the WMR at varying velocities supplies training samples for the PNC and VNC to set up the neural weights. During autonomous driving, the learning mechanism keeps improving the PNC and VNC. The design is evaluated on an experimental WMR; the results confirm that the DHP adaptive critic motion control design enables the WMR to develop its control ability autonomously.
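The abstract gives the DHP structure but no equations. As a rough illustration only, the sketch below shows a DHP-style critic/actor update on a known linear model with a quadratic utility: the critic estimates the cost-to-go gradient lambda(x) = dJ/dx rather than J itself, which is the defining feature of DHP. The model matrices, utility weights, and step sizes are illustrative assumptions, not the paper's WMR design.

```python
# Minimal DHP-style critic/actor sketch, assuming a known linear model
# x' = A x + B u and quadratic utility U(x, u) = x^T Q x + u^T R u.
# All constants here are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
n, m = 4, 2                                  # state and action dimensions
A = np.eye(n) + 0.01 * rng.standard_normal((n, n))
B = 0.1 * rng.standard_normal((n, m))
Q, R = np.eye(n), 0.1 * np.eye(m)
gamma, lr = 0.95, 1e-3

Wc = 0.01 * rng.standard_normal((n, n))      # linear critic: lambda(x) = Wc @ x
Wa = np.zeros((m, n))                        # linear actor:  u(x) = Wa @ x

def critic(x):                               # estimate of dJ/dx
    return Wc @ x

for episode in range(200):
    x = rng.standard_normal(n)
    for t in range(50):
        u = Wa @ x
        x_next = A @ x + B @ u
        # DHP critic target: dU/dx + gamma * (dx'/dx)^T lambda(x'),
        # with dx'/dx = A + B @ Wa for this linear actor.
        dUdx = 2 * Q @ x + Wa.T @ (2 * R @ u)
        target = dUdx + gamma * (A + B @ Wa).T @ critic(x_next)
        err = critic(x) - target
        Wc -= lr * np.outer(err, x)          # gradient step on critic weights
        # Actor step: descend dJ/du = dU/du + gamma * B^T lambda(x').
        dJdu = 2 * R @ u + gamma * B.T @ critic(x_next)
        Wa -= lr * np.outer(dJdu, x)
        x = x_next
```

In the paper's terms, the utility function U is the only thing the designer specifies; the critic and actor weights adapt from experience, here mimicked by rollouts of the assumed model.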
ISBN (print): 1424407060
In this work, we design a policy-iteration-based Q-learning approach for on-line optimal control of ionized hypersonic flow at the inlet of a scramjet engine. Magneto-hydrodynamics (MHD) has recently been proposed as a means of flow control in various aerospace problems. This mechanism applies external magnetic fields to ionized flows to achieve desired flow behavior. The applications range from external flow control, for producing forces and moments on the air vehicle, to internal flow control designs, which compress the flow and extract electrical energy from it. The current work addresses the latter problem of internal flow control. The baseline controller and Q-function parameterizations are derived from an off-line mixed predictive-control and dynamic-programming-based design. The nominal optimal neural network Q-function and controller are updated on-line to handle modeling errors in the off-line design. The on-line implementation investigates key concerns regarding the conservativeness of the update methods. Value-iteration-based update methods have been shown to converge in a probabilistic sense; however, simulation results illustrate that realistic implementations of these methods face significant training difficulties, often failing to learn the optimal controller on-line. The present approach therefore uses a policy-iteration-based update, which has time-based convergence guarantees. Given the special finite-horizon nature of the problem, three novel on-line update algorithms are proposed. These algorithms incorporate different mixes of concepts, including bootstrapping and forward and backward dynamic programming update rules. Simulation results illustrate the success of the proposed update algorithms in re-optimizing the performance of the MHD generator during system operation.
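For readers unfamiliar with the policy-iteration structure the paper builds on, here is a minimal tabular sketch of policy iteration with a Q-function on a toy MDP. The paper itself uses neural-network Q-function parameterizations in an on-line, finite-horizon setting; everything below (P, r, gamma, sizes) is an assumed toy problem that only illustrates the evaluate-then-improve loop.

```python
# Tabular policy iteration with a Q-function on an assumed toy MDP.
import numpy as np

rng = np.random.default_rng(1)
nS, nA, gamma = 6, 3, 0.9
P = rng.dirichlet(np.ones(nS), size=(nS, nA))  # P[s, a] -> next-state dist
r = rng.standard_normal((nS, nA))              # reward table

pi = np.zeros(nS, dtype=int)                   # initial policy
for _ in range(50):
    # Policy evaluation: iterate Q_pi = r + gamma * P @ V_pi to a fixed
    # point (a parametric version would fit Q by regression instead).
    V = np.zeros(nS)
    for _ in range(200):
        Qtab = r + gamma * P @ V               # shape (nS, nA)
        V = Qtab[np.arange(nS), pi]
    # Policy improvement: act greedily w.r.t. the evaluated Q.
    new_pi = Qtab.argmax(axis=1)
    if np.array_equal(new_pi, pi):
        break                                  # policy is stable: optimal
    pi = new_pi
print("converged policy:", pi)
```

The "time-based convergence guarantee" the abstract cites comes from this structure: each improvement step provably does not worsen the policy, unlike stochastic value-iteration updates that converge only probabilistically.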
ISBN (print): 1424408296;97
The goal of the work described in this paper is to develop an optimal control technique based on a cell-mapping technique combined with the Q-learning reinforcement learning method to control wheeled mobile vehicles. The approach manages four state variables because a dynamic model is used instead of a kinematic model, which could be handled with fewer variables. This solution can be applied to nonlinear continuous systems, where reinforcement learning methods face multiple constraints. Emphasis is given to the new combination of techniques, which produces satisfactory results when applied to optimal control problems. The proposed algorithm is robust to changes in the vehicle parameters because the vehicle model is estimated in real time from received experience.
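A minimal sketch of the general idea of pairing a cell-mapping discretization with tabular Q-learning follows, using an assumed 1-D double-integrator in place of the paper's four-state vehicle model; the dynamics, cell resolution, reward, and all constants are illustrative assumptions.

```python
# Cell-mapping + Q-learning sketch on an assumed toy double integrator:
# the continuous state (position, velocity) is mapped to a cell index,
# and standard Q-learning runs on the cell-indexed table.
import numpy as np

rng = np.random.default_rng(2)
pos_bins = np.linspace(-1.0, 1.0, 21)      # cell boundaries for position
vel_bins = np.linspace(-1.0, 1.0, 21)      # cell boundaries for velocity
actions = np.array([-1.0, 0.0, 1.0])       # acceleration commands
Q = np.zeros((len(pos_bins) + 1, len(vel_bins) + 1, len(actions)))
alpha, gamma, eps, dt = 0.1, 0.95, 0.1, 0.05

def cell(p, v):                            # continuous state -> cell index
    return np.digitize(p, pos_bins), np.digitize(v, vel_bins)

for episode in range(500):
    p, v = rng.uniform(-1, 1, size=2)
    s = cell(p, v)
    for t in range(200):
        a = rng.integers(len(actions)) if rng.random() < eps else int(Q[s].argmax())
        v += actions[a] * dt               # double-integrator step
        p += v * dt
        reward = -(p * p + v * v)          # drive the state to the origin
        s2 = cell(p, v)
        # Standard Q-learning update on the cell-indexed table.
        Q[s + (a,)] += alpha * (reward + gamma * Q[s2].max() - Q[s + (a,)])
        s = s2
```

The paper's real-time model estimation would replace the hard-coded integrator step with transitions learned from vehicle experience; the table update itself is unchanged.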
ISBN (print): 9780780397989
In this paper, an analytical comparison is made between dynamic programming and reinforcement learning methods in dynamic two-player games. The emphasis is on the large number of states and actions available to each player and the conflicting optimization objectives that make these games complicated to model and analyze. Optimization and decision making are carried out with a modified Q-learning algorithm. It is shown that with this method, information processing in large-scale, long-stage games takes less time and results in lower decision costs, whereas dynamic programming methods cannot handle such games over long time horizons.
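The abstract does not spell out its modification of Q-learning, so the sketch below shows only the unmodified baseline it presumably departs from: two independent Q-learners in a repeated two-player matrix game. The payoff matrix and step sizes are assumed, not taken from the paper.

```python
# Baseline sketch: independent Q-learning for two players in a repeated
# matrix game with assumed payoffs (a prisoner's-dilemma-like table).
import numpy as np

rng = np.random.default_rng(3)
payoff1 = np.array([[3.0, 0.0], [5.0, 1.0]])   # row player's payoffs
payoff2 = payoff1.T                            # column player's payoffs
Q1, Q2 = np.zeros(2), np.zeros(2)              # stateless action values
alpha, eps = 0.05, 0.1

for t in range(5000):
    a1 = rng.integers(2) if rng.random() < eps else int(Q1.argmax())
    a2 = rng.integers(2) if rng.random() < eps else int(Q2.argmax())
    # Each player updates only its own action value from its own payoff,
    # treating the opponent as part of the environment.
    Q1[a1] += alpha * (payoff1[a1, a2] - Q1[a1])
    Q2[a2] += alpha * (payoff2[a1, a2] - Q2[a2])
print("player 1 prefers action", int(Q1.argmax()))
```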
ISBN (print): 9780780397989
This paper shows an approach to integrating common approximate dynamic programming (ADP) algorithms into a theoretical framework that addresses both analytical characteristics and algorithmic features. Several important insights are gained from this analysis, including new approaches to the creation of algorithms. Built on this paradigm, ADP learning algorithms are further developed to address a broader class of problems: optimization with partial observability. The framework is based on an average cost formulation, which uses the concepts of differential costs and performance gradients to describe learning and optimization algorithms. Numerical simulations on a queueing problem and a maze problem illustrate and verify features of the proposed algorithms. Pathways for applying this analysis to adaptive critics are also shown.
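A minimal sketch of the average-cost ingredients the abstract names follows: a differential-cost (relative value) TD update with a running average-cost estimate, on an assumed toy Markov chain under a fixed policy. The chain, costs, and step sizes are all assumptions.

```python
# Average-cost TD sketch: learn differential costs h(s) and the average
# cost eta for a fixed policy on an assumed toy Markov chain, using the
# relation h(s) = c(s) - eta + E[h(s')].
import numpy as np

rng = np.random.default_rng(4)
nS = 5
P = rng.dirichlet(np.ones(nS), size=nS)    # fixed-policy transition matrix
c = rng.uniform(0, 1, size=nS)             # per-state cost
h = np.zeros(nS)                           # differential (relative) costs
eta, alpha, beta = 0.0, 0.05, 0.01         # avg-cost estimate, step sizes

s = 0
for t in range(200_000):
    s2 = rng.choice(nS, p=P[s])
    delta = c[s] - eta + h[s2] - h[s]      # average-cost TD error
    h[s] += alpha * delta
    eta += beta * (c[s] - eta)             # running average-cost estimate
    s = s2

# Compare with the true stationary average cost pi^T c.
w, V = np.linalg.eig(P.T)
stat = np.real(V[:, np.argmax(np.real(w))])
stat /= stat.sum()
print("estimated:", eta, "true:", stat @ c)
```

The performance-gradient machinery the abstract mentions builds on exactly these quantities: differential costs h play the role that value functions play in the discounted setting.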
ISBN (print): 0780397983
The proceedings contain 94 papers. The topics discussed include: neural adaptive control of dynamic sandwich systems with hysteresis; radial basis function based iterative learning control for stochastic distribution systems; energy-efficient approaches to coverage holes detection in wireless sensor networks; optimal sensor placement for border perambulation; finite horizon discrete-time approximate dynamic programming; adaptive critic designs based coupled neurocontrollers for a static compensator; stability analysis and design for switched descriptor systems; a design of a partial sliding mode controller using duality to linear functional observer; stability of digital control systems with time delays; robust stabilization of nonlinear switched systems via switched output feedback; intermittent iterative learning control; and iterative learning control of perspective dynamic systems.
Effective management of anemia due to renal failure poses many challenges to physicians. Individual response to treatment varies across patient populations and, due to the prolonged character of the therapy, changes over time. In this work, a reinforcement learning-based approach is proposed as an alternative method for individualizing drug administration in the treatment of renal anemia. Q-learning, an off-policy approximate dynamic programming method, is applied to determine the proper dosing strategy in real time. Simulations compare the proposed methodology with the currently used dosing protocol. The presented results illustrate the ability of the proposed method to achieve the therapeutic goal for individuals with different response characteristics and its potential to become an alternative to currently used techniques. (c) 2005 Elsevier Ltd. All rights reserved.
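As a rough illustration of Q-learning applied to dose individualization, here is a toy sketch. The patient response model, hemoglobin discretization, dose levels, target, and reward below are invented stand-ins, not the paper's clinical model; they only show how a dosing policy can be learned per patient from observed responses.

```python
# Toy Q-learning sketch for dose individualization: the state is a
# discretized hemoglobin level, the action is a dose, and the reward
# penalizes deviation from an assumed therapeutic target.
import numpy as np

rng = np.random.default_rng(5)
doses = np.array([0.0, 0.5, 1.0, 1.5])        # candidate dose levels
hb_bins = np.linspace(8.0, 14.0, 13)          # hemoglobin discretization
target = 11.5                                 # assumed target (g/dL)
Q = np.zeros((len(hb_bins) + 1, len(doses)))
alpha, gamma, eps = 0.1, 0.9, 0.1

def step(hb, dose, sensitivity):
    # Assumed first-order response: hb drifts down, dose pushes it up.
    return hb + sensitivity * dose - 0.3 + 0.1 * rng.standard_normal()

for episode in range(2000):
    sensitivity = rng.uniform(0.3, 0.7)       # varies across patients
    hb = rng.uniform(9.0, 13.0)
    s = np.digitize(hb, hb_bins)
    for week in range(52):
        a = rng.integers(len(doses)) if rng.random() < eps else int(Q[s].argmax())
        hb = step(hb, doses[a], sensitivity)
        reward = -abs(hb - target)            # penalize missing the target
        s2 = np.digitize(hb, hb_bins)
        Q[s, a] += alpha * (reward + gamma * Q[s2].max() - Q[s, a])
        s = s2
```

Because Q-learning is off-policy, as the abstract notes, the table can be updated from doses chosen by the existing clinical protocol while still learning values for the improved policy.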