ISBN (Digital): 9781728166674
ISBN (Print): 9781728166681
In this paper, we present a model-free deep reinforcement learning based approach to the motion planning problem of a quadruped moving from a flat to an inclined plane. In our implementation, we provide no prior information about the location of the inclined plane, nor do we pass any vision data during the training process. With this approach, we train a 12-degree-of-freedom quadruped robot to traverse up and down a variety of simulated sloped environments, demonstrating in the process that deep reinforcement learning can generate highly dynamic and adaptable solutions.
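To make the training setup concrete, the sketch below builds the kind of proprioception-only observation the abstract implies (no terrain location, no vision); the exact signals and dimensions are our assumptions, not the authors' specification.

```python
import numpy as np

# Hypothetical proprioceptive observation for a 12-DoF quadruped.
# Consistent with the abstract, no incline location or camera data is
# included; the exact signal set is an assumption for illustration.
def build_observation(joint_pos, joint_vel, base_quat, base_ang_vel):
    """Concatenate onboard signals into a flat observation vector."""
    assert joint_pos.shape == (12,) and joint_vel.shape == (12,)
    return np.concatenate([
        joint_pos,       # 12 joint angles (rad)
        joint_vel,       # 12 joint velocities (rad/s)
        base_quat,       # 4-element body orientation quaternion
        base_ang_vel,    # 3-element body angular velocity (rad/s)
    ])

obs = build_observation(np.zeros(12), np.zeros(12),
                        np.array([1.0, 0.0, 0.0, 0.0]), np.zeros(3))
print(obs.shape)  # (31,) observation fed to the policy network
```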
ISBN (Print): 9781538670248
The majority of current studies on autonomous vehicle control via deep reinforcement learning (DRL) utilize point-mass kinematic models, neglecting vehicle dynamics, which include acceleration delay and acceleration command dynamics. The acceleration delay, which results from sensing and actuation delays, causes delayed execution of the control inputs. The acceleration command dynamics dictate that the actual vehicle acceleration does not rise to the commanded acceleration instantaneously. In this work, we investigate the feasibility of applying DRL controllers trained with vehicle kinematic models to more realistic driving control with vehicle dynamics. We consider a particular longitudinal car-following control problem, Adaptive Cruise Control (ACC), solved via DRL using a point-mass kinematic model. When such a controller is applied to car following with vehicle dynamics, we observe significantly degraded car-following performance. We therefore redesign the DRL framework to accommodate the acceleration delay and the acceleration command dynamics by adding, respectively, the delayed control inputs and the actual vehicle acceleration to the reinforcement learning environment state. The training results show that the redesigned DRL controller achieves near-optimal car-following performance with vehicle dynamics considered, when compared with dynamic programming solutions.
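The state redesign described above is straightforward to sketch: buffer the commands that have been issued but not yet executed, and append them, together with the measured acceleration, to the RL state. The sketch below is our reading of that idea, with placeholder names and dimensions.

```python
from collections import deque
import numpy as np

# Sketch of the augmented ACC state: the pending-command buffer exposes
# the actuation delay, and the measured acceleration exposes the
# command dynamics. delay_steps and the feature set are placeholders.
class AugmentedACCState:
    def __init__(self, delay_steps):
        self.pending = deque([0.0] * delay_steps, maxlen=delay_steps)

    def observe(self, gap, rel_speed, actual_accel, new_command):
        self.pending.append(new_command)  # enqueue the latest command
        # Kinematic features plus delayed inputs and actual acceleration.
        return np.array([gap, rel_speed, actual_accel, *self.pending])

s = AugmentedACCState(delay_steps=3)
print(s.observe(gap=30.0, rel_speed=-1.2, actual_accel=0.4,
                new_command=0.6))
```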
ISBN (Print): 9789881563972
Although the adaptive dynamic programming (ADP) scheme has been widely studied for optimal control problems in recent years, it has not been applied to servo systems. In this paper, a simplified reinforcement learning (RL) based ADP scheme is developed to obtain the optimal tracking control of a servo system, where the unknown system dynamics are approximated with a three-layer neural network (NN) identifier. First, the servo system model is constructed, and a three-layer NN identifier is used to approximate the unknown servo system; the NN weights of both the hidden layer and the output layer are tuned synchronously with an adaptive gradient law. An RL-based critic NN is then used to learn the optimal cost function, with its weights updated by minimizing the squared Hamilton-Jacobi-Bellman (HJB) error. The optimal tracking control of the servomechanism is obtained based on the three-layer NN identifier and the RL scheme, making the motor speed track a predefined command. Moreover, the convergence of the identifier and the NN weights is proved. Finally, a servomechanism model is provided to illustrate the proposed methods.
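As a toy illustration of the critic update, the snippet below performs gradient descent on the squared HJB residual for a linear-in-features value function; the basis, cost, and step sizes are placeholders rather than the paper's NN structure.

```python
import numpy as np

# Critic V(x) ~ W . phi(x); W is driven down the gradient of the
# squared discrete-time HJB residual  e = r*dt + V(x') - V(x).
def phi(x):                       # simple quadratic feature basis
    return np.array([x[0]**2, x[0]*x[1], x[1]**2])

def critic_step(W, x, x_next, r, dt, lr=1e-2):
    e = r * dt + W @ phi(x_next) - W @ phi(x)   # HJB residual
    grad = e * (phi(x_next) - phi(x))           # d(e^2/2)/dW
    return W - lr * grad

W = np.zeros(3)
W = critic_step(W, x=np.array([1.0, 0.0]),
                x_next=np.array([0.9, -0.1]), r=1.0, dt=0.01)
print(W)
```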
ISBN (Print): 9781728111643
Lateral control design is one of the fundamental components of self-driving cars. In this paper, we propose a learning-based control strategy that enables a mobile car equipped with a camera to perform lane keeping on a road. Using the method of adaptive dynamic programming, the proposed control algorithm exploits structural knowledge of the car kinematics as well as the collected data (images) containing lane information. An adaptive optimal lateral controller is obtained through a data-driven learning algorithm. The effectiveness of the proposed method is demonstrated by theoretical stability proofs and experimental evaluations.
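For orientation, the control law amounts to state feedback on the lane-tracking error. The sketch below shows only that structure with a fixed placeholder gain; the data-driven ADP iteration that actually learns the gain is not reproduced.

```python
import numpy as np

# Lateral control as feedback on the camera-derived error state
# z = [lateral offset, heading error]; K is a placeholder here,
# whereas the paper learns an adaptive optimal gain from data.
def steering_command(K, lateral_offset, heading_error, limit=0.5):
    z = np.array([lateral_offset, heading_error])
    return float(np.clip(-K @ z, -limit, limit))  # saturated steering

K = np.array([0.8, 1.5])                          # invented gain
print(steering_command(K, lateral_offset=0.3, heading_error=-0.05))
```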
ISBN (Digital): 9781728125473
ISBN (Print): 9781728125480
In practice, it is quite common to face combinatorial optimization problems that combine uncertainty with non-determinism and dynamicity. These three properties call for appropriate algorithms; reinforcement learning (RL) deals with them in a very natural way. Today, despite some efforts, most real-life combinatorial optimization problems remain out of reach of reinforcement learning algorithms. In this paper, we propose a reinforcement learning approach to a realistic scheduling problem and apply it to an algorithm commonly executed in the high-performance computing community, the Cholesky factorization. In contrast to static scheduling, where tasks are assigned to processors in a predetermined ordering before the parallel execution begins, our method is dynamic: task allocations and their execution ordering are decided at runtime based on the system state and unexpected events, which allows much more flexibility. To do so, our algorithm uses graph neural networks in combination with an actor-critic algorithm (A2C) to build an adaptive representation of the problem on the fly. We show that this approach is competitive with state-of-the-art heuristics used in high-performance computing runtime systems. Moreover, our algorithm does not require an explicit model of the environment, but we demonstrate that extra knowledge can easily be incorporated and improves performance. We also exhibit key properties provided by this RL approach and study its transfer abilities to other instances.
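The object the scheduler works on is the task DAG of the tiled Cholesky factorization. The sketch below generates the read dependencies of the standard tiled algorithm (POTRF/TRSM/SYRK/GEMM); the task naming is ours, accumulation (write-after-write) orderings are omitted for brevity, and the GNN encoding and A2C policy are not shown.

```python
# Task ("GEMM", i, j, k) updates tile (i, j) at step k, and so on;
# each entry maps a task to the set of tasks whose outputs it reads.
def cholesky_task_graph(n_tiles):
    deps = {}
    for k in range(n_tiles):
        deps[("POTRF", k)] = {("SYRK", k, j) for j in range(k)}
        for i in range(k + 1, n_tiles):
            deps[("TRSM", i, k)] = ({("POTRF", k)}
                                    | {("GEMM", i, k, j) for j in range(k)})
        for i in range(k + 1, n_tiles):
            deps[("SYRK", i, k)] = {("TRSM", i, k)}
            for j in range(k + 1, i):
                deps[("GEMM", i, j, k)] = {("TRSM", i, k), ("TRSM", j, k)}
    return deps

g = cholesky_task_graph(3)
print(len(g), "tasks")   # 10 tasks for a 3x3 tiling
```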
ISBN (Print): 9781728140049
In robot-assisted rehabilitation, assist-as-needed (AAN) controllers have been proposed to promote subjects' active participation, which is thought to lead to better training outcomes. Most of these AAN controllers require patient-specific manual tuning of the parameters defining the underlying force field, which typically results in a tedious and time-consuming process. In this paper, we propose a reinforcement-learning-based impedance controller that actively reshapes the stiffness of the force field according to the subject's performance, while providing assistance only when needed. This adaptability is made possible by correlating the subject's most recent performance with the ultimate control objective in real time. In addition, the proposed controller is built upon action-dependent heuristic dynamic programming with an actor-critic structure, and therefore does not require prior knowledge of the system model. The controller is experimentally validated with healthy subjects in a simulated ankle mobilization training session using a powered ankle-foot orthosis.
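A schematic of the adaptive force field helps fix ideas: the orthosis renders a virtual spring toward the target trajectory, and the stiffness is raised or relaxed based on recent tracking performance. The rule below is a crude stand-in for the ADHDP actor-critic update, with invented names and constants.

```python
# Assist-as-needed torque: a virtual spring toward the target posture.
def assist_torque(k_stiff, q_target, q_actual):
    return k_stiff * (q_target - q_actual)

# Placeholder adaptation rule standing in for the actor-critic update:
# stiffen when recent error exceeds tolerance, relax otherwise.
def update_stiffness(k_stiff, recent_err, tolerance, rate=5.0):
    return max(0.0, k_stiff + rate * (recent_err - tolerance))

k = 10.0
k = update_stiffness(k, recent_err=0.15, tolerance=0.05)
print(k, assist_torque(k, q_target=0.3, q_actual=0.1))
```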
ISBN (Print): 9781450366694
As technology scales, Networks-on-Chip (NoCs), currently used for on-chip communication in manycore architectures, face several problems including high network latency, excessive power consumption, and low reliability. Simultaneously addressing these problems is proving difficult due to the explosion of the design space and the complexity of handling many trade-offs. In this paper, we propose IntelliNoC, an intelligent NoC design framework that introduces architectural innovations and uses reinforcement learning to manage the design complexity and simultaneously optimize performance, energy efficiency, and reliability in a holistic manner. IntelliNoC integrates three NoC architectural techniques: (1) multi-function adaptive channels (MFACs) to improve energy efficiency; (2) adaptive error detection/correction and retransmission control to enhance reliability; and (3) a stress-relaxing bypass feature that dynamically powers off NoC components to prevent overheating and fatigue. To handle the complex dynamic interactions induced by these techniques, we train a dynamic control policy using Q-learning, with the goal of providing improved fault tolerance and performance while reducing power consumption and area overhead. Simulation using PARSEC benchmarks shows that our proposed IntelliNoC design improves energy efficiency by 67% and mean time to failure (MTTF) by 77%, and decreases end-to-end packet latency by 32% and area requirements by 25% over a baseline NoC architecture.
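The control policy is tabular Q-learning over NoC configuration actions. A bare-bones version of that loop is sketched below; the state encoding, action set, and reward are illustrative placeholders, not IntelliNoC's actual design.

```python
import random
from collections import defaultdict

# Q-learning over hypothetical NoC control actions (channel mode,
# error-correction strength, bypass); all names here are placeholders.
ACTIONS = ["mfac_mode", "stronger_ecc", "bypass_component", "noop"]
Q = defaultdict(float)

def choose(state, eps=0.1):
    if random.random() < eps:
        return random.choice(ACTIONS)              # explore
    return max(ACTIONS, key=lambda a: Q[(state, a)])

def update(state, action, reward, next_state, alpha=0.1, gamma=0.9):
    best_next = max(Q[(next_state, a)] for a in ACTIONS)
    Q[(state, action)] += alpha * (reward + gamma * best_next
                                   - Q[(state, action)])

a = choose("hot_and_congested")
update("hot_and_congested", a, reward=-1.0, next_state="cool_and_idle")
```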
ISBN (Print): 9783903176201
Packet routing is one of the fundamental problems in computer networks, in which a router determines the next hop of each packet in its queue so as to deliver it to its destination as quickly as possible. Reinforcement learning has been introduced to design autonomous packet routing policies, namely Q-routing, using only local information available to each router. However, the curse of dimensionality in Q-routing prohibits a more comprehensive representation of dynamic network states, limiting the potential benefit of reinforcement learning. Inspired by the recent success of deep reinforcement learning (DRL), we embed deep neural networks in multi-agent Q-routing. Each router possesses an independent neural network that is trained without communicating with its neighbors and makes decisions locally. Two multi-agent DRL-enabled routing algorithms are proposed: one simply replaces the Q-table of vanilla Q-routing with a deep neural network, and the other further employs extra information, including the past actions and the destinations of non-head-of-line packets. Our simulation shows that directly substituting a deep neural network for the Q-table may not yield minimal delivery delays, because the neural network cannot learn more from the same input. When more information is utilized, the adaptive routing policy can converge and significantly reduce packet delivery time.
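The first variant, swapping the Q-table for a network, can be sketched with a tiny per-router value network that maps packet and queue features to one Q-value per neighbor; the architecture and feature set below are guesses for illustration only.

```python
import numpy as np

# Per-router network: features (e.g., encoded destination and queue
# occupancy) in, one Q-value per candidate next hop out. A single
# hidden ReLU layer stands in for the paper's actual architecture.
class RouterDQN:
    def __init__(self, n_features, n_neighbors, hidden=16, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (hidden, n_features))
        self.W2 = rng.normal(0.0, 0.1, (n_neighbors, hidden))

    def q_values(self, features):
        h = np.maximum(0.0, self.W1 @ features)   # ReLU hidden layer
        return self.W2 @ h                        # one Q per next hop

router = RouterDQN(n_features=8, n_neighbors=4)
print(np.argmax(router.q_values(np.ones(8))))     # chosen next-hop index
```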
ISBN (Digital): 9781728144900
ISBN (Print): 9781728144917
This paper addresses Virtualized Network Function Forwarding Graph (VNF-FG) embedding with the objective of realizing long-term reward, in contrast to placement algorithms that aim at instantaneously optimal placement. The long-term reward is obtained using reinforcement learning (RL), following a Markov Decision Process (MDP) model enhanced through the injection of expert knowledge into the learning process. A comparison with an Integer Linear Programming (ILP) approach, a reduced-candidate-set variant (R-ILP), and an algorithm that treats requests in batches reveals the potential improvements of the RL approach. The instantaneous and short-term reward solutions are efficient only at finding instant solutions, as they make decisions based solely on the current infrastructure status for a given request (or batch of requests) at a time; they perform well for present conditions but do not anticipate future requests. RL instead possesses the learning and anticipation capabilities lacking in instantaneous and snapshot optimizations. A reinforcement learning based approach, called EQL (Enhanced Q-learning), aiming at balancing the load on hosting infrastructures, is proposed to achieve the desired longer-term reward. EQL employs RL to learn the network and control it based on the usage patterns of the physical resources. Results from extensive simulations, based on realistic and large-scale topologies, report the superior performance of EQL in terms of acceptance rate, quality, scalability, and achieved gains.
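One plausible reading of the expert-knowledge injection is to seed the Q-values from a load-balancing heuristic instead of zeros, so that early exploration is biased toward lightly loaded hosts. The snippet below illustrates that seeding; hosts, loads, and scores are invented.

```python
# Seed Q-values for placing a VNF: hosts with lower utilization start
# with higher values, encoding the load-balancing expert prior.
HOSTS = ["h1", "h2", "h3"]
LOAD = {"h1": 0.2, "h2": 0.9, "h3": 0.5}   # fictitious utilizations

def seeded_q(requests):
    return {(r, h): 1.0 - LOAD[h] for r in requests for h in HOSTS}

Q = seeded_q(requests=["vnf_request_0"])
best = max(HOSTS, key=lambda h: Q[("vnf_request_0", h)])
print(best)   # "h1": least loaded, so highest seeded Q-value
```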
ISBN (Digital): 9781728171401
ISBN (Print): 9781728171418
We present a model-free optimal control design for electric power systems with unknown transmission network and load models to improve their dynamic performance using techniques from reinforcement learning (RL) and adaptive dynamic programming (ADP). We consider different persistent disturbances in the grid, including ambient oscillations resulting from load fluctuations and their effects on exciter voltage regulation loops. We also consider forced-oscillation scenarios that frequently occur due to malfunctioning governor valves. Our proposed RL algorithm recovers the optimal feedback response in spite of all these disturbances in a completely model-free way, using online measurements of the states, inputs, and disturbances. The design is validated using the IEEE benchmark 39-bus, 10-generator New England power system model perturbed with different ambient and forced oscillations.
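Structurally, the learned controller is a state-feedback law applied to measured deviations, while disturbances are logged for the learning updates. The fragment below shows only that feedback structure in our own notation; the off-policy ADP iteration that produces the gain is not reproduced.

```python
import numpy as np

# Feedback on measured state deviations; K is a placeholder constant
# here, whereas the paper learns it online from measured states,
# inputs, and disturbances without a grid model.
def control_input(K, x_deviation):
    return -K @ x_deviation

K = np.array([[1.0, 0.5]])               # invented gain for illustration
x = np.array([0.02, -0.01])              # e.g. angle/speed deviations
print(control_input(K, x))
```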