Search Results - Inner Mongolia University Library

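Help

Field codes:

T = title (book title, article title), A = author (responsible party), K = subject term, P = publication name, PU = publisher name, O = organization (author affiliation, degree-granting institution, patent applicant), L = Chinese Library Classification number, C = subject classification number, U = all fields, Y = year (publication year, degree year, standard release year)

Search rules:

AND means "and"; OR means "or"; NOT means "does not contain". (Operators must be uppercase, with one space on each side.)

Search examples:

Example 1: (K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
Example 2: P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016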
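
The search rules above can be sketched as a few string-building helpers. This is only an illustration of the documented syntax (field code, `=`, value, uppercase operators with a space on each side); the helper names `term`, `any_of`, and `all_of` are hypothetical and are not part of the library's system.

```python
# Field codes copied from the help text; any other code is rejected.
FIELD_CODES = {"T", "A", "K", "P", "PU", "O", "L", "C", "U", "Y"}


def term(field: str, value: str) -> str:
    """Format one FIELD=value term after checking the field code."""
    if field not in FIELD_CODES:
        raise ValueError(f"unknown field code: {field!r}")
    return f"{field}={value}"


def any_of(*terms: str) -> str:
    """Join terms with OR and wrap the group in parentheses."""
    return "(" + " OR ".join(terms) + ")"


def all_of(*terms: str) -> str:
    """Join terms with AND (uppercase, one space on each side)."""
    return " AND ".join(terms)


# Rebuild Example 1 from the help text.
query = all_of(
    any_of(term("K", "图书馆学"), term("K", "情报学")),
    term("A", "范并思"),
    term("Y", "1982-2016"),
)
print(query)
```

Building the expression from helpers rather than by hand keeps the operators uppercase and correctly spaced, which the rules require.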

Refine search results

Document type

128 journal articles
71 conference papers

Collection scope

199 electronic documents
0 print holdings

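Subject classification

181 Engineering
- 91 Computer Science and Technology...
- 79 Electrical Engineering
- 41 Control Science and Engineering
- 37 Information and Communication Engineering
- 18 Oil and Natural Gas Engineering
- 12 Software Engineering
- 11 Mechanical Engineering
- 10 Instrument Science and Technology
- 8 Electronic Science and Technology (...
- 7 Power Engineering and Engineering Thermo...
- 5 Transportation Engineering
- 4 Materials Science and Engineering (...
- 4 Naval Architecture and Ocean Engineering
- 3 Civil Engineering
- 3 Environmental Science and Engineering (...
- 2 Architecture
- 2 Hydraulic Engineering
- 2 Aeronautical and Astronautical Science and Tech...
- 1 Optical Engineering
36 Management
- 34 Management Science and Engineering (...
- 5 Business Administration
23 Science
- 12 Mathematics
- 5 Physics
- 5 Systems Science
- 3 Chemistry
- 2 Biology
- 1 Marine Science
5 Economics
- 4 Applied Economics
- 2 Theoretical Economics
2 Education
- 2 Education
1 Agriculture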

Topic

199 Q-learning algor...
52 reinforcement le...
12 learning (artifi...
11 optimization
10 path planning
9 markov decision ...
8 Q-learning
7 quality of servi...
6 heuristic algori...
5 convergence
5 mobile robot
5 resource allocat...
5 machine learning
5 dynamic scheduli...
4 internet of thin...
4 task analysis
4 automatic genera...
4 radio networks
4 jamming attack
4 cognitive radio ...

Institution

3 natl taiwan univ...
3 mississippi stat...
2 hong kong polyte...
2 s china univ tec...
2 northeastern uni...
2 aristotle univ t...
2 nanjing tech uni...
2 northwestern pol...
2 univ sains malay...
2 nanyang technol ...
2 kun shan univ te...
2 mil acad tunisia...
2 nagoya univ dept...
2 hong kong polyte...
2 china commun inf...
2 jiangsu normal u...
1 nanjing univ pos...
1 beijing inst tec...
1 hainan inst zhej...
1 hangzhou dianzi ...

Author

3 scheers bart
3 stebel krzysztof
3 suandi shahrel a...
3 samma hussein
3 mohamad-saleh ju...
3 slimeni feten
3 chen jiann-liang
3 chtourou zied
3 le nir vincent
2 li ji
2 wang xingwei
2 xu yan
2 liu dexing
2 musial jakub
2 xu zhao
2 yang songpo
2 czeczot jacek
2 attia rabah
2 lu en
2 noori amin

Language

193 English
4 Other
2 Chinese
1 German

Search query: "Subject term=Q-learning algorithm"

199 records in total; showing records 151-160

A UAV Dynamic Path Planning Algorithm

35th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)

Authors: Hou, Xiaojian; Liu, Fei; Wang, Renjie; Yu, Yao

Affiliations: Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China; Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing Engn Res Ctr Ind Spectrum Imaging, Beijing 100083, Peoples R China

ISBN: 9781728176840 (print)

In this paper, we propose a UAV dynamic path planning algorithm to solve the path planning problem of a single UAV in a dynamic environment. The contributions of this paper are twofold: (1) global and local path planning are combined to improve planning efficiency; (2) an improved Q-learning algorithm and the artificial potential field method are combined to solve the problem that effective path planning cannot be performed between two path nodes. Finally, simulation results in Matlab demonstrate the effectiveness of the algorithm.

Keywords: UAV path planning; Q-learning algorithm; artificial potential field method

Adaptive Inventory Control and Bullwhip Effect Analysis for Supply Chains with Non-stationary Demand

27th Chinese Control and Decision Conference (CCDC)

Authors: Yang, Songpo; Zhang, Jihui

Affiliations: Qingdao Univ, Inst Complex Sci, Qingdao 266071, Peoples R China

ISBN: 9781479970162 (print)

In this paper, two adaptive inventory control models, one centralized and one decentralized, were established for a multi-echelon, multi-cycle supply chain consisting of one supplier and one retailer with non-stationary stochastic demand. In the centralized model, the supplier used a vendor managed inventory replenishment policy and the retailer did not keep any stock. The supplier used an improved exponential smoothing method to forecast future demand, the EOQ model to determine the replenishment quantity for the retailer, and an adaptive approach to determine its safety stock against demand fluctuation. A reinforcement learning algorithm was adopted to select a proper safety factor according to the stochastic demand. In the decentralized model, by contrast, both the supplier and the retailer hold their own inventory and safety stock; that is, they control their own inventory independently. In both cases, the aim is to satisfy the given target service level. In the simulation study, two types of demand patterns, stationary and non-stationary, are considered. The bullwhip effect generated in the course of forecasting and processing demand information was analyzed. The results show that the proposed method can satisfy the given service level and mitigate the bullwhip effect to some extent.

Keywords: Vendor Managed Inventory; Q-learning algorithm; Bullwhip Effect

Reinforcement Learning Algorithms in Global Path Planning for Mobile Robot

International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM)

Authors: Sichkar, Valentyn N.

Affiliations: ITMO Univ, Dept Control Syst & Robot, St Petersburg, Russia

ISBN: 9781538681190 (print)

The paper is devoted to the research of two approaches to global path planning for mobile robots, based on the Q-learning and Sarsa algorithms. The study was carried out with different adjustments of the two algorithms that made it possible to learn faster. The implementation of the two Reinforcement Learning algorithms showed differences in learning time and in the methods of building a path to avoid obstacles and reach a destination point. The analysis of the obtained results made it possible to select optimal parameters of the considered algorithms for the tested environments. Experiments were performed in virtual environments where the algorithms learned which steps to choose in order to get a maximum payoff and reach the goal while avoiding obstacles.

Keywords: reinforcement learning; Q-learning algorithm; Sarsa algorithm; path planning; mobile agent

Agent-Based Simulation of Power Markets under Uniform and Pay-as-Bid Pricing Rules using Reinforcement Learning

IEEE/PES Power Systems Conference and Exposition

Authors: Bakirtzis, Anastasios G.; Tellidou, Athina C.

Affiliations: Aristotle Univ Thessaloniki, Dept Elect & Comp Engn, Thessaloniki 54124, Greece

ISBN: 9781424401772 (print)

In this paper, agent-based simulation is employed to study power market operation under two alternative pricing systems: uniform and discriminatory (pay-as-bid). Power suppliers are modeled as adaptive agents capable of learning through interaction with their environment, following a Reinforcement Learning algorithm. The SA-Q-learning algorithm, a slightly changed version of the popular Q-learning, is used in this paper; it proposes a solution to the difficult problem of the balance between exploration and exploitation, and it has been chosen for its quick convergence. A test system with five supplier-agents is used to study the suppliers' behavior under the uniform and the pay-as-bid pricing systems.

Keywords: electricity spot markets; multi-agent modeling; pay-as-bid; Q-learning algorithm; uniform pricing

High-speed Train Timetabling Based on Reinforcement Learning

IEEE Symposium Series on Computational Intelligence (IEEE SSCI)

Authors: Yang, Wanlu; Jiang, Peng; Song, Shiji

Affiliations: Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China; Tsinghua Univ, BNRist, Beijing 100084, Peoples R China

ISBN: 9781665487689 (print)

Chinese high-speed railway has developed rapidly in a more intelligent and automatic direction over the past few decades. In this paper, we consider the optimization problem of the train timetable for the high-speed railway to minimize the total train waiting time and total station occupied time. To deal with time-related constraints, we first establish the train operation environment based on a Discrete Event Dynamic System (DEDS). Then, we reformulate the timetabling problem as a Markov Decision Process (MDP) problem and propose an improved Q-learning approach that redesigns the Q-value function to solve the problem. Finally, we consider the Beijing-Shanghai high-speed railway as a numerical example, where the passenger flow and train running time are stochastic. We empirically show that our Q-learning method reduces total waiting time by over 30% and total occupied time by 1.9% compared with the well-known First-Come-First-Service (FCFS) scheduling strategy.

Keywords: High-speed railway; Train timetable; Markov decision process; Q-learning algorithm

Privacy-Cost Management in Smart Meters Using Deep Reinforcement Learning

10th IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe) - Smart Grids - Key Enablers of a Green Power System

Authors: Shateri, Mohammadhadi; Messina, Francisco; Piantanida, Pablo; Labeau, Fabrice

Affiliations: McGill Univ, Montreal, PQ, Canada; Univ Paris Sud, CNRS, Cent Supelec, Gif Sur Yvette, France

ISBN: 9781728171005 (print)

Smart meters (SMs) play a pivotal role in the smart grid by being able to report the electricity usage of consumers to the utility provider (UP) almost in real time. However, this could leak sensitive information about the consumers to the UP or a third party. Recent works have leveraged the availability of energy storage devices, e.g., a rechargeable battery (RB), in order to provide privacy to the consumers with minimal additional energy cost. In this paper, a privacy-cost management unit (PCMU) is proposed based on a model-free deep reinforcement learning algorithm, called deep double Q-learning (DDQL). Empirical results evaluated on actual SMs data are presented to compare DDQL with the state of the art, i.e., classical Q-learning (CQL). Additionally, the performance of the method is investigated for two concrete cases where attackers aim to infer the actual demand load and the occupancy status of dwellings. Finally, an abstract information-theoretic characterization is provided.

Keywords: Smart meters privacy; Privacy-cost trade-off; Deep reinforcement learning; Q-learning algorithm; Deep double Q-learning; Privacy-cost management unit

A Reinforcement Learning Approach to Dynamic Optimization of Load Allocation in AGC System

General Meeting of the IEEE-Power-and-Energy-Society

Authors: Wang, Y. M.; Liu, Q. J.; Yu, T.

Affiliations: S China Univ Technol, Elect Power Coll, Guangzhou, Guangdong, Peoples R China

ISBN: 9781424442409 (print)

A Reinforcement Learning (RL) method applied to dynamic load allocation in an AGC system is presented. The problem can be modeled as a Markov Decision Process (MDP). The Q-learning algorithm is introduced as a model-free learning algorithm. It learns an optimal action strategy by experience from exploring an unknown system and getting rewards. Rewards are chosen to express how well actions control the system. Applications of the Q-learning algorithm to the two-area power system model and the China Southern Power Grid model are presented. The case study shows that the Q-learning algorithm enhances the performance of the AGC system under CPS.

Keywords: Reinforcement learning; Q-learning algorithm; dynamic load allocation; MDP; CPS

QoS-Aware Heterogeneous Networking Using Distributed Multiagent Schemes

7th IEEE International Wireless Communications and Mobile Computing Conference (IWCMC)

Authors: Chen, Jiann-Liang; Larosa, Yanuarius Teofilus; Deng, Der-Jiunn; Yang, Pei-Jia; Ma, Yi-Wei

Affiliations: Natl Changhua Univ Educ, Dept Comp Sci & Informat Engn, Taipei, Taiwan; Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei, Taiwan; Natl Cheng Kung Univ, Dept Engn Sci, Tainan, Taiwan

ISBN: 9781424495375 (print)

This study achieves Quality-of-Service (QoS) management in heterogeneous networking using a distributed multiagent scheme (DMAS) based on the concept of cooperation and the awareness algorithm. The proposed scheme is developed to support QoS management in a user-accepted and cost-effective fashion. It consists of a collection of problem-solving agents with three modules built into the scheme: the knowledge source, the in-cloud blackboard system, and the control engine. A set of problem-solving agents autonomously process local tasks and cooperatively interoperate via an in-cloud blackboard system to guarantee QoS. An awareness algorithm, called the Q-learning algorithm, calculates the expected rewards of a handoff to all access networks. These rewards are then used by the problem-solving agents to determine what to do. Through operations and cooperation among the active agents, a policy is selected and a user-accepted schedule that meets the specified QoS is generated. Compared with traditional QoS management mechanisms, the proposed DMAS scheme has a 36% lower packet loss ratio in video streaming applications and a 34% lower average delay in VoIP applications, with only a minor sacrifice in system computational complexity.

Keywords: Heterogeneous Network; Cooperative Networking; Distributed Multi-Agent Scheme; Cloud Computing; Q-learning algorithm; Quality of Service

Cognitive Radio Jamming Mitigation using Markov Decision Process and Reinforcement Learning

International Conference on Advanced Wireless Information and Communication Technologies (AWICT)

Authors: Slimeni, Feten; Scheers, Bart; Chtourou, Zied; Le Nir, Vincent; Attia, Rabah

Affiliations: Mil Acad Tunisia, VRIT Lab, Nabeul 8000, Tunisia; Royal Mil Acad, CISS Dept, B-1000 Brussels, Belgium; EPT Univ Carthage, SERCOM Lab, Marsa 2078, Tunisia

Cognitive radio technology is a promising solution to the imbalance between scarcity and underutilization of the spectrum. However, this technology is susceptible to both classical and advanced jamming attacks, which can prevent it from efficiently exploiting the free frequency bands. In this paper, we explain how a cognitive radio can exploit its ability of dynamic spectrum access and its learning capabilities to avoid jammed channels. We start with the definition of jamming attacks in cognitive radio networks and give a review of their potential countermeasures. Then, we model the cognitive radio behavior in the suspicious environment as a Markov decision process. To solve this optimization problem, we implement the Q-learning algorithm in order to learn the jammer strategy and to proactively avoid jammed channels. We present the limits of this algorithm in the cognitive radio context and propose a modified version to speed up learning a safe strategy. The effectiveness of this modified algorithm is evaluated by simulations and compared to the original Q-learning algorithm. (C) 2015 The Authors. Published by Elsevier B.V.

Keywords: Cognitive radio network; jamming attack; Q-learning algorithm

Synchronization of Probabilistic Boolean Networks under State-Flipped Control

39th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)

Authors: Bian, Chenyang; Du, Leihao; Zhang, Zhipeng

Affiliations: Tiangong Univ, Sch Control Sci & Engn, Tianjin, Peoples R China; Tiangong Univ, Sch Comp Sci & Engn, Tianjin, Peoples R China; Tiangong Univ, Sch Artificial Intelligence, Tianjin, Peoples R China

ISBN: 9798350390780 (print); 9798350379228

This paper investigates the synchronization problem of probabilistic Boolean networks (PBNs) under state-flipped control. First, by flipping some of the nodes, the entire state space is transferred to a synchronous state set. Some verification conditions for the synchronization of PBNs are proposed. Second, a Q-learning (QL) algorithm for synchronizing PBNs in finite flip control is given, and the minimum flip set is obtained. Finally, numerical simulations are performed to verify the feasibility of the conclusions.

Keywords: Probabilistic Boolean networks; State flip control; Q-learning algorithm; Synchronization



235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
