检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

128 篇 期刊文献
71 篇 会议

馆藏范围

199 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

181 篇 工学
- 92 篇 计算机科学与技术...
- 78 篇 电气工程
- 41 篇 控制科学与工程
- 39 篇 信息与通信工程
- 18 篇 石油与天然气工程
- 12 篇 软件工程
- 11 篇 机械工程
- 10 篇 仪器科学与技术
- 9 篇 动力工程及工程热...
- 9 篇 电子科学与技术（可...
- 5 篇 材料科学与工程（可...
- 4 篇 交通运输工程
- 4 篇 船舶与海洋工程
- 3 篇 土木工程
- 3 篇 环境科学与工程（可...
- 2 篇 建筑学
- 2 篇 水利工程
- 2 篇 航空宇航科学与技...
- 1 篇 光学工程
36 篇 管理学
- 34 篇 管理科学与工程(可...
- 5 篇 工商管理
23 篇 理学
- 12 篇 数学
- 5 篇 物理学
- 5 篇 系统科学
- 3 篇 化学
- 2 篇 生物学
- 1 篇 海洋科学
5 篇 经济学
- 4 篇 应用经济学
- 2 篇 理论经济学
2 篇 教育学
- 2 篇 教育学
1 篇 农学

主题

199 篇 q-learning algor...
51 篇 reinforcement le...
11 篇 path planning
11 篇 optimization
10 篇 learning (artifi...
9 篇 markov decision ...
8 篇 q-learning
7 篇 quality of servi...
6 篇 heuristic algori...
5 篇 convergence
5 篇 mobile robot
5 篇 resource allocat...
5 篇 machine learning
5 篇 dynamic scheduli...
4 篇 internet of thin...
4 篇 task analysis
4 篇 automatic genera...
4 篇 radio networks
4 篇 jamming attack
4 篇 cognitive radio ...

机构

3 篇 natl taiwan univ...
3 篇 mississippi stat...
2 篇 hong kong polyte...
2 篇 s china univ tec...
2 篇 northeastern uni...
2 篇 aristotle univ t...
2 篇 nanjing tech uni...
2 篇 northwestern pol...
2 篇 univ sains malay...
2 篇 nanyang technol ...
2 篇 kun shan univ te...
2 篇 mil acad tunisia...
2 篇 nagoya univ dept...
2 篇 hong kong polyte...
2 篇 china commun inf...
2 篇 jiangsu normal u...
1 篇 nanjing univ pos...
1 篇 beijing inst tec...
1 篇 hainan inst zhej...
1 篇 hangzhou dianzi ...

作者

3 篇 scheers bart
3 篇 stebel krzysztof
3 篇 suandi shahrel a...
3 篇 samma hussein
3 篇 mohamad-saleh ju...
3 篇 slimeni feten
3 篇 chen jiann-liang
3 篇 chtourou zied
3 篇 le nir vincent
2 篇 li ji
2 篇 wang xingwei
2 篇 xu yan
2 篇 liu dexing
2 篇 musial jakub
2 篇 xu zhao
2 篇 yang songpo
2 篇 czeczot jacek
2 篇 attia rabah
2 篇 lu en
2 篇 noori amin

语言

192 篇 英文
4 篇 其他
2 篇 中文

检索条件"主题词=Q-Learning algorithm"

共 199 条记录，以下是151-160 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Study on motion forms of a two-dimensional mobile robot by using reinforcement learning

Study on motion forms of a two-dimensional mobile robot by u...

引用

SICE-ICASE International Joint Conference

作者： Jung, Youngmi Inoue, Masashi Hara, Masayuki Huang, Jian Yabuta, Tetsuro Yokohama Natl Univ Dept Mech Engn Grad Sch Engn Yokohama Kanagawa 240 Japan

ISBN: (纸本)9788995003848

The main advantage of Reinforcement learning is that it provides unexpected solutions for a designer. This study shows how a mobile robot can obtain unexpected motion forms by using Reinforcement learning. Results show the the mobile robot with two-dimensional mobile ability can obtain unexpected motion forms for both advance motion and rotation motion. The mechanisms for these motions were investigated in order to understand how to obtain these motions. Moreover, since this system has a two-dimensional factor, this study examines the learning characteristic for the oblivion of the learning knowledge. In addition, this study examines the learning of the knowledge manipulation method to obtain new learning results with respect to the two-dimensional factor.

关键词： Reinforcement learning method q-learning algorithm mobile robot new learning motion forms

来源：评论

学校读者我要写书评

暂无评论

A GREEN INTELLIGENT UNICAST ROUTING algorithm

A GREEN INTELLIGENT UNICAST ROUTING ALGORITHM

引用

5th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT)

作者： Zhang, Jinhong Wang, Xingwei Huang, Min Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Peoples R China

ISBN: (纸本)9781479900930

For the past years, the energy overconsumption problems, arising from the rapid growth of Internet scale and services types, are becoming more and more serious. In this context, the Information and Communication Technology (ICT) sector has given an extensive concern to the research on green networking. From an energy-saving point of view for Internet, this paper designs a power consumption model and a qoS model for unicast. Furthermore, this paper proposes a unicast routing algorithm based on Chandy-Misra algorithm and q-learning algorithm for green Internet, which is compared with an ant colony algorithm based self-adaptive energy saving routing with respect to power consumption, the success rate of routing and running time. Results show the intelligent unicast routing algorithm proposed can effectively reduce network energy consumption, while guaranteeing the good performance.

关键词： Green Internet Unicast routing Energy saving Chandy-Misra algorithm q-learning algorithm

来源：评论

学校读者我要写书评

暂无评论

Dynamic Joint Decision on Price and Delivery Date in MTO Manufacturer Based on Agent

Dynamic Joint Decision on Price and Delivery Date in MTO Man...

引用

3rd International Conference on Energy, Environment and Sustainable Development (EESD 2013)

作者： Hao, Juan Yu, Jianjun Wu, Miancan South China Univ Technol Sch Business Adm Guangzhou Guangdong Peoples R China Guangdong Univ Foreign Studies Cisco Sch Informat Guangzhou Peoples R China

ISBN: (纸本)9783037859728

In order to maximize the total profit and improve the service level, based on the perspective of queuing theory, a new approach for dynamic joint decision on price and delivery date in Make-to-order (MTO) manufacturing firms using q-learning algorithm was proposed. Compared with static price and delivery date policy, the simulation results show that the proposed algorithm performs better in total profit and service level. The total profit does not increase with the growing number of accepted orders and the number of accepted orders must match the production capacity.

关键词： price/delivery date dynamic joint decision q-learning algorithm MTO manufacturing

来源：评论

学校读者我要写书评

暂无评论

A UAV Dynamic Path Planning algorithm 35

A UAV Dynamic Path Planning Algorithm

引用

35th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)

作者： Hou, Xiaojian Liu, Fei Wang, Renjie Yu, Yao Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing Engn Res Ctr Ind Spectrum Imaging Beijing 100083 Peoples R China

ISBN: (纸本)9781728176840

In this paper, we propose a UAV dynamic path planning algorithm to solve the path planning problem of a single UAV in a dynamic environment. The contributions of this paper mainly include the following two folds: (1) Using a combination of global and local path planning to improve planning efficiency. (2) The improved q-learning algorithm and artificial potential field method are combined to solve the problem that effective path planning cannot be performed between two path nodes. Finally, simulation results with Matlab proves the effectiveness of the algorithm.

关键词： UAV path planning q-learning algorithm artificial potential field method

来源：评论

学校读者我要写书评

暂无评论

Adaptive Inventory Control and Bullwhip Effect Analysis for Supply Chains with Non-stationary Demand 27

Adaptive Inventory Control and Bullwhip Effect Analysis for ...

引用

27th Chinese Control and Decision Conference (CCDC)

作者： Yang, Songpo Zhang, Jihui Qingdao Univ Inst Complex Sci Qingdao 266071 Peoples R China

ISBN: (纸本)9781479970162

In this paper, two adaptive inventory control models, i.e. centralized and decentralized respectively, for a multi-echelon multi-cycle supply chain consisting of one supplier and one retailer with non-stationary stochastic demand were established, In the centralized model, the vendor managed inventory replenishment policy was used by the supplier and the retailer didn't keep any stock, An improved exponential smoothing method was used by the supplier to forecast the future demand. The EOq model was used by the supplier to determine the replenishment quantity for the retailer and an adaptive approach was used by the supplier to determine his safety stock to against demand fluctuation. An reinforcement learning algorithm was adopted to select an proper safety factor according to the stochastic demand. On the contrary in the decentralized model, both the supplier and the retailers hold their own inventory and safety stock for themselves respectively. That is, they control their own inventory independently. In both cases, the aim is to satisfy the given target service level predefined. In our simulation study two types of demand patterns, stationary and non-stationary demand, are considered respectively. The bullwhip effect generated in the course of forecasting and processing of demand information were analyzed. The results show that the proposed method can satisfy the given service level and mitigate the bullwhip effect to some extent.

关键词： Vendor Managed Inventory q-learning algorithm Bullwhip Effect

来源：评论

学校读者我要写书评

暂无评论

Privacy-Cost Management in Smart Meters Using Deep Reinforcement learning 10

Privacy-Cost Management in Smart Meters Using Deep Reinforce...

引用

10th IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe) - Smart Grids - Key Enablers of a Green Power System

作者： Shateri, Mohammadhadi Messina, Francisco Piantanida, Pablo Labeau, Fabrice McGill Univ Montreal PQ Canada Univ Paris Sud CNRS Cent Supelec Gif Sur Yvette France

ISBN: (纸本)9781728171005

Smart meters (SMs) play a pivotal rule in the smart grid by being able to report the electricity usage of consumers to the utility provider (UP) almost in real-time. However, this could leak sensitive information about the consumers to the UP or a third-party. Recent works have leveraged the availability of energy storage devices, e.g., a rechargeable battery (RB), in order to provide privacy to the consumers with minimal additional energy cost. In this paper, a privacy-cost management unit (PCMU) is proposed based on a model-free deep reinforcement learning algorithm, called deep double q-learning (DDqL). Empirical results evaluated on actual SMs data are presented to compare DDqL with the state-of-the-art, i.e., classical q-learning (CqL). Additionally, the performance of the method is investigated for two concrete cases where attackers aim to infer the actual demand load and the occupancy status of dwellings. Finally, an abstract information-theoretic characterization is provided.

关键词： Smart meters privacy Privacy-cost trade-off Deep reinforcement learning q-learning algorithm Deep double q-learning Privacy-cost management unit

来源：评论

学校读者我要写书评

暂无评论

Agent-Based Simulation of Power Markets under Uniform and Pay-as-Bid Pricing Rules using Reinforcement learning

Agent-Based Simulation of Power Markets under Uniform and Pa...

引用

IEEE/PES Power Systems Conference and Exposition

作者： Bakirtzis, Anastasios G. Tellidou, Athina C. Aristotle Univ Thessaloniki Dept Elect & Comp Engn Thessaloniki 54124 Greece

ISBN: (纸本)9781424401772

In this paper agent-based simulation is employed to study the power market operation under two alternative pricing systems: uniform and discriminatory (pay-as-bid). Power suppliers are modeled as adaptive agents capable of learning through the interaction with their environment, following a Reinforcement learning algorithm. The SA-q-learning algorithm, a slightly changed version of the popular q-learning, is used in this paper;it proposes a solution to the difficult problem of the balance between exploration and exploitation and it has been chosen for its quick convergence. A test system with five supplier-agents is used to study the suppliers' behavior under the uniform and the pay-as-bid pricing systems.

关键词： electricity spot markets multi-agent modeling pay-as-bid q-learning algorithm uniform pricing

来源：评论

学校读者我要写书评

暂无评论

Hierarchical Multi-agent System in Traffic Network Signalization with Improved Genetic algorithm

Hierarchical Multi-agent System in Traffic Network Signaliza...

引用

IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)

作者： Tan, Min Keng Chuo, Helen Sin Ee Chin, Renee Ka Yin Yeo, Kiam Beng Teo, Kenneth Tze Kin Univ Malaysia Sabah Fac Engn Modelling Simulat & Comp Lab Kota Kinabalu Malaysia Univ Malaysia Sabah Fac Med & Hlth Sci Kota Kinabalu Malaysia

ISBN: (纸本)9781538678138

Instead of using classical offline data-driven optimization technique in traffic network signal control, this work aims to explore the potential of implementing an online data-driven optimization technique. A dynamic modeling technique is proposed using q-learning (qL) algorithm to online observe and learn the inflow-outflow traffic behaviors and extract the model parameters to update the evaluation model used in the fitness function of genetic algorithm (GA). The proposed GA with dynamic modeling is known as dyna-GA. Dyna-GA is then integrated into a hierarchical-based multi-agent traffic signal control system which consists of two layers. The lower-layer consists of several local agents that have autonomy in controlling their local intersection, whereas the upper-layer consists of one supervisory agent that has jurisdiction on all the local agents. The supervisory agent has the superiority in overwriting the local control decision if conflict occurred. The robustness of the proposed dyna-GA under several traffic scenarios is tested using a simulated arterial traffic network. The simulation results show the proposed dyna-GA has better performances in minimizing travel delay as compared to the classical GA which does not have the dynamic model.

关键词： traffic signal control multi-agent system dynamic modeling genetic algorithm q-learning algorithm

来源：评论

学校读者我要写书评

暂无评论

A Reinforcement learning Approach to Dynamic Optimization of Load Allocation in AGC System

A Reinforcement Learning Approach to Dynamic Optimization of...

引用

General Meeting of the IEEE-Power-and-Energy-Society

作者： Wang, Y. M. Liu, q. J. Yu, T. S China Univ Technol Elect Power Coll Guangzhou Guangdong Peoples R China

ISBN: (纸本)9781424442409

A Reinforcement learning (RL) method applied to the dynamic load allocation in AGC system is presented. The problem can be modeled as a Markov Decision Process (MDP). The q-learning algorithm as a model-free learning algorithm is introduced. It learns an optimal action strategy by experience from exploring an unknown system and getting rewards. Rewards are chosen to express how well actions control the system. The applications of the q-learning algorithm to the two-area power system model and China Southern,Power Grid model are presented. The case study shows that the q-learning algorithm enhances the performance of AGC system under CPS.

关键词： Reinforcement learning q-learning algorithm dynamic load allocation MDP CPS

来源：评论

学校读者我要写书评

暂无评论

qoS-Aware Heterogeneous Networking Using Distributed Multiagent Schemes

QoS-Aware Heterogeneous Networking Using Distributed Multiag...

引用

7th IEEE International Wireless Communications and Mobile Computing Conference (IWCMC)

作者： Chen, Jiann-Liang Larosa, Yanuarius Teofilus Deng, Der-Jiunn Yang, Pei-Jia Ma, Yi-Wei Natl Changhua Univ Educ Dept Comp Sci & Informat Engn Taipei Taiwan Natl Taiwan Univ Sci & Technol Dept Elect Engn Taipei Taiwan Natl Cheng Kung Univ Dept Engn Sci Tainan Taiwan

ISBN: (纸本)9781424495375

This study achieves quality-of-Service (qoS) management in heterogeneous networking using a distributed multiagent scheme (DMAS) based on the concept of cooperation and the awareness algorithm. The proposed scheme is developed for supporting qoS management in a user-accepted and cost-effective fashion, which consists of a collection of problem-solving agents with three modules: the knowledge source, the in-cloud blackboard system, and the control engine built into the scheme. A set of problem-solving agents autonomously process local tasks and cooperatively interoperate via an in-cloud blackboard system to guarantee qoS. An awareness algorithm, called the q-learning algorithm, calculates the exceptive rewards of a handoff to all access networks. These rewards are then used by these problem-solving agents to determine what to do. Through operations and cooperation among the active agents, a policy is selected and a user-accepted schedule that meets the specified qoS is generated. Compared with traditional qoS management mechanisms, the proposed DMAS scheme has a 36% lower packet loss ratio in video streaming applications and a 34% lower average delay in VoIP applications with only a minor sacrifice in system computational complexity.

关键词： Heterogeneous Network Cooperative Networking Distributed Multi-Agent Scheme Cloud Computing q-learning algorithm quality of Service

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共20页 << < 11 12 13 14 15 16 17 18 19 20 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：