We offer a new in-depth investigation of global path planning (GPP) for unmanned ground vehicles, specifically an autonomous mining sampling robot named ROMIE. GPP is essential for ROMIE's optimal performance and is translated into solving the traveling salesman problem, a complex graph-theory challenge that is crucial for determining the most effective route to cover all sampling locations in a mining field. This problem is central to enhancing ROMIE's operational efficiency and competitiveness against human labor by optimizing cost and time. The primary aim of this research is to advance GPP by developing, evaluating, and improving a cost-efficient software and web application. We delve into an extensive comparison and analysis of Google operations research (OR)-Tools optimization algorithms. Our study is driven by the goal of applying and testing the limits of OR-Tools' capabilities by integrating reinforcement learning techniques for the first time. This enables us to compare these methods with OR-Tools, assessing their computational effectiveness and real-world application efficiency. Our analysis seeks to provide insights into the effectiveness and practical application of each technique. Our findings indicate that q-learning stands out as the optimal strategy, demonstrating superior efficiency by deviating only 1.2% on average from the optimal solutions across our datasets. Advancing the global path planning algorithm is studied as a means of transforming geochemical mining sampling with autonomous vehicles. Cutting-edge algorithms are harnessed to solve the intricate traveling salesman problem, optimizing route efficiency. A novel analysis of operations research tools and reinforcement learning techniques is presented, demonstrating q-learning's superior efficiency (codes provided for benchmarking). Technological advancements set a new benchmark for autonomous mining operations. (c) 2024 WILEY-VCH GmbH
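As a rough illustration of the OR-Tools side of this comparison, the sketch below solves a toy TSP over four sampling locations with Google OR-Tools' routing solver. The distance matrix, search parameters, and time limit are invented for illustration and are not the paper's configuration.

```python
# Minimal sketch: TSP over sampling locations with Google OR-Tools.
from ortools.constraint_solver import pywrapcp, routing_enums_pb2

# Illustrative symmetric distance matrix for four sampling points.
distance_matrix = [
    [0, 12, 9, 20],
    [12, 0, 7, 15],
    [9, 7, 0, 11],
    [20, 15, 11, 0],
]

manager = pywrapcp.RoutingIndexManager(len(distance_matrix), 1, 0)  # 1 vehicle, depot 0
routing = pywrapcp.RoutingModel(manager)

def distance_callback(from_index, to_index):
    # Map solver indices back to node indices before looking up distances.
    return distance_matrix[manager.IndexToNode(from_index)][manager.IndexToNode(to_index)]

transit_idx = routing.RegisterTransitCallback(distance_callback)
routing.SetArcCostEvaluatorOfAllVehicles(transit_idx)

params = pywrapcp.DefaultRoutingSearchParameters()
params.first_solution_strategy = routing_enums_pb2.FirstSolutionStrategy.PATH_CHEAPEST_ARC
params.local_search_metaheuristic = routing_enums_pb2.LocalSearchMetaheuristic.GUIDED_LOCAL_SEARCH
params.time_limit.FromSeconds(1)

solution = routing.SolveWithParameters(params)
index = routing.Start(0)
route = []
while not routing.IsEnd(index):
    route.append(manager.IndexToNode(index))
    index = solution.Value(routing.NextVar(index))
route.append(manager.IndexToNode(index))
print("route:", route, "cost:", solution.ObjectiveValue())
```

On a real field, the matrix would be built from the sampling locations' coordinates, and the time limit would be tuned against the q-learning baseline the paper benchmarks.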
ISBN (digital): 9781665403870
ISBN (print): 9781665403870
As an important component of intelligent manufacturing, the logistics AGV has drawn the attention of many scholars to its path planning problem. At present, path planning algorithms based on reinforcement learning suffer from slow convergence and unstable results: to obtain a better return function, the logistics AGV needs to perform different actions to gain more experience and information. To balance exploration and exploitation, the traditional q-learning algorithm introduces an exploration factor as a probability value into the AGV's action-selection strategy and otherwise selects the state-action pair with the largest q-value each time. This makes the system prone to falling into a local optimum, slows down the convergence rate of the whole process, and causes fluctuations in the final action-selection results. To solve this problem, this paper proposes an improved strategy that dynamically adjusts the exploration factor epsilon, i.e., different values of epsilon are chosen at different stages of reinforcement learning, which better resolves the contradiction between exploration and exploitation. Simulations and real experiments prove that the improved reinforcement learning algorithm converges faster and that the stability of the convergence result is improved.
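A minimal sketch of the idea, assuming a toy grid environment and an invented three-stage epsilon schedule (the paper's actual schedule, environment, and hyperparameters are not reproduced here):

```python
# Sketch: tabular q-learning with a stage-dependent exploration factor.
import numpy as np

n_states, n_actions, n_episodes = 25, 4, 500
alpha, gamma = 0.1, 0.95
Q = np.zeros((n_states, n_actions))
moves = [-1, 1, -5, 5]  # the four actions shift the state index on a 5x5 grid

def epsilon(episode):
    # Dynamic exploration factor: explore widely early, exploit mostly late.
    if episode < 150:
        return 0.9
    if episode < 350:
        return 0.3
    return 0.05

def step(state, action):
    # Placeholder environment with the goal at the last state.
    next_state = min(max(state + moves[action], 0), n_states - 1)
    reward = 1.0 if next_state == n_states - 1 else -0.01
    return next_state, reward, next_state == n_states - 1

for episode in range(n_episodes):
    state, done, steps = 0, False, 0
    while not done and steps < 200:
        if np.random.rand() < epsilon(episode):
            action = np.random.randint(n_actions)   # explore
        else:
            action = int(np.argmax(Q[state]))       # exploit the largest q-value
        next_state, reward, done = step(state, action)
        Q[state, action] += alpha * (reward + gamma * Q[next_state].max() - Q[state, action])
        state = next_state
        steps += 1
```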
In this paper we introduce a new approach to discrete-time semi-Markov decision processes based on the sojourn time process. Different characterizations of discrete-time semi-Markov processes are exploited, and decision processes are constructed by their means. With this new approach, the agent is allowed to consider different actions depending also on the sojourn time of the process in the current state. A numerical method based on q-learning algorithms for finite-horizon reinforcement learning and stochastic recursive relations is investigated. Finally, we consider two toy examples: one in which the reward depends on the sojourn time, in accordance with the gambler's fallacy; the other in which the environment is semi-Markov even though the reward function does not depend on the sojourn time. These are used to carry out numerical evaluations of the previously presented q-learning algorithm and of a different naive method based on deep reinforcement learning.
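A minimal sketch of finite-horizon q-learning on a sojourn-time-augmented state, with invented semi-Markov dynamics in which the probability of leaving a state depends on the time already spent there (the paper's exact recursion and examples are not reproduced):

```python
# Sketch: finite-horizon q-learning where the state includes the sojourn time.
import random
from collections import defaultdict

H = 10                       # finite horizon
actions = [0, 1]
alpha, gamma = 0.2, 1.0
# One q-table per decision epoch; keys are (state, sojourn_time) pairs.
Q = [defaultdict(lambda: [0.0, 0.0]) for _ in range(H)]

def env_step(state, sojourn, action):
    # Illustrative semi-Markov dynamics: the chance of leaving a state
    # grows with the sojourn time already spent in it.
    stay_prob = max(0.1, 0.8 - 0.1 * sojourn)
    if random.random() < stay_prob:
        return state, sojourn + 1, -0.1            # remain; sojourn clock ticks
    return 1 - state, 1, 1.0 if action == 1 else 0.0

for episode in range(2000):
    state, sojourn = 0, 1
    for t in range(H):
        key = (state, sojourn)
        if random.random() < 0.2:
            a = random.choice(actions)              # explore
        else:
            a = Q[t][key].index(max(Q[t][key]))     # exploit
        ns, nsoj, r = env_step(state, sojourn, a)
        # Backward value at the horizon is just the immediate reward.
        target = r if t == H - 1 else r + gamma * max(Q[t + 1][(ns, nsoj)])
        Q[t][key][a] += alpha * (target - Q[t][key][a])
        state, sojourn = ns, nsoj
```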
ISBN (print): 9798400709418
This paper looks at interference-aware spectrum allocation in 6G cellular networks. A novel dynamic resource-sharing algorithm is proposed, aiming to use the available spectral resources efficiently and to minimize the interference common to multiple technologies or users. The proposed algorithm consists of two sub-algorithms: the first phase is a channel-selection algorithm, which selects the best channels for each licensee based on their signal-to-interference-plus-noise ratio (SINR) and interference levels. The second phase is an optimization algorithm, which promotes the most valuable spectrum access and resource allocation in line with the interference requirements of the specific user or network. The proposed algorithm follows the access strategy described in the 3GPP 5G-NR specification, in which fast spectrum allocation and resource-sharing principles across multiple licensees are used to maximize spectrum usage. Results suggest that the algorithm can achieve effective spectrum utilization while providing high levels of interference mitigation. The proposed system offers a promising technique to enhance 6G spectrum allocation and is expected to be an attractive solution for operators seeking to deploy a dynamic, interference-resistant communications service.
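A rough sketch of the first, SINR-based channel-selection phase as a greedy assignment; the power, interference, and noise values and the one-channel-per-user rule are illustrative assumptions, not the paper's algorithm:

```python
# Sketch: greedy SINR-based channel selection across users.
import numpy as np

rng = np.random.default_rng(0)
n_users, n_channels = 4, 6
signal = rng.uniform(1.0, 5.0, (n_users, n_channels))        # received signal power
interference = rng.uniform(0.1, 1.0, (n_users, n_channels))  # co-channel interference
noise = 0.05

sinr = signal / (interference + noise)
assignment, taken = {}, set()
# Users pick channels in order of their best achievable SINR, one channel each,
# so high-interference channels are avoided whenever alternatives exist.
for user in np.argsort(-sinr.max(axis=1)):
    best = max((c for c in range(n_channels) if c not in taken),
               key=lambda c: sinr[user, c])
    assignment[int(user)] = best
    taken.add(best)
print(assignment)
```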
One desired aspect of microservice architecture is the ability to self-adapt its own architecture and behavior in response to changes in the operational environment. To achieve the desired high levels of self-adaptability, this research implements a distributed microservice architecture model running on a swarm cluster, as informed by the Monitor, Analyze, Plan, and Execute over a shared Knowledge (MAPE-K) model. The proposed architecture employs multiple adaptation agents supported by a centralized controller, which can observe the environment and execute a suitable adaptation action. The adaptation planning is managed by a deep recurrent q-learning network (DRqN). It is argued that such integration between DRqN and Markov decision process (MDP) agents in a MAPE-K model offers a distributed microservice architecture with self-adaptability and high levels of availability and scalability. Integrating DRqN into the adaptation process improves the effectiveness of the adaptation and reduces adaptation risks, including resource overprovisioning and thrashing. The performance of DRqN is evaluated against deep q-learning and policy gradient algorithms, including (1) a deep q-learning network (DqN), (2) a dueling DqN (DDqN), (3) a policy gradient neural network, and (4) deep deterministic policy gradient. The DRqN implementation in this paper outperforms the aforementioned algorithms in terms of total reward, adaptation time, error rate, and convergence and training time. We strongly believe that DRqN is more suitable for driving the adaptation in a distributed service-oriented architecture and offers better performance than other dynamic decision-making algorithms.
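For orientation, a minimal PyTorch sketch of the recurrent q-network at the core of such a DRqN: an LSTM over a window of monitored metrics followed by a q-value head. Layer sizes, the observation window, and the four-action space are illustrative; the MAPE-K integration and training loop are omitted.

```python
# Sketch: recurrent q-network (LSTM + q-value head) for adaptation planning.
import torch
import torch.nn as nn

class DRQN(nn.Module):
    def __init__(self, obs_dim=8, hidden_dim=64, n_actions=4):
        super().__init__()
        self.lstm = nn.LSTM(obs_dim, hidden_dim, batch_first=True)
        self.q_head = nn.Linear(hidden_dim, n_actions)

    def forward(self, obs_seq, hidden=None):
        # obs_seq: (batch, seq_len, obs_dim) window of monitored metrics.
        out, hidden = self.lstm(obs_seq, hidden)
        q_values = self.q_head(out[:, -1])  # q-values from the last time step
        return q_values, hidden

# Example: pick an adaptation action (e.g., scale out/in, migrate, no-op)
# from a 10-step window of 8 monitored metrics per step.
net = DRQN()
obs = torch.randn(1, 10, 8)
q, _ = net(obs)
action = int(q.argmax(dim=1))
```

The recurrence is the design point: by summarizing a history of observations, the network can act on trends (e.g., sustained load growth) that a feedforward DqN observing a single snapshot would miss.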
ISBN (print): 9781424480166
This paper deals with mobile-centered decision making in heterogeneous networks, where intelligent mobile terminals take autonomous decisions about the JRRM actions, which consist of connecting to one of the available systems. This distributed decision making is made possible by q-learning algorithms implemented within the mobile terminals, which enable them to profit from their past experience in order to enhance their subsequent decisions. We develop an original Markovian model that allows us to analytically study the evolution of the q-learning process, and we show how the performance is enhanced until convergence.
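A toy sketch of such terminal-side learning, here reduced to a stateless q-update over the candidate systems with invented reward statistics (the paper's Markovian model is richer than this):

```python
# Sketch: a mobile terminal learning which system to attach to from experience.
import random

systems = ["WLAN", "cell_A", "cell_B"]          # hypothetical available systems
q = {s: 0.0 for s in systems}
alpha, eps = 0.1, 0.1

def observed_reward(system):
    # Placeholder for the QoS the terminal actually measures after attaching.
    means = {"WLAN": 5.0, "cell_A": 2.0, "cell_B": 3.0}
    return random.gauss(means[system], 1.0)

for session in range(1000):
    if random.random() < eps:
        choice = random.choice(systems)          # occasionally try another system
    else:
        choice = max(q, key=q.get)               # otherwise use past experience
    q[choice] += alpha * (observed_reward(choice) - q[choice])

print(max(q, key=q.get))  # the system the terminal has learned to prefer
```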