The disturbance classification of power quality data often suffers from low recognition accuracy because of the data's large volume and the difficulty of feature extraction. This paper proposes a hybrid model based on distributed compressive sensing and a bidirectional long short-term memory (BiLSTM) network to classify power quality disturbances. A cloud-edge collaborative framework is first established with distributed compressive sensing as the edge-computing algorithm: the edge compresses power quality measurements and uploads the dictionary atoms and measurements, which are then analyzed in the cloud, achieving both data transmission with feature extraction and compressed storage of power quality data. For disturbance identification, a new network structure is designed to improve classification accuracy and reduce training time, and the training parameters are optimized with the Deep Deterministic Policy Gradient (DDPG) algorithm from reinforcement learning; the model's noise immunity is analyzed under different scenarios. Finally, simulations on 10 common power quality disturbance signals and 13 complex composite disturbance signals with random noise show that the proposed method overcomes the inadequate feature selection of traditional classification algorithms, improves the model's robustness, and reduces training time to a certain extent.
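The abstract does not reproduce the network details; purely as an illustration of the BiLSTM classifier component, a minimal PyTorch sketch might look like the following (the layer sizes, sequence length, and 10-class output head are assumptions, not taken from the paper):

```python
import torch
import torch.nn as nn

class BiLSTMClassifier(nn.Module):
    """Bidirectional LSTM over a 1-D power quality disturbance signal."""

    def __init__(self, input_size=1, hidden_size=64, num_classes=10):
        super().__init__()
        self.lstm = nn.LSTM(input_size, hidden_size, num_layers=2,
                            batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * hidden_size, num_classes)  # 2x for both directions

    def forward(self, x):           # x: (batch, seq_len, input_size)
        out, _ = self.lstm(x)       # out: (batch, seq_len, 2 * hidden_size)
        return self.fc(out[:, -1])  # class logits from the last time step

model = BiLSTMClassifier()
signal = torch.randn(8, 640, 1)    # batch of 8 signals, 640 samples each (assumed)
logits = model(signal)             # shape (8, 10)
```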
Rigid joint manipulators are limited in their movement and degrees of freedom (DOF), while continuum robots possess a continuous backbone that allows for free movement and multiple DOF. Continuum robots move by bending along a section, taking inspiration from biological manipulators such as tentacles and trunks. This paper presents forward kinematics and velocity kinematics models for a planar continuum robot, along with the application of reinforcement learning (RL) as the control algorithm. We adopt the planar constant-curvature representation for the forward kinematic modeling, chosen for its straightforward implementation and its potential to fill the literature gap in RL-based control of planar continuum robots. Control is achieved with Deep Deterministic Policy Gradient (DDPG), an RL algorithm suited to learning controls in continuous action spaces. Simulation shows that the planar continuum robot can autonomously move from any initial point to any desired goal point within its task space. Based on these results, we recommend future directions for research in continuum robot control, specifically the application of RL algorithms. One potential focus is the integration of sensory feedback, such as vision or force sensing, to improve the robot's ability to navigate complex environments. Additionally, exploring other RL algorithms, such as Proximal Policy Optimization (PPO) or Trust Region Policy Optimization (TRPO), could lead to further advances. Overall, this paper demonstrates the potential of RL-based control for continuum robots and highlights the importance of continued research in this area.
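The constant-curvature representation mentioned above has a well-known closed-form forward kinematics. A minimal sketch for a single planar segment, assuming the segment base sits at the origin with its tangent along the x-axis, is:

```python
import numpy as np

def planar_cc_fk(kappa: float, length: float) -> tuple[float, float, float]:
    """Tip pose (x, y, theta) of a single constant-curvature planar segment.

    Assumes the segment starts at the origin with its tangent along +x.
    The straight-segment limit (kappa -> 0) is handled explicitly.
    """
    if abs(kappa) < 1e-9:
        return length, 0.0, 0.0
    theta = kappa * length               # total bending angle of the arc
    x = np.sin(theta) / kappa
    y = (1.0 - np.cos(theta)) / kappa
    return x, y, theta

# Example: a 0.3 m segment bent with curvature 2.0 m^-1
print(planar_cc_fk(2.0, 0.3))
```

A DDPG agent would then act on the curvature to drive this tip pose toward the goal point in the task space.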
With the rapid development of modern financial technology, the use of machine learning methods to assist enterprises in financial decision-making has become an important trend. This article investigates financial decision-making problems based on the Deep Deterministic Policy Gradient (DDPG) algorithm. By building a simulation environment and integrating the algorithm with a realistic financial decision-making process, the effectiveness and accuracy of DDPG in financial decision-making are verified. The value of the investment portfolio increases steadily over the time steps, from an initial $10,000 to a final $10,985, indicating that the algorithm is effective at optimizing asset portfolios. This trend shows that DDPG can manage assets efficiently in a stable market, thereby preserving and increasing value. The results extend the application of DDPG to unconventional data processing and provide new ideas and methods for research in financial technology and related problems, with both theoretical significance and practical value.
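The abstract does not describe the simulation environment's interface; as an illustration only, a toy gym-style portfolio environment with a log-return reward and the $10,000 starting value mentioned above (everything else assumed) could look like this:

```python
import numpy as np

class PortfolioEnv:
    """Toy trading environment; a stand-in for the paper's simulator."""

    def __init__(self, returns: np.ndarray, start_value: float = 10_000.0):
        self.returns = returns          # (T, n_assets) per-step asset returns
        self.start_value = start_value
        self.reset()

    def reset(self):
        self.t, self.value = 0, self.start_value
        return self.returns[self.t]     # observation: current asset returns

    def step(self, weights: np.ndarray):
        weights = np.clip(weights, 0.0, None)
        weights = weights / (weights.sum() + 1e-12)  # normalize the allocation
        growth = 1.0 + self.returns[self.t] @ weights
        reward = np.log(growth)                      # log-return reward
        self.value *= growth
        self.t += 1
        done = self.t >= len(self.returns) - 1
        return self.returns[self.t], reward, done

# Synthetic returns just to exercise the interface
env = PortfolioEnv(np.random.normal(0.0002, 0.01, size=(250, 3)))
obs = env.reset()
```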
With the rapid development of Software Defined Networking (SDN), efficiently and flexibly managing and allocating network resources has become a key challenge. This article proposes a Deep Deterministic Policy Gradient (DDPG)-based method to dynamically optimize resource allocation in SDN. The DDPG algorithm responds in real time to changes in network state and automatically adjusts resource allocation strategies, thereby improving network performance and quality of service. The study evaluates the dynamic resource allocation ability of the neural-network-based DDPG reinforcement learning algorithm in SDN through four experiments. In the baseline comparison, DDPG reached a network throughput of 95 Mbps, and under varying network loads it maintained 95 Mbps even at high load. In the fault recovery test, DDPG's recovery time was 30 seconds. In the final real-time adjustment evaluation, DDPG demonstrated a fast response time of 1.2 seconds, a post-adjustment throughput of up to 80 Mbps, and a resource utilization rate of 95%. The experimental results show that the DDPG algorithm provides superior performance and flexible resource management in SDN environments.
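As an illustration of how a DDPG actor could map SDN link state to continuous allocation actions, here is a hypothetical actor network; the observation layout (utilization and queue length per link) and the softmax-normalized bandwidth shares are assumptions rather than the paper's design:

```python
import torch
import torch.nn as nn

class AllocatorActor(nn.Module):
    """Hypothetical DDPG actor mapping per-link state to bandwidth shares."""

    def __init__(self, n_links: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 * n_links, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_links),
        )

    def forward(self, state):                  # state: (batch, 2 * n_links)
        # Softmax keeps the continuous action on the allocation simplex
        return torch.softmax(self.net(state), dim=-1)

actor = AllocatorActor(n_links=8)
obs = torch.rand(1, 16)       # e.g. utilization + queue length per link (assumed)
shares = actor(obs)           # bandwidth fractions summing to 1
```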
The need for a safe and reliable transportation system has made the advancement of autonomous vehicles (AVs) increasingly significant. To achieve Level 5 autonomy, as defined by the Society of Automotive Engineers, AVs must be capable of navigating complex and unconventional traffic environments. Path following is a crucial task in autonomous driving, requiring precise and safe navigation along a defined path. Traditional path-tracking methods often rely on parameter tuning or rule-based approaches, which may not suit dynamic and complex environments. Reinforcement learning has emerged as a powerful technique for developing effective control strategies through agent-environment interactions. This study investigates the efficiency of an optimized Deep Deterministic Policy Gradient (DDPG) method for controlling acceleration and steering in AV path following. The algorithm converges rapidly, enabling stable and efficient path tracking, and the trained agent achieves smooth control without extreme actions. The optimized DDPG is compared with the standard DDPG algorithm, with results confirming the improved efficiency of the optimized approach. This advancement could contribute significantly to the development of autonomous driving technology.
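The abstract does not specify what was optimized in DDPG; one standard ingredient of DDPG for continuous steering and acceleration control is Ornstein-Uhlenbeck exploration noise, sketched below with the commonly used default parameters (not taken from the paper):

```python
import numpy as np

class OUNoise:
    """Ornstein-Uhlenbeck process for temporally correlated exploration."""

    def __init__(self, size: int, mu: float = 0.0,
                 theta: float = 0.15, sigma: float = 0.2):
        self.mu, self.theta, self.sigma = mu, theta, sigma
        self.state = np.full(size, mu, dtype=np.float64)

    def reset(self):
        self.state[:] = self.mu

    def sample(self) -> np.ndarray:
        # Mean-reverting step plus Gaussian diffusion
        self.state += self.theta * (self.mu - self.state) \
                      + self.sigma * np.random.standard_normal(self.state.shape)
        return self.state

noise = OUNoise(size=2)                              # [steering, acceleration]
exploratory_action = np.clip(noise.sample(), -1.0, 1.0)
```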