检索结果-内蒙古大学图书馆

Advances of Pipeline Model Parallelism for Deep Learning Training:An Overview

Journal of computer science & technology 2024年第3期39卷 567-584页

作者：关磊李东升梁吉业王文剑葛可适卢锡城 College of Science National University of Defense TechnologyChangsha 410073China College of Computer National University of Defense TechnologyChangsha 410073China School of Computer and Information Technology Shanxi UniversityTaiyuan 030006China CCF IEEE

Deep learning has become the cornerstone of artificial intelligence,playing an increasingly important role in human production and ***,as the complexity of problem-solving increases,deep learning models become increasingly intricate,resulting in a proliferation of large language models with an astonishing number of *** model parallelism(PMP)has emerged as one of the mainstream approaches to addressing the significant challenge of training“big models”.This paper presents a comprehensive review of *** covers the basic concepts and main challenges of *** also comprehensively compares synchronous and asynchronous pipeline schedules for PMP approaches,and discusses the main techniques to achieve load balance for both intra-node and inter-node ***,the main techniques to optimize computation,storage,and communication are presented,with potential research directions being discussed.

关键词： deep learning pipeline schedule load balance multi-GPU system pipeline model parallelism(PMP)

来源：评论

学校读者我要写书评

暂无评论

Rotation-invariant face detection with guided deformable attention

引用

International Journal of Information and Communication technology 2024年第8期25卷 31-48页

作者： Deng, Bin Deng, Guanghui College of Computer Science Hunan University of Technology Hunan Zhuzhou412007 China College of Science Hunan University of Technology Hunan Zhuzhou412007 China

Detecting rotated faces has always been a challenging task. Fixed convolutional kernels struggle to effectively match features after rotation, while the sampling point offsets of deformable convolutions are limited by complex backgrounds. To address this issue, we propose a guided deformable attention (GDA) network. Guiding the offset direction of sampling points by adding constraints of facial structure to deformable convolutions. The GDA network adopts a dual-stream structure, with one branch detecting the inherent structural information for preliminary positioning of the face area;then, the second branch uses deformable convolution to perform pixel-level feature extraction on the face within the range. In addition, we introduce a novel loss, which, during the guidance process, aligns the activation areas in the feature maps extracted by the two branches through the KL divergence. Extensive experimental results validate that GDA network performs excellently on multiple face detection datasets, surpassing the current state-of-the-art face detection methods. © The Author(s) 2024. Published by Inderscience Publishers Ltd.

关键词： Face recognition

来源：评论

学校读者我要写书评

暂无评论

LSTM-Based Model Compression for CAN Security in Intelligent Vehicles

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2024年第12期5卷 6457-6471页

作者： Feng, Yuan Lai, Yingxu Chen, Ye Zhang, Zhaoyi Wei, Jingwen Beijing University of Technology College of Computer Science Beijing100124 China

The rapid deployment and low-cost inference of controller area network (CAN) bus anomaly detection models on intelligent vehicles can drive the development of the Green Internet of Vehicles. Anomaly detection on intelligent vehicles often utilizes recurrent neural network models, but computational resources for these models are limited on small platforms. Model compression is essential to ensure CAN bus security with restricted computing resources while improving model computation efficiency. However, the existence of shared cyclic units significantly constrains the compression of recurrent neural networks. In this study, we propose a structured pruning method for long short-term memory (LSTM) based on the contribution values of shared vectors. By analyzing the contribution value of each dimension of shared vectors, the weight matrix of the model is structurally pruned, and the output value of the LSTM layer is supplemented to maintain the information integrity between adjacent network layers. We further propose an approximate matrix multiplication calculation module that runs in the whole process of model calculation and is deployed in parallel with the pruning module. Evaluated on a realistic public CAN bus dataset, our method effectively achieves highly structured pruning, improves model computing efficiency, and maintains performance stability compared to other compression methods. © 2024 IEEE.

关键词： Computational efficiency

来源：评论

学校读者我要写书评

暂无评论

Hierarchical Retrieval of High-Resolution Fingerprints Based on Pore Feature

Hierarchical Retrieval of High-Resolution Fingerprints Based...

引用

2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024

作者： Ma, Jing Xu, Yuanrong Dong, Suyu Wang, Wei Harbin Institute of Technology School of Computer Science and Technology Shenzhen China Harbin Institute of Technology School of Computer Science and Technology Weihai China Northeast Forestry University College of Computer and Control Engineering Harbin China

ISBN: (纸本)9798350386226

Faced with an escalating number of fingerprint images, most existing retrieval approachs suffer from a common problem: diminishing computational efficiency. This paper presents a hierarchical retrieval system tailored for high-resolution fingerprint images that utilizes abundant pore features and robust recognizability to improve retrieval performance. The framework comprises two core components. Firstly, a CNN-based feature extraction network is established, incorporating an attention mechanism to capture pore features in fingerprint images comprehensively. Subsequently, a hierarchical fingerprint retrieval approach is introduced, involving connection graph construction and a hierarchy of jump table structures for efficient retrieval of query pores. Empirical experiments conducted on high-resolution fingerprint image datasets underscore the system's effectiveness. Compared with other advanced pore-based fingerprint retrieval methods, the proposed method exhibits a notable rise in the hit rate with reduced penetration rates, significantly reducing the retrieval time. © 2024 IEEE.

关键词： Image enhancement

来源：评论

学校读者我要写书评

暂无评论

Detecting Adversarial Fake Tasks in Mobile Crowd sensing Platforms 4

Detecting Adversarial Fake Tasks in Mobile Crowd sensing Pla...

引用

4th International Conference on Communication technology and Information technology, ICCTIT 2024

作者： Zhang, Zhonghuan Harbin University of Science and Technology School of Computer Science and Technology Heilongjiang Harbin150080 China

ISBN: (数字)9798331528973

ISBN: (纸本)9798331528973

The presence of fake tasks in mobile crowd sensing severely affects the normal operation of the platform. Due to the rapid development of deep learning, 'Adversarial Fake Tasks' now have a greater destructive impact than traditional fake tasks. Traditional fake tasks are generated based on experience, while adversarial fake tasks are created using deep learning techniques to mimic real tasks, making them difficult to be detected. This paper proposes a 'Spatial attention mechanism guided by distance' based on the multi-head self-attention mechanism to counter the threat of adversarial fake tasks in mobile crowdsensing platforms. This module captures the distance correlation between features of generated fake tasks and designs a detection model called Self Distance-Attention Convolution Crowdsensing Network (SDACC). Using SDACC as a detector, the detection steps based on the adversarial training defense strategy are as follows: firstly, utilize the generation model-SeqGAN to learn the distribution of real tasks and generate adversarial fake tasks;then train SDACC to detect adversarial fake tasks. Experiments on the MCS task datasets, the KDD Cup99 dataset, and the CICIDS-2017 dataset demonstrate that SDACC outperforms traditional detection algorithms in the MCS fake task detection domain, with average ACC, F1, and AUC reaching 0.9357,0.9332, and 0.9786 respectively. © 2024 IEEE.

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

Dynamic Cloth Folding Using Curriculum Learning

引用

Journal of Shanghai Jiaotong university (science) 2024年 1-10页

作者： Li, Mingyang Bao, Hujun Huang, Jin College of Computer Science and Technology Zhejiang University Hangzhou310058 China

This paper presents a novel algorithm for training robotic arms to manipulate cloth, by leveraging reinforcement learning and curriculum learning approaches. Traditional cloth manipulation algorithms rely heavily on predefined action primitives and assumptions about cloth dynamics, introducing significant prior knowledge. To circumvent this limitation, we utilize reinforcement learning to train our cloth folding agent. To fully utilize the advantage of reinforcement learning, we propose a semi-sparse reward function incorporating folding accuracy and a curriculum scheme to accelerate training and improve policy stability. We validate the proposed method by implementing it in the StableBaselines3 framework and training the agent using the soft actor critic algorithm in our virtual environment based on physical-based cloth simulator. Our results demonstrate the benefits of the curriculum learning scheme which increases sample efficiency and accelerates training process compared with previous reinforcement learning cloth manipulation method. © Shanghai Jiao Tong university 2024.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

GTE: learning code AST representation efficiently and effectively

引用

science China(Information sciences) 2025年第3期68卷 393-394页

作者： Yihao QIN Shangwen WANG Bo LIN Kang YANG Xiaoguang MAO College of Computer Science and Technology National University of Defense Technology Key Laboratory of Software Engineering for Complex Systems National University of Defense Technology

With the development of deep learning in recent years, code representation learning techniques have become the foundation of many software engineering tasks such as program classification [1] and defect detection. Earlier approaches treat the code as token sequences and use CNN, RNN, and the Transformer models to learn code representations.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Hybrid compression for LSTM-based encrypted traffic classification model

引用

International Journal of Wireless and Mobile Computing 2024年第1期26卷 61-73页

作者： Mu, Qiaoxu Zhang, Meng College of Computer Science and Technology Jilin University Jilin Changchun China

Traditional techniques for network traffic classification are no longer effective in handling the complexities of dynamic network environments. Moreover, deep learning methods, while powerful, demand substantial spatial and computational resources, resulting in increased latency and instability. In this paper, we propose an innovative approach to network traffic classification utilising an LSTM structure. This approach incorporates network pruning, knowledge refinement, and Generative Adversarial Networks (GAN) to reduce model size, accelerate training speed without compromising accuracy, and address challenges associated with unbalanced datasets in classification problems. Our methodology involves the pruning of unimportant filters from the teacher model, followed by retraining and knowledge distillation to generate the student model. Experimental show that the size of the pruned teacher model is only 25.69% of the original, resulting in a noteworthy 28.16% improvement in training speed. Additionally, the classification performance of various unbalanced traffic categories, such as VoIP and streaming, shows significant enhancement. © 2024 Inderscience Publishers. All rights reserved.

关键词： Distillation

来源：评论

学校读者我要写书评

暂无评论

SpikingMiniLM: energy-efficient spiking transformer for natural language understanding

引用

science China(Information sciences) 2024年第10期67卷 115-128页

作者： Jiayu ZHANG Jiangrong SHEN Zeke WANG Qinghai GUO Rui YAN Gang PAN Huajin TANG College of Computer Science and Technology Zhejiang University The State Key Lab of Brain-Machine Intelligence Zhejiang University Collaborative Innovation Center of Artificial Intelligence Zhejiang University Advanced Computing and Storage Laboratory Huawei Technologies Co. Ltd. College of Computer Science and Technology Zhejiang University of Technology MOE Frontier Science Center for Brain Science and Brain-Machine Integration Zhejiang University

In the era of large-scale pretrained models, artificial neural networks（ANNs） have excelled in natural language understanding（NLU） tasks. However, their success often necessitates substantial computational resources and energy consumption. To address this, we explore the potential of spiking neural networks（SNNs） in NLU——a promising avenue with demonstrated advantages, including reduced power consumption and improved efficiency due to their event-driven characteristics. We propose the SpikingMiniLM,a novel spiking Transformer model tailored for natural language understanding. We first introduce a multi-step encoding method to convert text embeddings into spike trains. Subsequently, we redesign the attention mechanism and residual connections to make our model operate on the pure spike-based paradigm without any normalization technique. To facilitate stable and fast convergence, we propose a general parameter initialization method grounded in the stable firing rate principle. Furthermore, we apply an ANN-to-SNN knowledge distillation to overcome the challenges of pretraining SNNs. Our approach achieves a macro-average score of 75.5 on the dev sets of the GLUE benchmark, retaining 98% of the performance exhibited by the teacher model MiniLMv2. Our smaller model also achieves similar performance to BERTMINIwith fewer parameters and much lower energy consumption, underscoring its competitiveness and resource efficiency in NLU tasks.

关键词： spiking neural networks natural language understanding spiking Transformer spike-based attention multi-step encoding ANN-to-SNN distillation

来源：评论

学校读者我要写书评

暂无评论

Event-based nonsingular fixed-time containment control for nonlinear multiagent systems with dynamic uncertainties

引用

science China(Information sciences) 2025年第5期68卷 413-414页

作者： Yuanbo SU Qihe SHAN Tieshan LI C.L.Philip CHEN Navigation College Dalian Maritime University School of Automation Engineering University of Electronic Science and Technology of China School of Computer Science and Engineering South China University of Technology

Owing to the extensive applications in many areas such as networked systems,formation flying of unmanned air vehicles,and coordinated manipulation of multiple robots,the distributed containment control for nonlinear multiagent systems (MASs) has received considerable attention,for example [1,2].Although the valued studies in [1,2] investigate containment control problems for MASs subject to nonlinearities,the proposed distributed nonlinear protocols only achieve the asymptotic *** a crucial performance indicator for distributed containment control of MASs,the fast convergence is conducive to achieving better control accuracy [3].The work in [4] first addresses the backstepping-based adaptive fuzzy fixed-time containment tracking problem for nonlinear high-order MASs with unknown external ***,the designed fixedtime control protocol [4] cannot escape the singularity problem in the backstepping-based adaptive control *** is well known,the singularity problem has become an inherent problem in the adaptive fixed-time control design,which may cause the unbounded control inputs and even the instability of controlled ***,how to solve the nonsingular fixed-time containment control problem for nonlinear MASs is still open and awaits breakthrough to the best of our knowledge.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：