检索结果-内蒙古大学图书馆

SWG: an architecture for sparse weight gradient computation

science China(Information sciences) 2024年第2期67卷 302-321页

作者： Weiwei WU Fengbin TU Xiangyu LI Shaojun WEI Shouyi YIN School of Integrated Circuits Tsinghua University Department of Electronic and Computer Engineering The Hong Kong University of Science and Technology

On-device training for deep neural networks(DNN) has become a trend due to various user preferences and scenarios. The DNN training process consists of three phases, feedforward(FF), backpropagation(BP), and weight gradient(WG) update. WG takes about one-third of the computation in the whole training process. Current training accelerators usually ignore the special computation property of WG and process it in a way similar to FF/BP. Besides, the extensive data sparsity existing in WG, which brings opportunities to save computation, is not well explored. Nevertheless, exploiting the optimization opportunities would meet three underutilization problems, which are caused by(1) the mismatch between WG data dimensions and hardware parallelism,(2) the full sparsity, i.e., the sparsity of feature map(Fmap),error map(Emap), and gradient, and(3) the workload imbalance resulting from irregular sparsity. In this paper, we propose a specific architecture for sparse weight gradient(SWG) computation. The architecture is designed based on hierarchical unrolling and sparsity-aware(HUSA) dataflow to exploit the optimization opportunities of the special computation property and full data sparsity. In HUSA dataflow, the data dimensions are unrolled hierarchically on the hardware architecture. A valid-data trace(VDT) mechanism is embedded in the dataflow to avoid the underutilization caused by the two-sided input sparsity. The gradient is unrolled in PE to alleviate the underutilization induced by output sparsity while maintaining the data reuse opportunities. Besides, we design an intra-and inter-column balancer(IIBLC) to dynamically tackle the workload imbalance problem resulting from the irregular sparsity. Experimental results show that with HUSA dataflow exploiting the full sparsity, SWG achieves a speedup of 12.23× over state-of-the-art gradient computation architecture, Train Ware. SWG helps to improve the energy efficiency of the state-of-the-art training accelerator LNPU from

关键词： CNN training gradient computation sparsity architecture

来源：评论

学校读者我要写书评

暂无评论

Adversarial Attack on Object Detection via Object Feature-Wise Attention and Perturbation Extraction

引用

清华大学学报自然科学版（英文版） 2025年第3期30卷 1174-1189页

作者： Wei Xue Xiaoyan Xia Pengcheng Wan Ping Zhong Xiao Zheng School of Computer Science and Technology Anhui University of TechnologyMaanshan 243032China National Key Laboratory of Science and Technology on Automatic Target Recognition National University of Defense TechnologyChangsha 410073China

Deep neural networks are commonly used in computer vision tasks,but they are vulnerable to adversarial samples,resulting in poor recognition *** traditional algorithms that craft adversarial samples have been effective in attacking classification models,the attacking performance degrades when facing object detection models with more complex *** address this issue better,in this paper we first analyze the mechanism of multi-scale feature extraction of object detection models,and then by constructing the object feature-wise attention module and the perturbation extraction module,a novel adversarial sample generation algorithm for attacking detection models is ***,in the first module,based on the multi-scale feature map,we reduce the range of perturbation and improve the stealthiness of adversarial samples by computing the noise distribution in the object *** in the second module,we feed the noise distribution into the generative adversarial networks to generate adversarial perturbation with strong attack *** doing so,the proposed approach possesses the ability to better confuse the judgment of detection *** carried out on the DroneVehicle dataset show that our method is computationally efficient and works well in attacking detection models measured by qualitative analysis and quantitative analysis.

关键词： adversarial attack transfer attack object detection generative adversarial networks multi-scale feature map

来源：评论

学校读者我要写书评

暂无评论

Adaptive estimation and control for uncertain nonlinear systems and full actuation control

引用

science China(Information sciences) 2023年第11期66卷 169-184页

作者： Fei YAN Mingyuan ZHANG Guoxiang GU School of Information Science and Technology Southwest Jiaotong University School of Electrical Engineering and Computer Science Louisiana State University

We study adaptive control for a family of nonlinear systems, involving unknown and uncertain parameters. The proposed control law estimates the system parameters adaptively and stabilizes the closedloop system asymptotically for the initial state over any given bounded set of the state-space. Moreover,reconstruction filters are designed to obtain error residue signals and to enable the use of the least-squares algorithm for estimating the parameters, in order to achieve the convergence based on the persistent excitation condition and asymptotic linearization. The proposed methods are applicable to full actuation and under actuation control systems. Simulation studies are carried out for a pendulum system and for a third-order vehicle model, as well as control of vehicle platoons, validating the theoretical results presented in this paper.

关键词： adaptive control asymptotic stabilization full actuation and underactuation least-squares nonlinear systems

来源：评论

学校读者我要写书评

暂无评论

Robust federated contrastive recommender system against targeted model poisoning attack

引用

science China(Information sciences) 2025年第4期68卷 50-65页

作者： Wei YUAN Chaoqun YANG Liang QU Guanhua YE Quoc Viet Hung NGUYEN Hongzhi YIN School of Electrical Engineering and Computer Science The University of Queensland School of Information and Communication Technology Griffith University Deep Neural Computing Company Limited

Federated recommender systems(FedRecs) have garnered increasing attention recently, thanks to their privacypreserving benefits. However, the decentralized and open characteristics of current FedRecs present at least two ***, the performance of FedRecs is compromised due to highly sparse on-device data for each client. Second, the system's robustness is undermined by the vulnerability to model poisoning attacks launched by malicious users. In this paper, we introduce a novel contrastive learning framework designed to fully leverage the client's sparse data through embedding augmentation, referred to as CL4FedRec. Unlike previous contrastive learning approaches in FedRecs that necessitate clients to share their private parameters, our CL4FedRec aligns with the basic FedRec learning protocol, ensuring compatibility with most existing FedRec implementations. We then evaluate the robustness of FedRecs equipped with CL4FedRec by subjecting it to several state-of-the-art model poisoning attacks. Surprisingly, our observations reveal that contrastive learning tends to exacerbate the vulnerability of FedRecs to these attacks. This is attributed to the enhanced embedding uniformity, making the polluted target item embedding easily proximate to popular items. Based on this insight, we propose an enhanced and robust version of CL4FedRec(rCL4FedRec) by introducing a regularizer to maintain the distance among item embeddings with different popularity levels. Extensive experiments conducted on four commonly used recommendation datasets demonstrate that rCL4FedRec significantly enhances both the model's performance and the robustness of FedRecs.

关键词： federated recommender system contrastive learning model poisoning attack and defense

来源：评论

学校读者我要写书评

暂无评论

SmartEagleEye:A Cloud-Oriented Webshell Detection System Based on Dynamic Gray-Box and Deep Learning

引用

Tsinghua science and technology 2024年第3期29卷 766-783页

作者： Xin Liu Yingli Zhang Qingchen Yu Jiajun Min Jun Shen Rui Zhou Qingguo Zhou School of Information Science and Engineering Lanzhou UniversityLanzhou 730000China College of Computer Science and Technology Zhejiang UniversityHangzhou 310058China School of Computing and Information Technology University of WollongongWollongong 2500Australia

Compared with traditional environments,the cloud environment exposes online services to additional vulnerabilities and threats of cyber attacks,and the cyber security of cloud platforms is becoming increasingly prominent.A piece of code,known as a Webshell,is usually uploaded to the target servers to achieve multiple *** Webshell attacks has become a hot spot in current ***,the traditional Webshell detectors are not built for the cloud,making it highly difficult to play a defensive role in the cloud ***,a Webshell detection system based on deep learning that is successfully applied in various scenarios,is proposed in this *** system contains two important components:gray-box and neural network *** gray-box analyzer defines a series of rules and algorithms for extracting static and dynamic behaviors from the code to make the decision *** neural network analyzer transforms suspicious code into Operation Code(OPCODE)sequences,turning the detection task into a classification *** experiment results show that SmartEagleEye achieves an encouraging high detection rate and an acceptable false-positive rate,which indicate its capability to provide good protection for the cloud environment.

关键词： Webshell detection cloud web security deep learning

来源：评论

学校读者我要写书评

暂无评论

Underwater Biological Target Detection Algorithm and Research Based on YOLOv7 Algorithm

IAENG International Journal of Computer Science

引用

IAENG International Journal of computer science 2024年第6期51卷 594-601页

作者： Zhuang, Hongwei Liu, Weisheng School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan China College of Computer Science and Software Engineering University of Science and Technology Liaoning CO Anshan114051 China

Underwater target detection is an important method for detecting marine organisms. However, due to the image occlusion of underwater targets, blurred water quality, poor lighting conditions, small targets, and complex backgrounds, the detection of underwater biological targets has posed significant challenges. In the intricate underwater environment, the conventional feature extraction method has a few drawbacks, including imprecise feature extraction, sluggish detection speed, and inadequate robustness. Consequently, an underwater target detection method based on the enhanced You Only Look Once 7 (YOLOv7) is proposed in this study. The network architecture is reconstructed, and the Deformable Convolutional Network (DCN) modules replace some 3×3 convolutional blocks in the ELAN structure to offset sampling points and reduce background interference. Skip connections and 1× 1 convolutional architecture are added to the DCN module to improve the model’s perception of image details. In addition, Contextual Transformer 3 (COT3) is also incorporated to improve visual performance. Finally, to improve the detection efficiency of small objects, the CIoU loss function is finally replaced by the Normalized Wasserstein Distance (NWD) algorithm. The mAP of DCCN-YOLOv7 on the URPC dataset is 80.4%, according to the experimental results, 2.8% higher than the YOLOv7 network model that is used as a baseline. Furthermore, in contrast to the original YOLOv7 algorithm, the detection speed and accuracy are higher, making it more appropriate for target recognition underwater. © (2024), (International Association of Engineers). All rights reserved.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning

引用

IEEE/CAA Journal of Automatica Sinica 2024年第11期11卷 2346-2348页

作者： Yi Liu Xiang Wu Yuming Bo Jiacun Wang Lifeng Ma the School of Automation Nanjing University of Science and Technology the Department of Computer Science and Software Engineering Monmouth University

Dear Editor,This letter presents a new transfer learning framework for the deep multi-agent reinforcement learning(DMARL) to reduce the convergence difficulty and training time when applying DMARL to a new scenario [1... 详细信息

关键词： Deep agent Framework

来源：评论

学校读者我要写书评

暂无评论

Comprehensive analysis of various imputation and forecasting models for predicting PM2.5 pollutant in Delhi

引用

Neural Computing and Applications 2025年第17期37卷 11441-11458页

作者： Karnati, Hemanth Soma, Anuraag Alam, Adnan Kalaavathi, B. School of Computer Science and Engineering Vellore Institute of Technology Tamil Nadu Vellore632014 India Department of IoT School of Computer Science and Engineering Vellore Institute of Technology Tamil Nadu Vellore632014 India

Understanding and predicting air quality is pivotal for public health and environmental management, especially in urban areas like Delhi. This study utilizes a comprehensive dataset from the Central Pollution Control Board, detailing air pollutant concentrations at the ITO (Income Tax Office), Delhi station, from 2017 to 2023. Our focus on pollutants such as PM2.5, PM10, NO2, NH3, SO2, ozone, and CO highlights the challenges posed by missing data in environmental studies. Through the application of several imputation models, including KNN, linear regression, forward fill + backward fill, Fourier + KNN, linear interpolation, and statistical methods, we identified combination of forward fill and backward fill as the most effective method for addressing data gaps, specifically for ozone measurements. Building on this imputed dataset, we employ various forecasting models to predict PM2.5 levels, a critical pollutant. Our exploration includes time series and deep learning approaches, such as LSTM, Bi-LSTM, CNN-LSTM, GRU, Deep AR, WaveNet, TCN, ARIMA, SARIMA, and Prophet. The performance of these models is evaluated using daily data, with Bi-LSTM and LSTM with attention emerging as the top performers due to their accuracy in predicting PM2.5 concentrations. The implications of our findings extend beyond academic interest, offering practical insights for environmental policy and health advisories in Delhi. By demonstrating the efficacy of specific imputation and forecasting techniques, our research contributes to the broader effort of improving air quality monitoring and prediction. The success of Bi-LSTM and LSTM with attention models, in particular, suggests a promising avenue for future studies aiming to enhance the precision of environmental forecasts. © The Author(s) 2025.

关键词： Time series

来源：评论

学校读者我要写书评

暂无评论

Convolutional Neural Network Image Classification Based on Different Color Spaces

引用

Tsinghua science and technology 2025年第1期30卷 402-417页

作者： Zixiang Xian Rubing Huang Dave Towey Chuan Yue School of Computer Science and Engineering Macao University of Science and TechnologyMacao 999078China School of Computer Science University of Nottingham Ningbo ChinaNingbo 315100China

Although Convolutional Neural Networks(CNNs)have achieved remarkable success in image classification,most CNNs use image datasets in the Red-Green-Blue(RGB)color space(one of the most commonly used color spaces).The existing literature regarding the influence of color space use on the performance of CNNs is *** paper explores the impact of different color spaces on image classification using *** compare the performance of five CNN models with different convolution operations and numbers of layers on four image datasets,each converted to nine color *** find that color space selection can significantly affect classification accuracy,and that some classes are more sensitive to color space changes than *** color spaces may have different expression abilities for different image features,such as brightness,saturation,hue,*** leverage the complementary information from different color spaces,we propose a pseudo-Siamese network that fuses two color spaces without modifying the network *** experiments show that our proposed model can outperform the single-color-space models on most *** also find that our method is simple,flexible,and compatible with any CNN and image dataset.

关键词： color space Convolutional Neural Network(CNN) image classification pseudo-Siamese network

来源：评论

学校读者我要写书评

暂无评论

DNACDS:Cloud IoE big data security and accessing scheme based on DNA cryptography

引用

Frontiers of computer science 2024年第1期18卷 157-170页

作者： Ashish SINGH Abhinav KUMAR Suyel NAMASUDRA School of Computer Engineering KIIT Deemed to be UniversityBhubaneshwar 751024India Department of Computer Science and Engineering Indian Institute of Information Technology SuratSurat 394190India Department of Computer Science and Engineering National Institute of Technology AgartalaAgartala 799046India

The Internet of Everything(IoE)based cloud computing is one of the most prominent areas in the digital big data *** approach allows efficient infrastructure to store and access big real-time data and smart IoE services from the *** IoE-based cloud computing services are located at remote locations without the control of the data *** data owners mostly depend on the untrusted Cloud Service Provider(CSP)and do not know the implemented security *** lack of knowledge about security capabilities and control over data raises several security *** Acid(DNA)computing is a biological concept that can improve the security of IoE big *** IoE big data security scheme consists of the Station-to-Station Key Agreement Protocol(StS KAP)and Feistel cipher *** paper proposed a DNA-based cryptographic scheme and access control model(DNACDS)to solve IoE big data security and access *** experimental results illustrated that DNACDS performs better than other DNA-based security *** theoretical security analysis of the DNACDS shows better resistance capabilities.

关键词： IoE based cloud computing DNA cryptography IoE big data security StS KAP feistel cipher IoE big data access

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：