Search results - Inner Mongolia University Library

Rotation-invariant face detection with guided deformable attention

International Journal of Information and Communication Technology, 2024, Vol. 25, No. 8, pp. 31-48

Authors: Deng, Bin; Deng, Guanghui. College of Computer Science, Hunan University of Technology, Zhuzhou, Hunan 412007, China; College of Science, Hunan University of Technology, Zhuzhou, Hunan 412007, China

Detecting rotated faces has always been a challenging task. Fixed convolutional kernels struggle to effectively match features after rotation, while the sampling point offsets of deformable convolutions are limited by complex backgrounds. To address this issue, we propose a guided deformable attention (GDA) network, which guides the offset direction of sampling points by adding facial-structure constraints to deformable convolutions. The GDA network adopts a dual-stream structure: one branch detects the inherent structural information for preliminary positioning of the face area; the second branch then uses deformable convolution to perform pixel-level feature extraction on the face within that range. In addition, we introduce a novel loss which, during the guidance process, aligns the activation areas in the feature maps extracted by the two branches through the KL divergence. Extensive experimental results validate that the GDA network performs excellently on multiple face detection datasets, surpassing the current state-of-the-art face detection methods. © The Author(s) 2024. Published by Inderscience Publishers Ltd.

Keywords: Face recognition
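
The abstract above mentions a loss that aligns the activation areas of the two branches through the KL divergence, without giving its exact form. The following is only a minimal sketch of that idea, assuming each activation map is flattened and normalized into a discrete distribution and that the divergence is taken from the structure branch to the deformable branch. The function names, the `eps` smoothing term, and the direction of the divergence are illustrative assumptions, not the authors' formulation.

```python
import math


def to_distribution(activation_map, eps=1e-12):
    """Flatten a 2D activation map and normalize it so the values sum to 1.

    Negative activations are clipped to zero; eps (an assumption) keeps every
    entry strictly positive so the logarithm below is always defined.
    """
    flat = [max(v, 0.0) + eps for row in activation_map for v in row]
    total = sum(flat)
    return [v / total for v in flat]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def alignment_loss(structure_map, deformable_map):
    """Hypothetical alignment term: KL between the two branches' activations."""
    p = to_distribution(structure_map)
    q = to_distribution(deformable_map)
    return kl_divergence(p, q)


if __name__ == "__main__":
    structure = [[0.0, 1.0], [1.0, 0.0]]
    deformable = [[1.0, 0.0], [0.0, 1.0]]
    print(alignment_loss(structure, structure))   # identical maps give zero
    print(alignment_loss(structure, deformable))  # mismatched maps give a positive value
```

The loss is zero when both branches activate on the same regions and grows as the activation areas drift apart, which is the behavior the abstract attributes to the guidance process.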

Trade-off Performance and Energy Efficiency by Optimizing the Data Flow for PIM Architectures

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024, Vol. 44, No. 7, pp. 2530-2543

Authors: Zhao, Yunping; Ma, Sheng; Tang, Yuhua; Liu, Hengzhu; Li, Dongsheng. The College of Computer Science and Technology, National University of Defense Technology, Hunan, China

The Processing-In-Memory (PIM) architecture has become a promising candidate for deep learning accelerators by integrating computation and memory. Most PIM-based studies improve performance and energy efficiency by using the Weight Stationary (WS) data flow due to its high parallelism. However, the WS data flow has some fundamental limitations. First, the WS data flow incurs huge activation movements between on-chip memory and off-chip memory due to the limited memory space of the ReRAM array. Second, the WS data flow needs to read the input activation repeatedly according to the convolution window. These data movements decrease the energy efficiency and performance of the PIM architecture. To address these issues, the IS data flow stores activations instead of weights to reduce data movements. But the IS data flow faces some challenges. First, the data dependency between adjacent layers limits the performance. Second, there are huge across-array computations due to the special mapping method. Third, the previous IS data flow cannot realize high parallelism. Fourth, the IS data flow depends on the three-dimensional (3D) ReRAM structure. To address these issues, we propose a novel data flow for PIM architectures. We optimize the IS data flow to decrease the activation movement and propose a parallel computing method to realize high parallelism and reduce the across-array computations. We identify and analyze the fundamental limitations and impact of different inter-layer data flows, including the WS-WS, IS-IS, WS-IS, and IS-WS. We also propose a method to build a hybrid data flow by combining these inter-layer data flows to trade off performance and energy consumption. Our experimental results and analysis demonstrate the potential of our design. The performance and energy efficiency of our design reach 0.13 TFLOPS∼1.77 TFLOPS and 61 TOPS/J∼85 TOPS/J, respectively. Compared to the state-of-the-art design, the NEBULA, our design can improve performance by 1.4×, 2.

Keywords: Data streams

Coarse-to-fine lightweight meta-embedding for ID-based recommendations

Science China (Information Sciences), 2025, Vol. 68, No. 4, pp. 82-97

Authors: Yang WANG; Haipeng LIU; Zeqian YI; Biao QIAN; Meng WANG. School of Computer Science and Information Engineering, Hefei University of Technology; College of Information and Intelligence, Hunan Agricultural University

State-of-the-art recommender systems are increasingly focused on optimizing implementation efficiency, such as enabling on-device recommendations under memory constraints. Current methods commonly use lightweight embeddings for users and items or employ compact embeddings to enhance reusability and reduce memory usage. However, these approaches consider only the coarse-grained aspects of embeddings, overlooking subtle semantic nuances. This limitation results in an adversarial degradation of meta-embedding performance, impeding the system's ability to capture intricate relationships between users and items and leading to suboptimal recommendations. To address this, we propose a novel approach to efficiently learn meta-embeddings of varying granularity and apply fine-grained meta-embeddings to strengthen the representation of their coarse-grained counterparts. Specifically, we introduce a recommender system based on a graph neural network, where each user and item is represented as a node. These nodes are directly connected to coarse-grained virtual nodes and indirectly linked to fine-grained virtual nodes, facilitating learning of multi-grained semantics. Fine-grained semantics are captured through sparse meta-embeddings, which dynamically balance embedding uniqueness and memory constraints. To ensure their sparseness, we rely on initialization methods such as sparse principal component analysis combined with a soft thresholding activation function. Moreover, we propose a weight-bridging update strategy that aligns the coarse-grained meta-embedding with several fine-grained meta-embeddings based on the underlying semantic properties of users and items. Comprehensive experiments demonstrate that our method outperforms existing baselines. The code of our proposal is available at https://***/htyjers/C2F-MetaEmbed.

Keywords: lightweight meta-embedding; coarse-to-fine learning; ID-based recommendations
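
The abstract above relies on a soft thresholding activation to keep the fine-grained meta-embeddings sparse. As a rough illustration only, the standard soft-thresholding operator shrinks every entry toward zero by a threshold and zeroes out anything smaller than it. The threshold name `lam` and the list-based embedding are assumptions made for this sketch; the paper's actual parameterization is not given in the abstract.

```python
import math


def soft_threshold(x, lam):
    """Standard soft-thresholding: sign(x) * max(|x| - lam, 0)."""
    magnitude = abs(x) - lam
    if magnitude <= 0.0:
        return 0.0
    return math.copysign(magnitude, x)


def sparsify(embedding, lam):
    """Apply soft thresholding element-wise to a (hypothetical) embedding vector."""
    return [soft_threshold(v, lam) for v in embedding]


if __name__ == "__main__":
    # Entries with magnitude at or below lam become exactly zero.
    print(sparsify([1.5, -0.25, 0.5, -1.5], lam=0.5))
```

Because small entries become exactly zero rather than merely small, the resulting vector can be stored sparsely, which matches the memory motivation stated in the abstract.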

Semi-Supervised Multi-Task Deep Learning for WiFi Fingerprint Database Construction in Building-Scale Localization

IEEE Transactions on Consumer Electronics, 2024, Vol. 71, No. 1, pp. 488-500

Authors: Wang, Chun; Luo, Juan; Yin, Luxiu; Li, Chuang; Huang, Wenbin; Liang, Wei; Li, Kuan-Ching. Hunan Normal University, College of Information Science and Engineering, China; Hunan University, College of Computer Science and Electronic Engineering, China; Hunan University of Science and Technology, School of Computer Science and Engineering, China; Nanjing University of Information Science and Technology, School of Computer Science, China

WiFi-based indoor positioning has emerged as a crucial technology for enabling smart consumer electronic applications, particularly in large-scale buildings. The construction of WiFi fingerprint databases using received signal strength (RSS) is foundational due to its widespread deployment. However, achieving high positioning accuracy typically requires labor-intensive and time-consuming site surveys. While recent crowdsourcing methods have facilitated the collection of numerous RSS samples, these samples frequently lack labels and reliability in multi-scale building *** this paper, we design a novel semi-supervised and multi-task mean-teacher model (MTMT-DNN) to annotate unlabeled, crowdsourced multi-scale fingerprint samples. This method enables the construction of a comprehensive fingerprint database without requiring intensive manual effort or compromising positioning accuracy. Our key idea is to first develop a multi-task Deep Neural Network (MT-DNN) for simultaneously annotating building, floor, and intra-floor coordinate labels by leveraging their complementary information. Then we employ mean-teacher semi-supervised learning to leverage additional unlabeled fingerprint data, further improving the annotation performance and reducing manual effort. Finally, we train the MTMT-DNN model by developing two multi-task loss functions and ensuring consistency between them, thereby enhancing the reliability of the annotated crowdsourced fingerprints. We conducted real-world experiments in a 20,000 m² site encompassing three multi-story buildings. The results demonstrate that our proposed method significantly reduces the workload of manually collecting labeled fingerprint samples. With only 20% of labeled fingerprints collected, we achieve 99% average annotation accuracy for building and floor labels and an average coordinate annotation error within 4 m. © 1975-2011 IEEE.

Keywords: Floors
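
The abstract above builds on mean-teacher semi-supervised learning. In the generic mean-teacher scheme, the teacher's weights are an exponential moving average (EMA) of the student's weights, and a consistency term penalizes disagreement between the two models' predictions on unlabeled samples. The sketch below shows only those two generic pieces on plain lists; the decay `alpha`, the mean-squared consistency term, and the function names are assumptions, and the MTMT-DNN architecture and its two multi-task losses are not reproduced here.

```python
def ema_update(teacher_weights, student_weights, alpha=0.99):
    """Move each teacher weight toward the student weight by a factor (1 - alpha)."""
    return [
        alpha * t + (1.0 - alpha) * s
        for t, s in zip(teacher_weights, student_weights)
    ]


def consistency_loss(student_pred, teacher_pred):
    """Mean squared difference between student and teacher predictions."""
    diffs = [(s - t) ** 2 for s, t in zip(student_pred, teacher_pred)]
    return sum(diffs) / len(diffs)


if __name__ == "__main__":
    teacher = [1.0, 0.0]
    student = [0.0, 1.0]
    # With alpha = 0.5 the teacher lands halfway between the two weight vectors.
    print(ema_update(teacher, student, alpha=0.5))
    # Predictions for one unlabeled fingerprint (hypothetical values).
    print(consistency_loss([1.0, 2.0], [1.0, 4.0]))
```

In a training loop, the student would be updated by gradient descent on the supervised loss plus the consistency term, and `ema_update` would be called after each step to refresh the teacher.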

STDNet: A Spatio-Temporal Decomposition Neural Network for Multivariate Time Series Forecasting

Tsinghua Science and Technology, 2024, Vol. 29, No. 4, pp. 1232-1247

Authors: Zhuolun Jiang; Zefei Ning; Hao Miao; Li Wang. College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Jinzhong 030600, China

Long-term multivariate time series forecasting is an important task in engineering applications. It helps grasp the future development trend of data in real time, which is of great significance for a wide variety of fields. Due to the non-linear and unstable characteristics of multivariate time series, existing methods encounter difficulties in analyzing complex high-dimensional data and capturing latent relationships between variables in time series, which degrades the performance of long-term prediction. In this paper, we propose a novel time series forecasting model based on the multilayer perceptron that combines spatio-temporal decomposition and doubly residual stacking, namely the Spatio-Temporal Decomposition Neural Network (STDNet). We decompose the originally complex and unstable time series into two parts, a temporal term and a spatial term. We design a temporal module based on the auto-correlation mechanism to discover temporal dependencies at the sub-series level, and a spatial module based on a convolutional neural network and the self-attention mechanism to integrate multivariate information from two dimensions, global and local, respectively. Then we integrate the results obtained from the different modules to get the final forecast. Extensive experiments on four real-world datasets show that STDNet significantly outperforms other state-of-the-art methods, providing an effective solution for long-term time series forecasting.

Keywords: time series forecasting; multivariate time series; spatio-temporal decomposition

Metarelation2vec:A Metapath-Free Scalable Representation Learning Model for Heterogeneous Networks

Tsinghua Science and Technology, 2024, Vol. 29, No. 2, pp. 553-575

Authors: Lei Chen; Yuan Li; Yong Lei; Xingye Deng. School of Information and Electrical Engineering, Hunan University of Science and Technology, Xiangtan 411201, China; School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan 411201, China

Metapaths with specific complex semantics are critical to learning diverse semantic and structural information of heterogeneous networks (HNs) for most of the existing representation learning ***, any metapaths consisting of multiple, simple metarelations must be driven by domain *** sensitive, expensive, and limited metapaths severely reduce the flexibility and scalability of the existing models. A metapath-free, scalable representation learning model, called Metarelation2vec, is proposed for HNs with biased joint learning of all metarelations in a bid to address this ***, a metarelation-aware, biased walk strategy is first designed to obtain better training samples by using autogenerating cooperation probabilities for all metarelations rather than using expert-given ***, grouped nodes by the type, a common and shallow skip-gram model is used to separately learn structural proximity for each node ***, grouped links by the type, a novel and shallow model is used to separately learn the semantic proximity for each link ***, supervised by the cooperation probabilities of all meta-words, the biased training samples are thrown into the shallow models to jointly learn the structural and semantic information in the HNs, ensuring the accuracy and scalability of the *** experimental results on three tasks and four open datasets demonstrate the advantages of our proposed model.

Keywords: metarelation; random walk; heterogeneous network; metapath; representation learning

A Global-Local Parallel Dual-Branch Deep Learning Model with Attention-Enhanced Feature Fusion for Brain Tumor MRI Classification

Computers, Materials & Continua, 2025, Vol. 83, No. 4, pp. 739-760

Authors: Zhiyong Li; Xinlian Zhou. School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan 411100, China

Brain tumor classification is crucial for personalized treatment *** deep learning-based Artificial Intelligence (AI) models can automatically analyze tumor images, fine details of small tumor regions may be overlooked during global feature ***, we propose a brain tumor Magnetic Resonance Imaging (MRI) classification model based on a global-local parallel dual-branch *** global branch employs ResNet50 with a Multi-Head Self-Attention (MHSA) to capture global contextual information from whole brain images, while the local branch utilizes VGG16 to extract fine-grained features from segmented brain tumor *** features from both branches are processed through designed attention-enhanced feature fusion module to filter and integrate important ***, to address sample imbalance in the dataset, we introduce a category attention block to improve the recognition of minority *** results indicate that our method achieved a classification accuracy of 98.04% and a micro-average Area Under the Curve (AUC) of 0.989 in the classification of three types of brain tumors, surpassing several existing pre-trained Convolutional Neural Network (CNN) ***, feature interpretability analysis validated the effectiveness of the proposed *** suggests that the method holds significant potential for brain tumor image classification.

Keywords: Deep learning; attention mechanism; feature fusion; dual-branch structure; brain tumor MRI classification

Research on Stock Price Prediction Method Based on the GAN-LSTM-Attention Model

Computers, Materials & Continua, 2025, Vol. 82, No. 1, pp. 609-625

Authors: Peng Li; Yanrui Wei; Lili Yin. College of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150006, China

Stock price prediction is a typical complex time series prediction problem characterized by dynamics, nonlinearity, and *** paper introduces a generative adversarial network model that incorporates an attention mechanism (GAN-LSTM-Attention) to improve the accuracy of stock price ***, the generator of this model combines the Long and Short-Term Memory Network (LSTM), the Attention Mechanism, and the Fully-Connected Layer, focusing on generating the predicted stock *** discriminator combines the Convolutional Neural Network (CNN) and the Fully-Connected Layer to discriminate between real stock prices and generated stock ***, to evaluate the practical application ability and generalization ability of the GAN-LSTM-Attention model, four representative stocks in the United States of America (USA) stock market, namely, Standard & Poor’s 500 Index stock, Apple Incorporated stock, Advanced Micro Devices Incorporated stock, and Google Incorporated stock were selected for prediction experiments, and the prediction performance was comprehensively evaluated by using the three evaluation metrics, namely, mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R²). Finally, the specific effects of the attention mechanism, convolutional layer, and fully-connected layer on the prediction performance of the model are systematically analyzed through ablation *** results of experiment show that the GAN-LSTM-Attention model exhibits excellent performance and robustness in stock price prediction.

Keywords: Stock price prediction; generative adversarial network; attention mechanism; time-series prediction
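
The abstract above evaluates predictions with three metrics: mean absolute error (MAE), root mean square error (RMSE), and the coefficient of determination (R²). For readers unfamiliar with them, the sketch below computes all three from their textbook definitions. The sample values are made up for illustration and are not results from the paper.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error: average of |y - y_hat|."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error: square root of the average squared error."""
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot.

    Assumes y_true is not constant, otherwise SS_tot is zero.
    """
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    actual = [1.0, 2.0, 3.0]      # illustrative values only
    predicted = [1.0, 2.0, 4.0]
    print(mae(actual, predicted), rmse(actual, predicted), r2(actual, predicted))
```

Lower MAE and RMSE indicate smaller errors, while R² closer to 1 indicates that the predictions explain more of the variance in the actual series.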

Local Content-Aware Enhancement for Low-Light Images with Non-Uniform Illumination

Computers, Materials & Continua, 2025, Vol. 82, No. 3, pp. 4669-4690

Authors: Qi Mu; Yuanjie Guo; Xiangfu Ge; Xinyue Wang; Zhanli Li. College of Computer Science and Technology, Xi’an University of Science and Technology, Xi’an 710054, China

In low-light image enhancement, prevailing Retinex-based methods often struggle with precise illumination estimation and brightness *** can result in issues such as halo artifacts, blurred edges, and diminished details in bright regions, particularly under non-uniform illumination *** propose an innovative approach that refines low-light images by leveraging an in-depth awareness of local content within the *** introducing multi-scale effective guided filtering, our method surpasses the limitations of traditional isotropic filters, such as Gaussian filters, in handling non-uniform *** dynamically adjusts regularization parameters in response to local image characteristics and significantly integrates edge perception across different *** balanced approach achieves a harmonious blend of smoothing and detail preservation, enabling more accurate illumination ***, we have designed an adaptive gamma correction function that dynamically adjusts the brightness value based on local pixel intensity, further balancing enhancement effects across different brightness levels in the *** results demonstrate the effectiveness of our proposed method for non-uniform illumination images across various *** exhibits superior quality and objective evaluation scores compared to existing *** method effectively addresses potential issues that existing methods encounter when processing non-uniform illumination images, producing enhanced images with precise details and natural, vivid colors.

Keywords: Retinex; non-uniform low illumination; local content-aware; effective guided image filtering
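
The abstract above mentions an adaptive gamma correction that adjusts brightness based on local pixel intensity, but it does not give the function. The sketch below is therefore a hypothetical stand-in, not the authors' method: it assumes pixel values in [0, 1], estimates local intensity with a plain box mean, and picks a per-pixel exponent that is small (strong brightening) in dark neighborhoods and close to 1 (little change) in bright ones. The names `local_mean`, `adaptive_gamma`, `enhance`, and the parameter `gamma_min` are all illustrative assumptions.

```python
def local_mean(image, row, col, radius=1):
    """Average intensity in a square window around (row, col), clipped at borders."""
    rows = range(max(0, row - radius), min(len(image), row + radius + 1))
    cols = range(max(0, col - radius), min(len(image[0]), col + radius + 1))
    window = [image[r][c] for r in rows for c in cols]
    return sum(window) / len(window)


def adaptive_gamma(value, mean_intensity, gamma_min=0.5):
    """Raise a pixel in [0, 1] to an exponent that grows with local brightness.

    gamma ranges from gamma_min (dark surroundings) to 1.0 (bright surroundings).
    This linear mapping is an assumption made for illustration.
    """
    gamma = gamma_min + (1.0 - gamma_min) * mean_intensity
    return value ** gamma


def enhance(image, radius=1, gamma_min=0.5):
    """Apply the per-pixel adaptive gamma to a 2D list of intensities in [0, 1]."""
    return [
        [
            adaptive_gamma(image[r][c], local_mean(image, r, c, radius), gamma_min)
            for c in range(len(image[0]))
        ]
        for r in range(len(image))
    ]


if __name__ == "__main__":
    dark_patch = [[0.25, 0.25], [0.25, 0.25]]
    print(enhance(dark_patch))  # every pixel is brightened above 0.25
```

The design choice here is that dark regions receive more lift than bright regions, which is one simple way to avoid over-exposing areas that are already well lit.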

Computers, Materials & Continua, 2025, Vol. 82, No. 3, pp. 5135-5151

Authors: Jian Feng; Yifan Guo; Cailing Du. College of Computer Science & Technology, Xi’an University of Science and Technology, Xi’an 710054, China

Graph similarity learning aims to calculate the similarity between pairs of *** unsupervised graph similarity learning methods based on contrastive learning encounter challenges related to random graph augmentation strategies, which can harm the semantic and structural information of graphs and overlook the rich structural information present in *** address these issues, we propose a graph similarity learning model based on learnable augmentation and multi-level contrastive ***, to tackle the problem of random augmentation disrupting the semantics and structure of the graph, we design a learnable augmentation method to selectively choose nodes and edges within the *** enhance contrastive levels, we employ a biased random walk method to generate corresponding subgraphs, enriching the contrastive ***, to solve the issue of previous work not considering multi-level contrastive learning, we utilize graph convolutional networks to learn node representations of augmented views and the original graph and calculate the interaction information between the attribute-augmented and structure-augmented views and the original *** goal is to maximize node consistency between different views and learn node matching between different graphs, resulting in node-level representations for each *** representations are then obtained through pooling operations, and we conduct contrastive learning utilizing both node and subgraph ***, the graph similarity score is computed according to different downstream *** conducted three sets of experiments across eight datasets, and the results demonstrate that the proposed model effectively mitigates the issues of random augmentation damaging the original graph’s semantics and structure, as well as the insufficiency of contrastive ***, the model achieves the best overall performance.

Keywords: Graph similarity learning; contrastive learning; attributes; structure
