Search Results - Inner Mongolia University Library

24th IEEE International Conference on Data Mining, ICDM 2024

Authors: Liang, Fei-Yao; Xi, Wu-Dong; Xing, Xing-Xing; Wan, Wei; Wang, Chang-Dong; Chen, Min; Guizani, Mohsen

Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China; NetEase Games, China; UX Center, NetEase Games, Guangzhou, China; Guangdong Provincial Key Laboratory of Intellectual Property and Big Data, Guangzhou, China; School of Computer Science and Engineering, South China University of Technology, Guangzhou, China; Pazhou Laboratory, Guangzhou, China; Machine Learning Department, Abu Dhabi, United Arab Emirates

ISBN (print): 9798331506681

With the explosive growth of information, recommendation systems have emerged to alleviate the problem of information overload. In order to improve the performance of recommendation systems, many existing methods introduce Large Language Models to extract textual information from description text. However, Large Language Models are trained on large-scale generic textual data and may face a semantic gap for downstream recommendation tasks. To address the above issues, we propose Contrastive learning for Adapting Language Model to Sequential Recommendation (CLA-Rec). In CLA-Rec, we first extract text embeddings from description text using Large Language Models and align the text embeddings learned by Large Language Models with the collaborative information through contrastive learning to obtain high-quality item representations. Through semantic alignment, we bridge the semantic gap between Large Language Models and the recommendation task. To map textual information and collaborative information into user representations, we utilize a Transformer model to learn user representations and capture user preferences by combining the semantically aligned item representations. Extensive experiments on three public datasets demonstrate that our method outperforms state-of-the-art approaches on multiple evaluation metrics, illustrating the effectiveness of the CLA-Rec model in adapting Large Language Models to recommendation tasks. © 2024 IEEE.
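A minimal sketch of the alignment step described above, assuming an InfoNCE-style contrastive objective in which the text embedding and the collaborative embedding of the same item form the positive pair and the other items in the batch act as negatives. The abstract does not name the exact loss or its hyperparameters, so `info_nce`, `l2_normalize`, and `temperature=0.1` are hypothetical choices for illustration only.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def info_nce(text_embs, collab_embs, temperature=0.1):
    """Mean InfoNCE loss; row i of each list describes the same item."""
    text = [l2_normalize(v) for v in text_embs]
    collab = [l2_normalize(v) for v in collab_embs]
    total = 0.0
    for i, t in enumerate(text):
        # Cosine similarity of item i's text embedding to every collaborative embedding.
        logits = [sum(a * b for a, b in zip(t, c)) / temperature for c in collab]
        # Numerically stable log-sum-exp over all candidates.
        top = max(logits)
        log_denominator = top + math.log(sum(math.exp(z - top) for z in logits))
        # Cross-entropy with the same-index pair as the positive.
        total += log_denominator - logits[i]
    return total / len(text)
```

When the two views of each item already agree, the loss is close to zero; when the pairing is swapped, it becomes large, and that gradient signal is what would pull the two embedding spaces together.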

Keywords: Contrastive learning

Multi-Task Learning for Fatigue Detection and Face Recognition of Drivers via Tree-Style Space-Channel Attention Fusion Network

arXiv, 2024

Authors: Qu, Shulei; Gao, Zhenguo; Chen, Xiaowei; Li, Na; Wang, Yakai; Wu, Xiaoxiao

Affiliations: Department of Computer Science and Technology, Huaqiao University, Fujian, Xiamen 361021, China; Key Laboratory of Computer Vision Machine Learning of Fujian Provincial Universities, Fujian, Xiamen 361021, China; Department of Mechanical Engineering and Automation, Huaqiao University, Fujian, Xiamen 361021, China

In driving scenarios, automobile active safety systems are increasingly incorporating deep learning technology. These systems typically need to handle multiple tasks simultaneously, such as detecting fatigue driving and recognizing the driver’s identity. However, the traditional parallel-style approach of combining multiple single-task models tends to waste resources when dealing with similar tasks. Therefore, we propose a novel tree-style approach to multi-task modeling: the model is rooted at a shared backbone, and more dedicated, separate module branches are appended as the pipeline goes deeper. Following the tree-style approach, we propose a multi-task learning model that simultaneously performs driver fatigue detection and face recognition for identifying the driver. This model shares a common feature extraction backbone module, with further separated feature extraction and classification module branches. The dedicated branches exploit and combine spatial and channel attention mechanisms to generate space-channel fused-attention enhanced features, leading to improved detection performance. As only single-task datasets are available, we introduce techniques including alternating updates and gradient accumulation to train our multi-task model using only the single-task datasets. The effectiveness of our tree-style multi-task learning model is verified through extensive validation. © 2024, CC BY.
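One way to picture training a shared backbone with task branches from single-task datasets is sketched below. It assumes a toy model in which the prediction for task `t` is `(shared + heads[t]) * x`, alternates over the tasks' datasets, accumulates their gradients, and only then takes one optimizer step. The model, the squared-error loss, the learning rate, and the name `train` are all hypothetical; the abstract does not describe the actual schedule.

```python
def train(datasets, lr=0.1, rounds=200):
    """Alternate over single-task datasets, accumulate gradients, then step once."""
    shared = 0.0                       # stands in for the shared backbone
    heads = [0.0 for _ in datasets]    # one branch parameter per task
    for _ in range(rounds):
        grad_shared = 0.0
        grad_heads = [0.0 for _ in datasets]
        # Each micro-batch comes from exactly one single-task dataset.
        for task, batch in enumerate(datasets):
            for x, y in batch:
                error = (shared + heads[task]) * x - y
                grad = error * x / len(batch)
                grad_shared += grad        # the backbone receives gradients from every task
                grad_heads[task] += grad   # a branch receives gradients only from its own task
        # Single update after gradients from all tasks have been accumulated.
        scale = lr / len(datasets)
        shared -= scale * grad_shared
        heads = [h - scale * g for h, g in zip(heads, grad_heads)]
    return shared, heads
```

Accumulating before stepping keeps the shared parameter from being pulled back and forth by whichever task happened to supply the latest batch.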

Keywords: Face recognition

RecCoder: Reformulating Sequential Recommendation as Large Language Model-Based Code Completion

24th IEEE International Conference on Data Mining, ICDM 2024

Authors: Lai, Kai-Huang; Xi, Wu-Dong; Xing, Xing-Xing; Wan, Wei; Wang, Chang-Dong; Chen, Min; Guizani, Mohsen

Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China; NetEase Games, China; UX Center, NetEase Games, Guangzhou, China; Guangdong Provincial Key Laboratory of Intellectual Property and Big Data, Guangzhou, China; School of Computer Science and Engineering, South China University of Technology, Guangzhou, China; Pazhou Laboratory, Guangzhou, China; Machine Learning Department, Abu Dhabi, United Arab Emirates

ISBN (print): 9798331506681

In the evolving landscape of sequential recommendation systems, the application of Large Language Models (LLMs) is increasingly prominent. However, current attempts typically utilize general-purpose LLMs, which present a mismatch in capability and a large semantic gap relative to the specialized needs of recommendation tasks. To tackle these issues, we introduce RecCoder, an innovative model that reformulates sequential recommendation as a code completion task. This approach leverages the superior reasoning capability of code LLMs as a backbone, aligning well with the requirements of recommendation systems. To bridge the semantic gap, RecCoder creates extra tokens for each item and employs item content to initialize token embeddings. Furthermore, we have developed a suite of Semantic Adaptation Fine-tuning tasks, tailored to enhance the model's acquisition of both content and collaborative semantic information, thus aligning the model's intrinsic capabilities with the unique demands of recommendation tasks. Through extensive testing on three public datasets, RecCoder has shown remarkable improvements over existing models in terms of recommendation accuracy and efficiency. This success highlights the substantial yet previously underexplored potential of code LLMs in improving recommendation accuracy and efficiency, suggesting a promising new direction for future research in this area. The implementation code is accessible at https://***/AllminerLab/Code-for-RecCoder-master. © 2024 IEEE.

Keywords: Semantics

Cross-Store Next-Basket Recommendation

24th IEEE International Conference on Data Mining, ICDM 2024

Authors: Ma, Liang-Chen; Li, Ya; Mai, Zi-Feng; Liang, Fei-Yao; Wang, Chang-Dong; Chen, Min; Guizani, Mohsen

Affiliations: School of Electronics and Information, Guangdong Polytechnic Normal University, Guangzhou, China; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China; Guangdong Provincial Key Laboratory of Intellectual Property and Big Data, Guangzhou, China; School of Computer Science and Engineering, South China University of Technology, Guangzhou, China; Pazhou Laboratory, Guangzhou, China; Machine Learning Department, Abu Dhabi, United Arab Emirates

ISBN (print): 9798331506681

Next-basket recommendation (NBR) infers a set of items that a user will interact with in the next basket. Existing methods often struggle with the data sparsity problem, particularly when the number of baskets is very large due to diverse user behaviors. Cross-domain recommendation (CDR) can effectively alleviate this problem in NBR by transferring knowledge across different domains. Nevertheless, these methods often rely on the similarities of overlapping users, which leads to the negative transfer problem and ignores the overlapping items that are common in real-world scenarios like chain stores. In this paper, we provide a clear symbolic definition of cross-store recommendation (CSR) and distinguish it from CDR. We also propose a novel CSNBR model for the cross-store next-basket recommendation task. To fully model the transferable collaborative information between two stores, we learn the embeddings of users, baskets, and items with two intra-store bipartite graphs, and use an inter-store unified bipartite graph to transfer the previously learned knowledge. Furthermore, to alleviate the negative transfer problem, we propose to reconstruct the inter-store unified bipartite graph by utilizing user embeddings obtained from the transfer layer and the disentanglement layer. We also employ two sequence encoders to model the historical sequential information at the basket level and the item level. Extensive experiments conducted on real-world datasets demonstrate the effectiveness of the CSNBR model. © 2024 IEEE.

Keywords: Graph embeddings

CRLNet: Cascaded Resolution Learning Network for Natural Scenes Segmentation

IEEE Intelligent Systems, 2025

Authors: Li, Wei; Tian, Shishun; Hua, Guoguang; Liao, Muxin; Zhang, Yuhang; Zou, Wenbin

Affiliations: Shenzhen University, Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen Key Laboratory of Advanced Machine Learning and Applications, College of Electronics and Information Engineering, Shenzhen 518060, China; Jiangxi Agricultural University, School of Computer Science and Engineering, Nanchang 330045, China

The natural environment presents a multitude of scenes with diverse content, posing challenges for satisfactory segmentation results using existing segmentation networks. In response, we propose a Cascaded Resolution Learning Network (CRLNet) to enhance segmentation performance through global textual embedding and multi-resolution feature learning. The CRLNet constructs a multi-path segmentation system that integrates multi-resolution feature data from different paths, thereby progressively enhancing local feature learning. Two key modules, the Partition-Fusion Channel Attention Module (PFCAM) and the Features Learning Module (FLM), are pivotal components of CRLNet. The PFCAM serves as a computationally efficient channel attention module to mitigate segmentation confusion stemming from similar objects. Meanwhile, the FLM is tailored to learn resolution feature maps from different paths, facilitating the refinement of object representation and enhancing segmentation performance. Extensive experiments conducted on real natural scene datasets demonstrate the superiority of the proposed CRLNet over existing efficient segmentation methods in terms of accuracy. © 2025 IEEE.

Keywords: Adversarial machine learning

Intelligent cache and buffer optimization for mobile VR adaptive transmission in 5G edge computing networks

Digital Communications and Networks, 2024, vol. 10, no. 5, pp. 1234-1244

Authors: Junchao Yang; Ali Kashif Bashir; Zhiwei Guo; Keping Yu; Mohsen Guizani

Affiliations: Chongqing Key Laboratory of Intelligent Perception and BlockChain Technology, School of Artificial Intelligence, Chongqing Technology and Business University, Chongqing 400067, China; Department of Computing and Mathematics, Manchester Metropolitan University, UK; Woxsen School of Business, Woxsen University, Hyderabad 502345, India; Department of Computer Science and Mathematics, Lebanese American University, Beirut, Lebanon; Graduate School of Science and Engineering, Hosei University, Tokyo 184-8584, Japan; Machine Learning Department, Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), United Arab Emirates

Virtual Reality (VR) is a key industry for the development of the digital economy in the *** VR has advantages in terms of mobility, lightweight and cost-effectiveness, which has gradually become the mainstream implementation of *** this paper, a mobile VR video adaptive transmission mechanism based on intelligent caching and hierarchical buffering strategy in Mobile Edge Computing (MEC)-equipped 5G networks is proposed, aiming at the low latency requirements of mobile VR services and flexible buffer management for VR video adaptive *** support VR content proactive caching and intelligent buffer management, users’ behavioral similarity and head movement trajectory are jointly used for viewpoint *** tile-based content is proactively cached in the MEC nodes based on the popularity of the VR ***, a hierarchical buffer-based adaptive update algorithm is presented, which jointly considers bandwidth, buffer, and predicted viewpoint status to update the tile chunk in client ***, according to the decomposition of the problem, the buffer update problem is modeled as an optimization problem, and the corresponding solution algorithms are ***, the simulation results show that the adaptive caching algorithm based on 5G intelligent edge and hierarchical buffer strategy can improve the user experience in the case of bandwidth fluctuations, and the proposed viewpoint prediction method can significantly improve the accuracy of viewpoint prediction by 15%.

Keywords: Virtual reality; Adaptive transmission; Edge cache; Buffer management; 5G; Mobile edge computing

Harnessing Light Field Angular Cues and Spatial Geometries for Semantic Segmentation

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Chen Jia; Fan Shi; Xu Cheng

Affiliations: School of Computer Science and Engineering, The Engineering Research Center of Learning-Based Intelligent System (Ministry of Education), The Key Laboratory of Computer Vision and System (Ministry of Education), Tianjin University of Technology, Tianjin, China

ISBN (electronic): 9798350368741

ISBN (print): 9798350368758

4D light field imaging captures rich spatial-angular information, providing essential geometric cues for semantic segmentation tasks. In this paper, we introduce a novel backbone network called the Light Field Extraction Interaction Network (LFEI-Net). LFEI-Net excels in extracting global structures and multi-scale spatial-angular features, capturing feature dependencies through channel modeling and diverse feature interactions. Unlike traditional methods that depend on pyramid and dilated feature extraction, LFEI-Net pioneers an efficient method by integrating large-scale horizontal depth-wise convolution (HDWC) and vertical depth-wise convolution (VDWC) with interactive operations for comprehensive spatial multi-scale feature extraction. Furthermore, we present the Multi-Angular Modeling (MAM) module, which effectively captures scene angle variations from multiple perspectives and precisely delineates object boundaries, thereby improving model adaptability. Our experimental evaluations on two datasets demonstrate that LFEI-Net significantly outperforms state-of-the-art (SOTA) 2D and 4D light field semantic segmentation methods, achieving mean Intersection over Union (mIoU) of 83.72% and 86.88%, respectively.
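To illustrate what horizontal and vertical depth-wise convolutions compute, the sketch below applies a 1 x k kernel along the rows and a k x 1 kernel along the columns of a single channel, with zero padding so the output keeps the input size. It assumes an odd kernel length, and the function names are hypothetical; how LFEI-Net sizes, orders, or combines these operations is not stated in the abstract.

```python
def horizontal_dw_conv(channel, kernel):
    """Slide a 1 x k kernel along each row of one channel (zero padded, odd k)."""
    k = len(kernel)
    pad = k // 2
    output = []
    for row in channel:
        padded = [0.0] * pad + list(row) + [0.0] * pad
        output.append([
            sum(kernel[j] * padded[i + j] for j in range(k))
            for i in range(len(row))
        ])
    return output


def vertical_dw_conv(channel, kernel):
    """Slide a k x 1 kernel along each column by transposing, filtering, transposing back."""
    transposed = [list(column) for column in zip(*channel)]
    filtered = horizontal_dw_conv(transposed, kernel)
    return [list(row) for row in zip(*filtered)]
```

In a depth-wise layer each channel is filtered with its own kernel, so a pair of strip kernels of length k costs about 2k multiplications per output value instead of the k * k needed by a full square kernel.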

Keywords: Geometry; Adaptation models; Convolution; Semantic segmentation; Imaging; Feature extraction; Light fields; Acoustics; Speech processing

Video Test-Time Adaptation for Action Recognition

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wei Lin; Muhammad Jehanzeb Mirza; Mateusz Kozinski; Horst Possegger; Hilde Kuehne; Horst Bischof

Affiliations: Institute for Computer Graphics and Vision, Graz University of Technology, Austria; Christian Doppler Laboratory for Semantic 3D Computer Vision; Christian Doppler Laboratory for Embedded Machine Learning; Goethe University Frankfurt, Germany; MIT-IBM Watson AI Lab

Although action recognition systems can achieve top performance when evaluated on in-distribution test points, they are vulnerable to unanticipated distribution shifts in test data. However, test-time adaptation of video action recognition models against common distribution shifts has so far not been demonstrated. We propose to address this problem with an approach tailored to spatio-temporal models that is capable of adaptation on a single video sample at a step. It consists of a feature distribution alignment technique that aligns online estimates of test set statistics towards the training statistics. We further enforce prediction consistency over temporally augmented views of the same test video sample. Evaluations on three benchmark action recognition datasets show that our proposed technique is architecture-agnostic and able to significantly boost the performance on both the state-of-the-art convolutional architecture TANet and the Video Swin Transformer. Our proposed method demonstrates a substantial performance gain over existing test-time adaptation approaches in both evaluations of a single distribution shift and the challenging case of random distribution shifts. Code will be available at https://***/wlin-at/ViTTA.
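The feature distribution alignment idea can be sketched as follows, assuming the online estimate is an exponential moving average of per-dimension feature mean and variance and that the discrepancy is measured with an L1 distance. The class name, the momentum value, and the choice of distance are hypothetical; the abstract only says that online test statistics are aligned towards the training statistics.

```python
class OnlineFeatureStats:
    """Exponential moving average of per-dimension feature mean and variance."""

    def __init__(self, momentum=0.1):
        self.momentum = momentum
        self.mean = None
        self.var = None

    def update(self, features):
        """features: equally sized feature vectors taken from one test video."""
        n = len(features)
        dims = range(len(features[0]))
        mean = [sum(f[d] for f in features) / n for d in dims]
        var = [sum((f[d] - mean[d]) ** 2 for f in features) / n for d in dims]
        if self.mean is None:
            # The first video initialises the running estimates.
            self.mean, self.var = mean, var
        else:
            m = self.momentum
            self.mean = [(1 - m) * old + m * new for old, new in zip(self.mean, mean)]
            self.var = [(1 - m) * old + m * new for old, new in zip(self.var, var)]


def alignment_loss(stats, train_mean, train_var):
    """L1 gap between the running test statistics and stored training statistics."""
    mean_gap = sum(abs(a - b) for a, b in zip(stats.mean, train_mean))
    var_gap = sum(abs(a - b) for a, b in zip(stats.var, train_var))
    return mean_gap + var_gap
```

Because the running estimates carry information from earlier videos, the loss stays meaningful even when each adaptation step sees only a single video sample.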

Sit Back and Relax: Learning to Drive Incrementally in All Weather Conditions

IEEE Symposium on Intelligent Vehicle

Authors: Stefan Leitner; M. Jehanzeb Mirza; Wei Lin; Jakub Micorek; Marc Masana; Mateusz Kozinski; Horst Possegger; Horst Bischof

Affiliations: Institute for Computer Graphics and Vision, Graz University of Technology, Austria; Christian Doppler Laboratory for Embedded Machine Learning; Christian Doppler Laboratory for Semantic 3D Computer Vision; Silicon Austria Labs, TU Graz - SAL Dependable Embedded Systems Lab

In autonomous driving scenarios, current object detection models show strong performance when tested in clear weather. However, their performance deteriorates significantly when tested in degrading weather conditions. In addition, even when adapted to perform robustly in a sequence of different weather conditions, they are often unable to perform well in all of them and suffer from catastrophic forgetting. To efficiently mitigate forgetting, we propose Domain-Incremental Learning through Activation Matching (DILAM), which employs unsupervised feature alignment to adapt only the affine parameters of a clear weather pre-trained network to different weather conditions. We propose to store these affine parameters as a memory bank for each weather condition and plug in the weather-specific parameters during driving (i.e., test time) when the respective weather conditions are encountered. Our memory bank is extremely lightweight, since affine parameters account for less than 2% of a typical object detector. Furthermore, contrary to previous domain-incremental learning approaches, we do not require the weather label when testing and propose to automatically infer the weather condition by a majority voting linear classifier.
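A minimal sketch of the memory-bank lookup described above, assuming the per-frame weather predictions are already produced by some classifier and that each bank entry is a `(gamma, beta)` pair applied to normalized features. The bank contents, the weather names, and the function names are hypothetical placeholders, not values from the paper.

```python
from collections import Counter

# Hypothetical memory bank: one (gamma, beta) pair of affine parameters per weather.
MEMORY_BANK = {
    "clear": ([1.0, 1.0], [0.0, 0.0]),
    "fog": ([0.5, 2.0], [0.1, -0.1]),
}


def infer_weather(frame_votes):
    """Majority vote over per-frame weather predictions (no weather label needed)."""
    return Counter(frame_votes).most_common(1)[0][0]


def apply_affine(normalized_features, affine):
    """Scale and shift normalized features with the selected affine parameters."""
    gamma, beta = affine
    return [g * x + b for x, g, b in zip(normalized_features, gamma, beta)]


def adapt(normalized_features, frame_votes, bank=MEMORY_BANK):
    """Plug in the affine parameters stored for the inferred weather condition."""
    weather = infer_weather(frame_votes)
    return apply_affine(normalized_features, bank[weather])
```

Only the small affine pairs are swapped at test time, which is why the rest of the clear-weather network can stay frozen and nothing previously learned is overwritten.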

Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification

International Conference on Computer Vision (ICCV)

Authors: Jianbing Wu; Hong Liu; Yuxin Su; Wei Shi; Hao Tang

Affiliations: Key Laboratory of Machine Perception, Shenzhen Graduate School, Peking University, China; Computer Vision Lab, ETH Zürich, Switzerland

Owing to the large distribution gap between the heterogeneous data in Visible-Infrared Person Re-identification (VI Re-ID), we point out that existing paradigms often suffer from the inter-modal semantic misalignment issue and thus fail to align and compare local details properly. In this paper, we present Concordant Attention Learning (CAL), a novel framework that learns semantic-aligned representations for VI Re-ID. Specifically, we design the Target-aware Concordant Alignment paradigm, which allows target-aware attention adaptation when aligning heterogeneous samples (i.e., adaptive attention adjustment according to the target image being aligned). This is achieved by exploiting the discriminative clues from the modality counterpart and designing effective modality-agnostic correspondence searching strategies. To ensure semantic concordance during the cross-modal retrieval stage, we further propose MatchDistill, which matches the attention patterns across modalities and learns their underlying semantic correlations by bipartite-graph-based similarity modeling and cross-modal knowledge exchange. Extensive experiments on VI Re-ID benchmark datasets demonstrate the effectiveness and superiority of the proposed CAL.
