Search Results - Inner Mongolia University Library

Points of interest in the city of Barcelos in Portugal through augmented reality

Internet of Things and Cyber-Physical Systems, 2024, Vol. 4, No. 1, pp. 40-48

Authors: Pereira, Miguel; Silva, João Carlos; Pinheiro, Marisa; Carvalho, Sandro; Santos, Gilberto. Affiliations: Polytechnic Institute of Cávado and Ave, Barcelos, Portugal; 2Ai - Applied Artificial Intelligence Laboratory, Barcelos, Portugal; INESC TEC - Institute for Systems and Computer Engineering, Porto, Portugal; LIACC - Laboratory of Artificial Intelligence and Computer Science, Porto, Portugal

Barcelos is a historic city in Portugal with many tourist attractions, attracting more and more visitors who come to the city with the aim of exploring it. The main objective of this article is to boost tourism in the city of Barcelos, specifically highlighting tourist, historical and leisure spots, based on the development of a mobile application using augmented reality technologies and geolocation. This application intends to allow the users to know historical points of interest in Barcelos, as well as interact with a certain point. The results of this investigation were evaluated by testing the application by end users, with the aim of identifying whether the application meets their needs, in particular the promotion of tourist and historical points. © 2023

Keywords: Augmented reality

Residual diverse ensemble for long-tailed multi-label text classification

Science China (Information Sciences), 2024, Vol. 67, No. 11, pp. 92-105

Authors: Jiangxin SHI; Tong WEI; Yufeng LI. Affiliations: National Key Laboratory for Novel Software Technology, Nanjing University; School of Artificial Intelligence, Nanjing University; School of Computer Science and Engineering, Southeast University; Key Laboratory of Computer Network and Information Integration (Southeast University), Ministry of Education

Long-tailed multi-label text classification aims to identify a subset of relevant labels from a large candidate label set, where the training datasets usually follow long-tailed label distributions. Many of the previous studies have treated head and tail labels equally, resulting in unsatisfactory performance for identifying tail labels. To address this issue, this paper proposes a novel learning method that combines arbitrary models with two steps. The first step is the “diverse ensemble” that encourages diverse predictions among multiple shallow classifiers, particularly on tail labels, and can improve the generalization of tail *** second is the “error correction” that takes advantage of accurate predictions on head labels by the base model and approximates its residual errors for tail labels. Thus, it enables the “diverse ensemble” to focus on optimizing the tail label performance. This overall procedure is called residual diverse ensemble (RDE). RDE is implemented via a single-hidden-layer perceptron and can be used for scaling up to hundreds of thousands of labels. We empirically show that RDE consistently improves many existing models with considerable performance gains on benchmark datasets, especially with respect to the propensity-scored evaluation ***, RDE converges in less than 30 training epochs without increasing the computational overhead.

Keywords: multi-label learning; extreme multi-label learning; long-tailed distribution; multi-label text classification; ensemble learning

A survey on cross-user federated recommendation

Science China (Information Sciences), 2025, Vol. 68, No. 4, pp. 7-32

Authors: Enyue YANG; Yudi XIONG; Wei YUAN; Weike PAN; Qiang YANG; Zhong MING. Affiliations: College of Computer Science and Software Engineering, Shenzhen University; School of Electrical Engineering and Computer Science, The University of Queensland; WeBank AI Lab, WeBank; Department of Computer Science and Engineering, Hong Kong University of Science and Technology; College of Big Data and Internet, Shenzhen Technology University; Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)

Recommender systems are effective in mitigating information overload, yet the centralized storage of user data raises significant privacy concerns. Cross-user federated recommendation (CUFR) provides a promising distributed paradigm to address these concerns by enabling privacy-preserving recommendations directly on user devices. In this survey, we review and categorize current progress in CUFR, focusing on four key aspects: privacy, security, accuracy, and efficiency. Firstly, we conduct an in-depth privacy analysis, discuss various cases of privacy leakage, and then review recent methods for privacy protection. Secondly, we analyze security concerns and review recent methods for untargeted and targeted *** untargeted attack methods, we categorize them into data poisoning attack methods and parameter poisoning attack methods. For targeted attack methods, we categorize them into user-based methods and item-based methods. Thirdly, we provide an overview of the federated variants of some representative methods, and then review the recent methods for improving accuracy from two categories: data heterogeneity and high-order information. Fourthly, we review recent methods for improving training efficiency from two categories: client sampling and model compression. Finally, we conclude this survey and explore some potential future research topics in CUFR.

Keywords: cross-user federated recommendation; federated recommendation; federated learning; recommender systems; user privacy

Reinforcement learning of non-additive joint steganographic embedding costs with attention mechanism

Science China (Information Sciences), 2023, Vol. 66, No. 3, pp. 273-286

Authors: Weixuan TANG; Bin LI; Weixiang LI; Yuangen WANG; Jiwu HUANG. Affiliations: Institute of Artificial Intelligence and Blockchain, Guangzhou University; Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen Key Laboratory of Media Security, Shenzhen University; Shenzhen Institute of Artificial Intelligence and Robotics for Society; School of Computer Science and Cyber Engineering, Guangzhou University

Image steganography is the art and science of secure communication by concealing information within digital images. In recent years, the techniques of steganographic cost learning have developed rapidly. Although the existing methods can learn satisfactory additive costs, the interplay of different pixels' embedding impacts has not been considered, so the potential of learning may not be fully exploited. To overcome this limitation, in this paper, a reinforcement learning paradigm called JoPoL (joint policy learning) is proposed to extend the idea of additive cost learning to a non-additive situation. JoPoL aims to capture the interactions within pixel blocks by defining embedding policies and evaluating contributions of embedding impacts on a block level rather than a pixel level. Then, a policy network is utilized to learn optimal joint embedding policies for pixel blocks through interactions with the environment. Afterwards, these policies can be converted into joint embedding costs for practical message embedding. The structure of the policy network is designed with an effective attention mechanism and incorporated with the domain knowledge derived from traditional non-additive steganographic methods. The environment is responsible for assigning rewards according to the impacts of the sampled joint embedding actions, which are evaluated by the gradient information of a neural network-based steganalyzer. Experimental results show that the proposed non-additive method JoPoL significantly outperforms the existing additive methods against both feature-based and CNN-based steganalyzers over different payloads.

Keywords: information hiding; non-additive steganography; steganalysis; cost learning; image processing

Dynamic Strip Convolution and Adaptive Morphology Perception Plugin for Medical Anatomy Segmentation

IEEE Transactions on Medical Imaging, 2025, Vol. 44, No. 6, pp. 2541-2552

Authors: Hu, Guyue; Kang, Yukun; Zhao, Gangming; Jin, Zhe; Li, Chenglong; Tang, Jin. Affiliations: Anhui University, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui Provincial Key Laboratory of Security Artificial Intelligence, School of Artificial Intelligence, Hefei 230601, China; Anhui University, Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Hefei 230601, China; The University of Hong Kong, Department of Computer Science, Hong Kong; Anhui University, School of Artificial Intelligence, Hefei 230601, China

Medical anatomy segmentation is essential for computer-aided diagnosis and lesion localization in medical images. For example, segmenting individual ribs helps localize lung lesions and provides vital medical measurements (such as rib spacing) for generating medical reports. Existing methods segment anatomies of different shapes (such as striped ribs, bulky lungs, and angular scapula) with the same network architecture, so the morphological heterogeneity is heavily overlooked. Although some shape-aware operators like deformable convolution and dynamic snake convolution have been introduced to cater to specific object morphology, they still struggle with orientation-varying strip structures, such as 24 ribs and 2 clavicles. In this paper, we propose a novel convolution plugin (DSC-AMP) for medical anatomy segmentation, which comprises a dynamic strip convolution (DSC) operator and an adaptive morphology perception (AMP) strategy. Specifically, the dynamic strip convolution customizes gradually varying directions and offsets for each local region, achieving dynamic striped receptive fields. Additionally, the adaptive morphology perception strategy incorporates insights from various shape-aware convolutional kernels, enabling the model to discern and integrate crucial representations corresponding to heterogeneous anatomies. Extensive experiments on two large-scale datasets demonstrate the effectiveness and superiority of the proposed approach for tackling heterogeneous medical anatomy segmentation. © 2025 IEEE. All rights reserved.

Keywords: Diagnosis

ControlVideo: conditional control for one-shot text-driven video editing and beyond

Science China (Information Sciences), 2025, Vol. 68, No. 3, pp. 150-162

Authors: Min ZHAO; Rongzhen WANG; Fan BAO; Chongxuan LI; Jun ZHU. Affiliations: Department of Computer Science and Technology, Institute for AI, Tsinghua-Bosch Joint ML Center, Tsinghua Laboratory of Brain and Intelligence Lab, Tsinghua University; ShengShu Technology; Gaoling School of Artificial Intelligence, Renmin University of China; Beijing Key Laboratory of Big Data Management and Analysis Methods; Pazhou Laboratory (Huangpu)

This paper presents ControlVideo for text-driven video editing: generating a video that aligns with a given text while preserving the structure of the source video. Building on a pre-trained text-to-image diffusion model, ControlVideo enhances the fidelity and temporal consistency by incorporating additional conditions (such as edge maps), and fine-tuning the key-frame and temporal attention on the source video-text pair via an in-depth exploration of the design space. Extensive experimental results demonstrate that ControlVideo outperforms various competitive baselines by delivering videos that exhibit high fidelity w.r.t. the source content, and temporal consistency, all while aligning with the text. By incorporating low-rank adaptation layers into the model before training, ControlVideo is further empowered to generate videos that align seamlessly with reference images. More importantly, ControlVideo can be readily extended to the more challenging task of long video editing (e.g., with hundreds of frames), where maintaining long-range temporal consistency is crucial. To achieve this, we propose to construct a fused ControlVideo by applying basic ControlVideo to overlapping short video segments and key frame videos and then merging them by pre-defined weight functions. Empirical results validate its capability to create videos across 140 frames, which is approximately 5.83 to 17.5 times more than what previous studies achieved. The code is available at https://***/thu-ml/controlvideo.

Keywords: diffusion models; controllable generation; text-driven editing; video editing; long video editing
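
Illustrative note on the abstract above: the idea of merging overlapping short segments with pre-defined weight functions can be sketched as a weighted average over the frames that several segments cover. This is only a minimal sketch under stated assumptions, not the authors' implementation: the triangular weight, the function names `triangular_weight` and `fuse_segments`, and the use of one scalar per frame (instead of image tensors) are all hypothetical, and the key frame videos mentioned in the abstract are not modelled.

```python
def triangular_weight(i, length):
    """Hypothetical weight function: largest at the segment centre, smallest (but positive) at the edges."""
    return min(i + 1, length - i)


def fuse_segments(segments, total_frames):
    """Blend overlapping segments into one sequence by a per-frame weighted average.

    segments: list of (start_index, values) pairs, where values holds one number per frame.
    """
    weighted_sum = [0.0] * total_frames
    weight_total = [0.0] * total_frames
    for start, values in segments:
        for i, value in enumerate(values):
            w = triangular_weight(i, len(values))
            weighted_sum[start + i] += w * value
            weight_total[start + i] += w
    # Frames covered by no segment are left at 0.0 in this sketch.
    return [s / t if t else 0.0 for s, t in zip(weighted_sum, weight_total)]


# Two 4-frame segments that overlap on frames 2 and 3.
segments = [(0, [1.0, 1.0, 1.0, 1.0]), (2, [3.0, 3.0, 3.0, 3.0])]
fused = fuse_segments(segments, 6)
```

In the overlap, each fused frame leans toward the segment in which that frame sits closer to the centre, which is one simple way to avoid a visible jump at segment boundaries.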

Unlocking the potential of edge nodes: Range-extender for federated learning

Alexandria Engineering Journal, 2025, Vol. 128, pp. 12-40

Authors: Li, Boyuan. Affiliations: School of Computer Science and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China; Innovation Center of Intelligent Systems, Longmen Laboratory, Luoyang 471000, China

With the rapid development of wireless networks and the widespread popularity of smart terminals, federated learning (FL) has attracted much attention as a distributed machine learning framework. This technique decentralizes the modeling process to mobile edge nodes, exploiting local data and edge arithmetic through collaboration. Although FL has many advantages, such as privacy protection, it still faces challenges in time management. Current FL frameworks suffer from inefficiencies in resource utilization (both synchronous and asynchronous), mainly due to idle arithmetic caused by communication gaps. In this case, collaboration time is usually wasted in long communication waits. To cope with this problem, we propose an edge node training range extender which can effectively utilize the communication window period for local training, thus compensating for edge node idling conditions and unleashing the potential of edge node training. This novel FL strategy revisits the FL process and provides two fused forms of additional training gradients. We critically analyze the convergence of additional FL and compare it with the mainstream FL frameworks at this stage. We demonstrate the potential benefits of this new strategy by performing a comprehensive analysis of the CIFAR10 and CIFAR100 datasets for a classification task. © 2025 The Author

Keywords: Federated learning

BMLP: behavior-aware MLP for heterogeneous sequential recommendation

Frontiers of Computer Science, 2024, Vol. 18, No. 3, pp. 235-237

Authors: Weixin LI; Yuhao WU; Yang LIU; Weike PAN; Zhong MING. Affiliations: College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China; Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen 518123, China

1 Introduction. Recommender systems can effectively alleviate the problem of information ***, traditional recommendation methods cannot capture users’ dynamic *** recommendation methods model user sequences to obtain more accurate and dynamic user ***, deep learning-based sequential recommendation methods have achieved great *** is proposed to capture the sequential information [1,2]. Attention-based methods [3] use attention mechanisms to learn relationships between ***-based methods [4−6] transform sequences into graph structures to capture relationships of ***, they have the following two limitations.

Keywords: sequences; behavior; sequential

ChemDFM-X: towards large multimodal model for chemistry

Science China (Information Sciences), 2024, Vol. 67, No. 12, pp. 99-100

Authors: Zihan ZHAO; Bo CHEN; Jingpiao LI; Lu CHEN; Liyang WEN; Pengyu WANG; Zichen ZHU; Danyang ZHANG; Yansi LI; Zhongyang DAI; Xin CHEN; Kai YU. Affiliations: X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University; Suzhou Laboratory

Chemistry, as a naturally multimodal discipline, plays a crucial role in various vital fields such as pharmaceutical research and material manufacturing. Therefore, research on artificial intelligence (AI) for chemistry has garnered increasing attention. Despite the rapid development, most of the chemical AI models today mainly focus on single tasks with unimodal input [1].

A hybrid feature selection method for text classification using a feature-correlation-based genetic algorithm

Soft Computing, 2024, Vol. 28, No. 23, pp. 13567-13593

Authors: Farek, Lazhar; Benaidja, Amira. Affiliations: Computer Science Department, University of Guelma, Guelma, Algeria; Computer Science Department, University of Setif 1, Setif, Algeria; Laboratory of Vision and Artificial Intelligence - LAVIA, Larbi Tebessi University, Tebessa, Algeria

This paper introduces a new hybrid method to address the issue of redundant and irrelevant features selected by filter-based methods for text classification. The method utilizes an enhanced genetic algorithm called "Feature Correlation-based Genetic Algorithm" (FC-GA). Initially, a feature subset with the highest classification accuracy is selected by a filter-based method, which will be then used by the FC-GA to generate potential solutions by considering the correlation between features that have similar classification weights and avoiding useless random solutions. The encoding process involves assigning a value of 0 to features that provide a high degree of correlation with other features having almost the same classification information beyond a specified context, while features that are lowly correlated retain their initial code of 1. Through iterative optimization using crossover and mutation operators, the algorithm should remove redundant features that provide strong correlations and high redundancy, which could lead to improved classification performance at a lower computation cost. The aim of this study is to improve the efficiency of filter-based methods, incorporate feature correlation information into genetic algorithms, and utilize pre-optimized feature subsets to efficiently identify optimal solutions. To evaluate the effectiveness of the proposed method, SVM and NB classifiers are employed on six public datasets and compared to five well-known and effective filter-based methods. The results indicate that a significant portion (about 50%) of the features selected by reference filter-based methods are redundant. Eliminating those redundant features leads to a significant improvement in classification performance as measured by the micro-f1 measure. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.

Keywords: Genetic algorithms
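
Illustrative note on the abstract above: the described encoding, where highly correlated features receive code 0 and weakly correlated features keep code 1, can be sketched as follows. This is a minimal sketch under stated assumptions, not the FC-GA algorithm itself: the abstract does not give the correlation measure, the threshold, or how "similar classification weights" are compared, so the Pearson correlation, the 0.9 threshold, the greedy left-to-right scan, and the names `pearson` and `initial_chromosome` are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def initial_chromosome(columns, threshold=0.9):
    """Assign 0 to a feature that is strongly correlated with an already kept feature, else 1."""
    codes = []
    kept = []  # indices of features currently coded 1
    for j, column in enumerate(columns):
        redundant = any(abs(pearson(column, columns[k])) >= threshold for k in kept)
        if redundant:
            codes.append(0)
        else:
            codes.append(1)
            kept.append(j)
    return codes


# Three toy feature columns; the second is an exact multiple of the first.
columns = [
    [1, 2, 3, 4],
    [2, 4, 6, 8],
    [1, 0, 1, 0],
]
chromosome = initial_chromosome(columns)  # [1, 0, 1]
```

A genetic algorithm could then start its crossover and mutation steps from bit strings like this one instead of from purely random ones, which is the motivation the abstract gives for avoiding useless random solutions.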
