检索结果-内蒙古大学图书馆

On learning the right attention point for feature enhancement

science China(Information sciences) 2023年第1期66卷 131-143页

作者： Liqiang LIN Pengdi HUANG Chi-Wing FU Kai XU Hao ZHANG Hui HUANG College of Computer Science and Software Engineering Shenzhen University Department of Computer Science and Engineering The Chinese University of Hong Kong School of Computer Science National University of Defense Technology School of Computing Science Simon Fraser University

We present a novel attention-based mechanism to learn enhanced point features for point cloud processing tasks, e.g., classification and segmentation. Unlike prior studies, which were trained to optimize the weights of a pre-selected set of attention points, our approach learns to locate the best attention points to maximize the performance of a specific task, e.g., point cloud classification. Importantly, we advocate the use of single attention point to facilitate semantic understanding in point feature learning. Specifically,we formulate a new and simple convolution, which combines convolutional features from an input point and its corresponding learned attention point(LAP). Our attention mechanism can be easily incorporated into state-of-the-art point cloud classification and segmentation networks. Extensive experiments on common benchmarks, such as Model Net40, Shape Net Part, and S3DIS, all demonstrate that our LAP-enabled networks consistently outperform the respective original networks, as well as other competitive alternatives, which employ multiple attention points, either pre-selected or learned under our LAP framework.

关键词： point convolution feature enhancement attention point deep neural network

来源：评论

学校读者我要写书评

暂无评论

Partial multi-label learning via label-specific feature corrections

引用

science China(Information sciences) 2025年第3期68卷 95-109页

作者： Jun-Yi HANG Min-Ling ZHANG School of Computer Science and Engineering Southeast University Key Laboratory of Computer Network and Information Integration(Southeast University) Ministry of Education

Partial multi-label learning（PML） allows learning from rich-semantic objects with inaccurate annotations, where a set of candidate labels are assigned to each training example but only some of them are valid. Existing approaches rely on disambiguation to tackle the PML problem, which aims to correct noisy candidate labels by recovering the ground-truth labeling information ahead of prediction model induction. However, this dominant strategy might be suboptimal as it usually needs extra assumptions that cannot be fully satisfied in real-world scenarios. Instead of label correction, we investigate another strategy to tackle the PML problem, where the potential ambiguity in PML data is eliminated by correcting instance features in a label-specific manner. Accordingly, a simple yet effective approach named PASE, i.e., partial multi-label learning via label-specific feature corrections, is proposed. Under a meta-learning framework, PASElearns to exert label-specific feature corrections so that potential ambiguity specific to each class label can be eliminated and the desired prediction model can be induced on these corrected instance features with the provided candidate labels. Comprehensive experiments on a wide range of synthetic and real-world data sets validate the effectiveness of the proposed approach.

关键词： machine learning multi-label learning partial multi-label learning label-specific features feature correction

来源：评论

学校读者我要写书评

暂无评论

Local saliency consistency-based label inference for weakly supervised salient object detection using scribble annotations

引用

CAAI Transactions on Intelligence Technology 2024年第1期9卷 239-249页

作者： Shuo Zhao Peng Cui Jing Shen Haibo Liu School of Computer Science and Technology Harbin University of Science and TechnologyHarbinChina School of Computer Science and Technology Harbin Engineering UniversityHarbinChina

Recently,weak supervision has received growing attention in the field of salient object detection due to the convenience of ***,there is a large performance gap between weakly supervised and fully supervised salient object detectors because the scribble annotation can only provide very limited foreground/background ***,an intuitive idea is to infer annotations that cover more complete object and background regions for *** this end,a label inference strategy is proposed based on the assumption that pixels with similar colours and close positions should have consistent ***,k-means clustering algorithm was first performed on both colours and coordinates of original annotations,and then assigned the same labels to points having similar colours with colour cluster centres and near coordinate cluster ***,the same annotations for pixels with similar colours within each kernel neighbourhood was set *** experiments on six benchmarks demonstrate that our method can significantly improve the performance and achieve the state-of-the-art results.

关键词： label inference salient object detection weak supervision

来源：评论

学校读者我要写书评

暂无评论

Video Colorization:A Survey

引用

Journal of computer science & Technology 2024年第3期39卷 487-508页

作者：彭中正杨艺新唐金辉潘金山 School of Computer Science and Engineering Nanjing University of Science and TechnologyNanjing210094China CCF IEEE ACM

Video colorization aims to add color to grayscale or monochrome *** existing methods have achieved substantial and noteworthy results in the field of image colorization,video colorization presents more formidable obstacles due to the additional necessity for temporal ***,there is rarely a systematic review of video colorization *** this paper,we aim to review existing state-of-the-art video colorization *** addition,maintaining spatial-temporal consistency is pivotal to the process of video *** gain deeper insight into the evolution of existing methods in terms of spatial-temporal consistency,we further review video colorization methods from a novel *** colorization methods can be categorized into four main categories:optical-flow based methods,scribble-based methods,exemplar-based methods,and fully automatic ***,optical-flow based methods rely heavily on accurate optical-flow estimation,scribble-based methods require extensive user interaction and modifications,exemplar-based methods face challenges in obtaining suitable reference images,and fully automatic methods often struggle to meet specific colorization *** also discuss the existing challenges and highlight several future research opportunities worth exploring.

关键词： video colorization deep convolutional neural network spatial-temporal consistency

来源：评论

学校读者我要写书评

暂无评论

MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity

引用

science China(Information sciences) 2024年第12期67卷 36-51页

作者： Yangzhou LIU Yue CAO Zhangwei GAO Weiyun WANG Zhe CHEN Wenhai WANG Hao TIAN Lewei LU Xizhou ZHU Tong LU Yu QIAO Jifeng DAI School of Computer Science Nanjing University School of Electronic Information and Electrical Engineering Shanghai Jiao Tong University Shanghai AI Laboratory School of Computer Science Fudan University Department of Information Engineering The Chinese University of Hong Kong SenseTime Research Department of Electronic Engineering Tsinghua University

Despite the effectiveness of vision-language supervised fine-tuning in enhancing the performance of vision large language models(VLLMs), existing visual instruction tuning datasets include the following limitations.(1) Instruction annotation quality: despite existing VLLMs exhibiting strong performance,instructions generated by those advanced VLLMs may still suffer from inaccuracies, such as hallucinations.(2) Instructions and image diversity: the limited range of instruction types and the lack of diversity in image data may impact the model's ability to generate diversified and closer to real-world scenarios outputs. To address these challenges, we construct a high-quality, diverse visual instruction tuning dataset MMInstruct,which consists of 973k instructions from 24 domains. There are four instruction types: judgment, multiplechoice, long visual question answering, and short visual question answering. To construct MMInstruct, we propose an instruction generation data engine that leverages GPT-4V, GPT-3.5, and manual correction. Our instruction generation engine enables semi-automatic, low-cost, and multi-domain instruction generation at 1/6 the cost of manual construction. Through extensive experiment validation and ablation experiments,we demonstrate that MMInstruct could significantly improve the performance of VLLMs, e.g., the model fine-tuning on MMInstruct achieves new state-of-the-art performance on 10 out of 12 benchmarks. The code and data shall be available at https://***/yuecao0119/MMInstruct.

关键词： instruction tuning multi-modal multi-domain dataset vision large language model

来源：评论

学校读者我要写书评

暂无评论

Highly transparent anti-reflection coating enhances the underwater efficiency and stability of perovskite solar modules

引用

Nano Research 2024年第9期17卷 8126-8133页

作者： Feng Qian Shihao Yuan Ting Zhang Lei Wang Xiaobo Li Hualin Zheng Qien Xu Zhi David Chen Shibin Li School of Optoelectronic Science and Engineering University of Electronic Science and Technology of ChinaChengdu 610054China Department of Electrical and Computer Engineering and Center for Nanoscale Science and Engineering University of KentuckyLexingtonKentucky 40506USA

Perovskite solar cells have shown great potential in the field of underwater solar cells due to their excellent optoelectronic properties;however,their underwater performance and stability still hinder their practical *** this research,a 1H,1H,2H,2H-heptadecafluorodecyl acrylate(HFDA)anti-reflection coating(ARC)was introduced as a high-transparent material for encapsulating perovskite solar modules(PSMs).Optical characterization results revealed that HFDA can effectively reduce reflection of light below 800 nm,aiding in the absorption of light within this wavelength range by underwater solar ***,a remarkable efficiency of 14.65%was achieved even at a water depth of 50 ***,the concentration of Pb^(2+)for HFDA-encapsulated film is significantly reduced from 186 to 16.5 ppb after being immersed in water for 347 ***,the encapsulated PSMs still remained above 80%of their initial efficiency after continuous underwater illumination for 400 ***,being exposed to air,the encapsulated PSMs maintained 94%of their original efficiency after 1000 h light *** highly transparent ARC shows great potentials in enhancing the stability of perovskite devices,applicable not only to underwater cells but also extendable to land-based photovoltaic devices.

关键词： underwater photovoltaics perovskite solar modules anti-reflection coating underwater stability

来源：评论

学校读者我要写书评

暂无评论

Road Surface Analysis through Machine Learning Techniques

引用

IEIE Transactions on Smart Processing and Computing 2024年第4期13卷 344-353页

作者： Singh, Prabhat Sharma, Shilpi Kamal, Ahmed E. Kumar, Sunil Department of Computer Science and Engineering Amity School of Engineering & Technology Uttar Pradesh Noida India Department of Electrical and Computer Engineering Iowa State University Ames United States Computer Science and Engineering Amity School of Engineering & Technology Uttar Pradesh Noida India

Roads are an important part of transporting goods and products from one place to another. In developing countries, the main challenge is to maintain road conditions regularly. Roads can deteriorate from time to time. Monitoring the conditions of the roads, which may degrade with time, is very difficult, resulting in a delay in transportation and damage to the vehicles moving on the roads. Poor road conditions cause road accidents. A model is being proposed to monitor the conditions of the road surface by smartphone sensors. Accelerometer, gyroscope, and GPS sensors are deployed in the mobile phones, which will help to collect data on the road conditions. After collecting the data about the road conditions, various machine learning approaches, such as supervised, multi-layered, and multiclass, are applied to data filtration. Road conditions are divided into three categories to achieve this methodology: potholes, deep traverse cracks, and smooth roads. This categorization helped in analyzing the road surface condition through smartphone sensors over all three axes instead of taking it over a single axis. Neural networks helped analyze data or road conditions more accurately than Decision Tree and SVM. Copyrights © 2024 The Institute of Electronics and Information Engineers.

关键词： Decision trees

来源：评论

学校读者我要写书评

暂无评论

MCWOA Scheduler:Modified Chimp-Whale Optimization Algorithm for Task Scheduling in Cloud Computing

引用

computers, Materials & Continua 2024年第2期78卷 2593-2616页

作者： Chirag Chandrashekar Pradeep Krishnadoss Vijayakumar Kedalu Poornachary Balasundaram Ananthakrishnan School of Computer Science and Engineering Vellore Institute of TechnologyChennai600127India Center for Cyber Physical Systems School of Computer Science and EngineeringVellore Institute of TechnologyChennai600127India

Cloud computing provides a diverse and adaptable resource pool over the internet,allowing users to tap into various resources as *** has been seen as a robust solution to relevant challenges.A significant delay can hamper the performance of IoT-enabled cloud ***,efficient task scheduling can lower the cloud infrastructure’s energy consumption,thus maximizing the service provider’s revenue by decreasing user job processing *** proposed Modified Chimp-Whale Optimization Algorithm called Modified Chimp-Whale Optimization Algorithm(MCWOA),combines elements of the Chimp Optimization Algorithm(COA)and the Whale Optimization Algorithm(WOA).To enhance MCWOA’s identification precision,the Sobol sequence is used in the population initialization phase,ensuring an even distribution of the population across the solution ***,the traditional MCWOA’s local search capabilities are augmented by incorporating the whale optimization algorithm’s bubble-net hunting and random search mechanisms into MCWOA’s position-updating *** study demonstrates the effectiveness of the proposed approach using a two-story rigid frame and a simply supported beam *** outcomes reveal that the new method outperforms the original MCWOA,especially in multi-damage detection *** excels in avoiding false positives and enhancing computational speed,making it an optimal choice for structural damage *** efficiency of the proposed MCWOA is assessed against metrics such as energy usage,computational expense,task duration,and *** simulated data indicates that the new MCWOA outpaces other methods across all *** study also references the Whale Optimization Algorithm(WOA),Chimp Algorithm(CA),Ant Lion Optimizer(ALO),Genetic Algorithm(GA)and Grey Wolf Optimizer(GWO).

关键词： Cloud computing scheduling chimp optimization algorithm whale optimization algorithm

来源：评论

学校读者我要写书评

暂无评论

Using DTL-MD with GANs and ResNet for Malicious Code Detection

Informatica (Slovenia)

引用

Informatica (Slovenia) 2025年第14期49卷 63-78页

作者： Li, Yiming Xie, Tao Mei, Dongdong Department of Computer Science and Technology School of Computer Science and Engineering Ningxia Institute of Science and Technology Shizuishan753000 China

This study proposes a malicious code detection model DTL-MD based on deep transfer learning, which aims to improve the detection accuracy of existing methods in complex malicious code and data scarcity. In the feature extraction process, the weighted sum method of GIST and LBP features is used to combine the advantages of the two features. Online transfer learning is used to reduce the data distribution difference between the target domain and the source domain. The model uses ResNet50V2 as the backbone network and combines SimAM to enhance the feature extraction and representation capabilities. In addition, in order to further improve the robustness of detection, GAN is used to generate malicious code variants and expand the training data set. In the experiment, the public CICIDS 2017 data set is used for model training and testing. The performance test results show that when the threshold is 0.7, the accuracy of DTL-MD is 95.8% and the F1 score is 0.93. In a performance test involving 30,000 samples, the throughput of the DTL-MD model under Trojans, viruses, worms, and adware is 11, 12, 11, and 12 tasks/s, respectively, and the inference time is 211, 225, 239, and 234 samples/s, respectively. Compared with GAN, DTL-MD increases the throughput by about 10% and the inference speed by about 15%. The research aims to provide new ideas for improving the intelligence and automation level of malicious code detection technology, which has certain application value and practical significance. © 2025 Slovene Society Informatika. All rights reserved.

关键词： Feature Selection

来源：评论

学校读者我要写书评

暂无评论

Defending Federated Learning System from Poisoning Attacks via Efficient Unlearning

引用

computers, Materials & Continua 2025年第4期83卷 239-258页

作者： Long Cai Ke Gu Jiaqi Lei School of Computer and Communication Engineering Changsha University of Science and TechnologyChangsha410114China

Large-scale neural networks-based federated learning(FL)has gained public recognition for its effective capabilities in distributed ***,the open system architecture inherent to federated learning systems raises concerns regarding their vulnerability to potential *** attacks turn into a major menace to federated learning on account of their concealed property and potent destructive *** altering the local model during routine machine learning training,attackers can easily contaminate the global *** detection and aggregation solutions mitigate certain threats,but they are still insufficient to completely eliminate the influence generated by ***,federated unlearning that can remove unreliable models while maintaining the accuracy of the global model has become a *** some existing federated unlearning approaches are rather difficult to be applied in large neural network models because of their high computational ***,we propose SlideFU,an efficient anti-poisoning attack federated unlearning *** primary concept of SlideFU is to employ sliding window to construct the training process,where all operations are confined within the *** design a malicious detection scheme based on principal component analysis(PCA),which calculates the trust factors between compressed models in a low-cost way to eliminate unreliable *** confirming that the global model is under attack,the system activates the federated unlearning process,calibrates the gradients based on the updated direction of the calibration *** on two public datasets demonstrate that our scheme can recover a robust model with extremely high efficiency.

关键词： Federated learning malicious client detection model recovery machine unlearning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：