Co-saliency detection within a single image is a common vision problem that has not yet been well addressed. Existing methods often use a bottom-up strategy to infer co-saliency in an image: salient regions are first detected using visual primitives such as color and shape, and are then grouped and merged into a co-saliency map. However, human vision perceives co-saliency through a complex combination of bottom-up and top-down strategies. To address this problem, this study proposes a novel end-to-end trainable network comprising a backbone net and two branch nets. The backbone net uses ground-truth masks as top-down guidance for saliency prediction, whereas the two branch nets construct triplet proposals for regional feature mapping and clustering, which drives the network to be bottom-up sensitive to co-salient regions. We construct a new dataset of 2,019 natural images, each containing co-saliency, to evaluate the proposed method. Experimental results show that the proposed method achieves state-of-the-art accuracy with a running speed of 28 fps.
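The branch nets above drive clustering of regional features via triplet proposals. A standard way to express such an objective is a margin-based triplet loss, sketched below with hypothetical region embeddings (this is a generic formulation, not the paper's actual branch-net code; the margin value and embeddings are assumptions for illustration):

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.3):
    """Hinge-style triplet loss: pull co-salient region features
    (anchor, positive) together and push a background region
    (negative) away by at least `margin`."""
    d_pos = np.linalg.norm(anchor - positive)
    d_neg = np.linalg.norm(anchor - negative)
    return max(d_pos - d_neg + margin, 0.0)

# Two co-salient region embeddings vs. a background embedding (toy data).
a = np.array([1.0, 0.0])
p = np.array([0.9, 0.1])
n = np.array([-1.0, 0.2])
loss = triplet_loss(a, p, n)   # well-separated triplet -> zero loss
```

Minimizing this loss over many proposals makes embeddings of co-salient regions cluster together, which is the bottom-up sensitivity the branch nets aim for.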
Monitoring respiration is an important component of personal health management. While recent developments in Wi-Fi sensing offer a potential tool to achieve contact-free respiration monitoring, existing proposals for Wi-Fi-based multi-person respiration sensing mainly extract each individual's respiration rate in the frequency domain using the fast Fourier transform (FFT) or multiple signal classification (MUSIC) method, leading to the following limitations: 1) they are largely ineffective in recovering the breaths of multiple persons from received mixed signals and in differentiating individual breaths, 2) they are unable to acquire the time-varying respiration pattern when the subject has a respiratory abnormality, such as apnea or a changing respiration rate, and 3) they have difficulty identifying the real number of subjects when multiple subjects share the same or similar respiration rates. To address these issues, we propose Wi-Fi-enabled MUlti-person SEnsing (WiMUSE) as a signal processing pipeline to perform respiration monitoring for multiple persons. Specifically, as a pioneering time-domain approach, WiMUSE models the mixed signals of multi-person respiration as a linear superposition of multiple waveforms, so as to form a blind source separation (BSS) problem. The effective separation of the signal sources (respiratory waveforms) further enables us to quantify the differences in the respiratory waveform patterns of multiple subjects, and thus to identify the number of subjects along with their respective respiration patterns. We implement WiMUSE on commodity Wi-Fi devices and conduct extensive experiments to demonstrate that, compared with approaches based on the FFT or MUSIC method, the 90th-percentile error of respiration rate can be reduced by more than 60%.
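The linear-superposition model behind the BSS formulation can be sketched numerically: two respiration waveforms with different rates are mixed by an (unknown) channel matrix, and unmixing recovers them. The waveform rates, sampling rate, and mixing matrix below are invented for illustration, and the unmixing uses a known matrix via pseudo-inverse, whereas real BSS (as in WiMUSE) must estimate the mixing blindly:

```python
import numpy as np

# Two respiration "sources" with different rates, 60 s at 20 Hz.
t = np.linspace(0, 60, 1200)
s1 = np.sin(2 * np.pi * 0.25 * t)          # 15 breaths/min
s2 = np.sin(2 * np.pi * 0.33 * t)          # ~20 breaths/min
S = np.vstack([s1, s2])                    # true source waveforms

A = np.array([[0.8, 0.4],                  # hypothetical channel mixing
              [0.3, 0.9]])
X = A @ S                                  # mixed signals at the receiver

# With a mixing estimate in hand, unmixing is a pseudo-inverse;
# blind methods (e.g., ICA) estimate A from X alone.
S_hat = np.linalg.pinv(A) @ X
err = np.max(np.abs(S_hat - S))            # recovery error
```

Because each recovered source is a full time-domain waveform, time-varying patterns such as apnea remain visible, unlike a single FFT-derived rate.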
With the enhancement of data collection capabilities, massive streaming data have been accumulated in numerous application scenarios. In particular, the issue of classifying data streams based on mobile sensors can be formalized as a multi-task multi-view learning problem, with a specific task comprising multiple views with shared features collected from multiple sensors. Existing incremental learning methods are often single-task single-view and cannot learn shared representations between relevant tasks and views. An adaptive multi-task multi-view incremental learning framework for data stream classification called MTMVIS is proposed to address the above challenges, utilizing the idea of multi-task multi-view learning. Specifically, the attention mechanism is first used to align sensor data across different views. In addition, MTMVIS uses adaptive Fisher regularization from the perspective of multi-task multi-view learning to overcome catastrophic forgetting in incremental learning. Experiments on two different datasets reveal that the proposed framework outperforms state-of-the-art baseline methods.
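Fisher regularization against catastrophic forgetting is commonly written as a quadratic penalty that anchors parameters important to earlier tasks. The sketch below shows the generic elastic-weight-consolidation-style form with made-up parameter and Fisher values; MTMVIS's adaptive variant is more elaborate:

```python
import numpy as np

def fisher_penalty(params, old_params, fisher, lam=1.0):
    """Quadratic penalty anchoring each weight to its value after the
    previous task, scaled by its (diagonal) Fisher importance."""
    return 0.5 * lam * np.sum(fisher * (params - old_params) ** 2)

old = np.array([1.0, -0.5])            # weights after the previous task
fisher = np.array([10.0, 0.1])         # first weight matters far more

# Moving the important weight is penalized much more heavily
# than moving the unimportant one by the same amount.
p_move_important = fisher_penalty(np.array([1.5, -0.5]), old, fisher)
p_move_minor     = fisher_penalty(np.array([1.0,  0.0]), old, fisher)
```

During incremental training, this penalty is added to the new task's loss, so updates flow mostly through weights the old tasks did not rely on.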
Long-tailed multi-label text classification aims to identify a subset of relevant labels from a large candidate label set, where the training datasets usually follow long-tailed label distributions. Many previous studies have treated head and tail labels equally, resulting in unsatisfactory performance for identifying tail labels. To address this issue, this paper proposes a novel learning method that combines arbitrary models with two steps. The first step is the “diverse ensemble”, which encourages diverse predictions among multiple shallow classifiers, particularly on tail labels, and can improve the generalization of tail labels. The second is the “error correction”, which takes advantage of accurate predictions on head labels by the base model and approximates its residual errors for tail labels. Thus, it enables the “diverse ensemble” to focus on optimizing the tail label performance. This overall procedure is called residual diverse ensemble (RDE). RDE is implemented via a single-hidden-layer perceptron and can scale up to hundreds of thousands of labels. We empirically show that RDE consistently improves many existing models with considerable performance gains on benchmark datasets, especially with respect to propensity-scored evaluation metrics. Moreover, RDE converges in less than 30 training epochs without increasing the computational overhead.
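The "error correction" idea, in its simplest form, is residual fitting: a second model approximates the base model's errors on tail labels and its output is added back. The toy numbers below (label scores, the 0.8 residual-recovery factor) are invented to show the mechanics, not taken from RDE:

```python
import numpy as np

# One example with four labels; the last is a "tail" label on which
# the base model is weak.
y_true = np.array([1.0, 1.0, 0.0, 1.0])
base   = np.array([0.9, 0.8, 0.1, 0.2])     # good on head, poor on tail

residual  = y_true - base                   # errors the corrector must fit
corrector = 0.8 * residual                  # imperfect residual estimate
final     = base + corrector                # corrected scores

tail_err_before = abs(y_true[3] - base[3])
tail_err_after  = abs(y_true[3] - final[3])
```

Because the corrector only needs to model the base model's residuals, the ensemble's capacity is spent where the base model fails, i.e., on tail labels.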
Since the preparation of labeled data for training semantic segmentation networks of point clouds is a time-consuming process, weakly supervised approaches have been introduced to learn from only a small fraction of data. These methods are typically based on learning with contrastive losses while automatically deriving per-point pseudo-labels from a sparse set of user-annotated labels. In this paper, our key observation is that the selection of which samples to annotate is as important as how these samples are used for training. Thus, we introduce a method for weakly supervised segmentation of 3D scenes that combines self-training with active learning. Active learning selects points for annotation that are likely to result in improvements to the trained model, while self-training makes efficient use of the user-provided labels for learning the model. We demonstrate that our approach leads to an effective method that provides improvements in scene segmentation over previous work and baselines, while requiring only a few user annotations.
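A minimal form of the active-learning step is uncertainty sampling: rank unlabeled points by predictive entropy and send the most uncertain ones to the annotator. The sketch below uses that generic criterion with made-up softmax outputs; the paper's actual selection criterion may combine additional signals:

```python
import numpy as np

def entropy(p):
    """Per-point predictive entropy; higher means more uncertain."""
    p = np.clip(p, 1e-12, 1.0)
    return -np.sum(p * np.log(p), axis=-1)

def select_points(probs, k):
    """Pick the k most uncertain points as annotation candidates."""
    return np.argsort(-entropy(probs))[:k]

# Softmax outputs for 4 points over 3 classes (toy values).
probs = np.array([
    [0.98, 0.01, 0.01],   # confident -> low priority
    [0.34, 0.33, 0.33],   # nearly uniform -> highest priority
    [0.70, 0.20, 0.10],
    [0.50, 0.49, 0.01],
])
picked = select_points(probs, 2)
```

The self-training half of the loop then retrains on the newly annotated points plus high-confidence pseudo-labels, and the cycle repeats.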
Brain tumor classification is crucial for personalized treatment planning. Although deep learning-based Artificial Intelligence (AI) models can automatically analyze tumor images, fine details of small tumor regions may be overlooked during global feature extraction. Therefore, we propose a brain tumor Magnetic Resonance Imaging (MRI) classification model based on a global-local parallel dual-branch structure. The global branch employs ResNet50 with Multi-Head Self-Attention (MHSA) to capture global contextual information from whole brain images, while the local branch utilizes VGG16 to extract fine-grained features from segmented brain tumor regions. The features from both branches are processed through a designed attention-enhanced feature fusion module to filter and integrate important features. Additionally, to address sample imbalance in the dataset, we introduce a category attention block to improve the recognition of minority classes. Experimental results indicate that our method achieved a classification accuracy of 98.04% and a micro-average Area Under the Curve (AUC) of 0.989 in the classification of three types of brain tumors, surpassing several existing pre-trained Convolutional Neural Network (CNN) models. Furthermore, feature interpretability analysis validated the effectiveness of the proposed model. This suggests that the method holds significant potential for brain tumor image classification.
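One standard baseline for the sample-imbalance problem mentioned above is inverse-frequency class weighting of the loss. The sketch below shows that common heuristic with hypothetical per-class image counts; it is not the paper's category attention block, which learns the re-weighting instead:

```python
import numpy as np

def class_weights(counts):
    """Inverse-frequency weights: w_c = N / (C * n_c), so minority
    classes contribute proportionally more to the training loss."""
    counts = np.asarray(counts, dtype=float)
    return counts.sum() / (len(counts) * counts)

# Hypothetical per-class counts: glioma, meningioma, pituitary.
w = class_weights([900, 300, 300])
```

These weights would multiply each sample's loss term by the weight of its class, boosting the gradient signal from minority tumor types.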
Recently, weak supervision has received growing attention in the field of salient object detection due to the convenience of annotation. However, there is a large performance gap between weakly supervised and fully supervised salient object detectors because scribble annotations can only provide very limited foreground/background information. Therefore, an intuitive idea is to infer annotations that cover more complete object and background regions for training. To this end, a label inference strategy is proposed based on the assumption that pixels with similar colours and close positions should have consistent labels. Specifically, a k-means clustering algorithm is first performed on both the colours and the coordinates of the original annotations, and the same labels are then assigned to points that have colours similar to the colour cluster centres and lie near the coordinate cluster centres. Moreover, the same annotations are set for pixels with similar colours within each kernel neighbourhood. Extensive experiments on six benchmarks demonstrate that our method can significantly improve performance and achieve state-of-the-art results.
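The core assumption (similar colour + close position ⇒ same label) can be sketched directly. Below, each annotated point acts as its own cluster centre for simplicity, whereas the paper first compresses annotations with k-means; the colour/position thresholds and the toy pixels are invented for illustration:

```python
import numpy as np

def propagate_labels(ann_xy, ann_rgb, ann_lab, pix_xy, pix_rgb,
                     col_thr=30.0, pos_thr=40.0):
    """Assign an annotation's label to each unlabeled pixel that is both
    similar in colour and close in position to it; -1 = still unlabeled."""
    labels = np.full(len(pix_xy), -1)
    for i, (xy, rgb) in enumerate(zip(pix_xy, pix_rgb)):
        d_col = np.linalg.norm(ann_rgb - rgb, axis=1)
        d_pos = np.linalg.norm(ann_xy - xy, axis=1)
        cand = np.flatnonzero((d_col < col_thr) & (d_pos < pos_thr))
        if cand.size:
            labels[i] = ann_lab[cand[np.argmin(d_col[cand] + d_pos[cand])]]
    return labels

# Two scribble annotations: a red foreground point and a blue background point.
ann_xy  = np.array([[10.0, 10.0], [100.0, 100.0]])
ann_rgb = np.array([[200.0, 30.0, 30.0], [30.0, 30.0, 200.0]])
ann_lab = np.array([1, 0])                 # 1 = foreground, 0 = background

# Three unlabeled pixels: red-ish near the scribble, blue-ish near the
# background, and a green pixel matching neither.
pix_xy  = np.array([[15.0, 12.0], [95.0, 105.0], [50.0, 50.0]])
pix_rgb = np.array([[190.0, 40.0, 35.0],
                    [40.0, 25.0, 190.0],
                    [30.0, 200.0, 30.0]])
inferred = propagate_labels(ann_xy, ann_rgb, ann_lab, pix_xy, pix_rgb)
```

Pixels matching no annotation in colour and position stay unlabeled, which keeps the inferred supervision conservative.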
Prior research in video object segmentation (VOS) predominantly relies on videos with dense annotations. However, obtaining pixel-level annotations is both costly and time-intensive. In this work, we highlight the pot...
Digital twinning enables manufacturers to create digital representations of physical entities, thus implementing virtual simulations for product development. Existing efforts in digital twinning neglect the decisive consumer feedback in product development stages, failing to close the gap between physical and digital spaces. This work mines real-world consumer feedback through social media topics, which is significant to product development. We specifically analyze the prevalent time of a product topic, giving insight into both consumer attention and the widely discussed time of a topic. The primary body of current studies regards prevalent time prediction as an accompanying task or assumes the existence of a preset distribution. However, these proposed solutions are either biased in their focused objectives and underlying patterns or weak in their capability to generalize to diverse topics. To this end, this work combines deep learning and survival analysis to predict the prevalent time of topics. We propose a specialized deep survival model consisting of two modules. The first module enriches the input covariates by incorporating latent features of the time-varying text, and the second module fully captures the temporal pattern of a topic with a recurrent network structure. Moreover, a specific loss function, different from those of regular survival models, is proposed to achieve more reasonable predictions. Extensive experiments on real-world datasets demonstrate that our model significantly outperforms the state-of-the-art methods.
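To make the survival-analysis framing concrete, a discrete-time survival model treats a topic as "surviving" (staying prevalent) each step with probability 1 − h_t and ending at the observed step with hazard h_t; training minimizes the negative log-likelihood. This is the generic textbook loss with toy hazards, not the paper's specialized loss function:

```python
import numpy as np

def survival_nll(hazards, event_step, observed=True):
    """Negative log-likelihood of a discrete-time survival model.
    The topic survives steps 0..event_step-1 with prob (1 - h_t);
    if the end is observed, it occurs at event_step with prob h_t.
    Censored samples only contribute the survival terms."""
    h = np.asarray(hazards, dtype=float)
    ll = np.sum(np.log(1.0 - h[:event_step]))
    if observed:
        ll += np.log(h[event_step])
    return -ll

# Toy per-step hazards predicted for one topic.
h = [0.1, 0.2, 0.6, 0.8]
nll_observed = survival_nll(h, 2)                  # topic faded at step 2
nll_censored = survival_nll(h, 2, observed=False)  # still prevalent at step 2
```

Handling censored topics (still prevalent when observation stops) is exactly what distinguishes survival losses from plain regression on the prevalent time.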
Emotion recognition plays a crucial role in various fields and is a key task in natural language processing (NLP). The objective is to identify and interpret emotional expressions in text. However, traditional emotion recognition approaches often struggle in few-shot cross-domain scenarios due to their limited capacity to generalize semantic features across different domains. Additionally, these methods face challenges in accurately capturing complex emotional states, particularly those that are subtle or implicit. To overcome these limitations, we introduce a novel approach called Dual-Task Contrastive Meta-Learning (DTCML). This method combines meta-learning and contrastive learning to improve emotion recognition. Meta-learning enhances the model’s ability to generalize to new emotional tasks, while instance contrastive learning further refines the model by distinguishing unique features within each category, enabling it to better differentiate complex emotional expressions. Prototype contrastive learning, in turn, helps the model address the semantic complexity of emotions across different domains, enabling it to learn fine-grained emotion expressions. By leveraging dual tasks, DTCML learns from two domains simultaneously; this encourages the model to learn more diverse and generalizable emotion features, improving its cross-domain adaptability, robustness, and generalization ability. We evaluated the performance of DTCML across four cross-domain settings, and the results show that our method outperforms the best baseline by 5.88%, 12.04%, 8.49%, and 8.40% in terms of accuracy.
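The prototype contrastive component can be sketched in its generic form: a query embedding is scored against per-class prototypes, and the loss pulls it toward its own class prototype while pushing the others away. The prototypes, query, and temperature below are invented for illustration and this is the standard formulation, not DTCML's exact loss:

```python
import numpy as np

def proto_contrastive_loss(query, prototypes, target, tau=0.5):
    """Cross-entropy over cosine similarities to class prototypes,
    scaled by temperature tau; lower loss = closer to own prototype."""
    q = query / np.linalg.norm(query)
    p = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    sims = (p @ q) / tau                     # cosine similarity logits
    logp = sims - np.log(np.sum(np.exp(sims)))
    return -logp[target]

# Hypothetical prototypes for two emotion classes, e.g. "joy" vs "anger".
protos = np.array([[1.0, 0.0],
                   [0.0, 1.0]])
q = np.array([0.9, 0.1])                     # embedding near the first class
loss_correct = proto_contrastive_loss(q, protos, target=0)
loss_wrong   = proto_contrastive_loss(q, protos, target=1)
```

Instance contrastive learning uses the same template but contrasts individual examples rather than prototypes, sharpening within-class distinctions.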