Zero-shot learning aims to identify unseen (novel) objects using only labeled samples from seen (base) classes. Existing methods usually learn visual-semantic interactions or generate the absent visual features of unseen classes to compensate for the data-imbalance problem. However, these methods ignore the representation quality of visual-semantic pairs, resulting in unsatisfactory alignment and prediction bias. To tackle these issues, we propose a Hierarchical Contrastive Representation learning paradigm, termed HCR, which fully exploits the model's representation capability and discriminative information. Specifically, we first propose a contrastive embedding that preserves not only high-quality representations but also sufficiently discriminative information from class-level and instance-level supervision. Then, we introduce a regressor guided by valuable prior knowledge to conduct a more desirable visual-semantic alignment for unseen classes. A pluggable calibrator is also aggregated to further alleviate prediction bias in the contrastive embedding. Extensive experiments show that the proposed HCR significantly outperforms the state of the art on popular benchmarks under both the ZSL and the more challenging GZSL settings.
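The abstract does not give the loss formulation, but the idea of combining class-level and instance-level supervision in one contrastive embedding can be illustrated with a minimal sketch. The function names, the temperature `tau`, and the blending weight `alpha` below are illustrative assumptions, not the paper's actual design:

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def contrastive_loss(anchor, positives, negatives, tau=0.1):
    """InfoNCE-style loss: pull positives toward the anchor, push negatives away."""
    pos = [math.exp(cosine(anchor, p) / tau) for p in positives]
    neg = [math.exp(cosine(anchor, n) / tau) for n in negatives]
    denom = sum(pos) + sum(neg)
    # Average over positives, as in supervised contrastive learning.
    return -sum(math.log(p / denom) for p in pos) / len(pos)

def hierarchical_loss(anchor, inst_pos, cls_pos, negs, alpha=0.5, tau=0.1):
    """Blend instance-level and class-level supervision with weight alpha."""
    return (alpha * contrastive_loss(anchor, inst_pos, negs, tau)
            + (1 - alpha) * contrastive_loss(anchor, cls_pos, negs, tau))
```

Here the instance-level term would contrast an anchor against views of the same image, while the class-level term treats all same-class samples as positives; a real implementation would operate on batched GPU tensors.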
Autonomous driving relies on trustworthy visual recognition of surrounding objects. Few-shot image classification is used in autonomous driving to help recognize objects that are rarely seen. Successful embedding and metric-learning approaches to this task normally learn a feature-comparison framework between an unseen image and the labeled images. However, these approaches often suffer from ambiguous feature embedding because they tend to ignore important local visual and semantic information when extracting intra-class common features from the images. In this paper, we introduce a Semantic-Aligned Attention (SAA) mechanism that refines feature embedding and can be applied to most existing embedding and metric-learning approaches. The mechanism highlights pivotal local visual information with an attention mechanism and aligns the attentive map with semantic information to refine the extracted features. When the proposed mechanism is incorporated into the prototypical network, evaluation results show competitive improvements in both few-shot and zero-shot classification tasks on various benchmark datasets.
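As a rough illustration of aligning attention with semantic information, the sketch below weights each local feature by its agreement with a class semantic vector before pooling. The function names and the plain dot-product scoring are assumptions for illustration, not the paper's exact SAA formulation:

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def semantic_aligned_pool(local_feats, semantic_vec):
    """Weight each local feature by its alignment (dot product) with the
    class semantic vector, then pool into a single refined embedding."""
    scores = [sum(f_i * s_i for f_i, s_i in zip(f, semantic_vec))
              for f in local_feats]
    weights = softmax(scores)
    dim = len(local_feats[0])
    return [sum(w * f[d] for w, f in zip(weights, local_feats))
            for d in range(dim)]
```

In a prototypical-network setting, the refined embeddings would then feed into the usual distance-to-prototype classification step.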
The key challenge of zero-shot learning (ZSL) is to sufficiently disentangle each latent attribute from the class-level semantic annotations of images, thereby achieving a desirable semantic transfer to unseen classes with the disentangled attributes. However, most existing studies tackle the ZSL task with a strict class-level alignment strategy that may yield insufficient disentanglement: (1) this strategy simply aligns the holistic visual feature with its associated class-level semantic vector for each image; (2) the class-level semantic vectors have limited diversity and complex compositions of attributes. To address these issues, we propose an incorporating attribute-level aligned comparative network, i.e., IAAC-net, which extends the alignment strategy of ZSL to the attribute level. IAAC-net aims to establish diversified attribute-level and refined class-level alignments to facilitate attribute disentanglement and simultaneously improve zero-shot generalization. By further proposing a confusion-aware loss, the model is forced to rectify the disentanglement of indistinguishable attributes, leading to more accurate attribute disentanglement. The proposed IAAC-net yields significant improvements over strong baselines, setting new state-of-the-art performance on three popular and challenging benchmarks, i.e., CUB, SUN, and AWA2.
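The abstract does not spell out the attribute-level alignment or the confusion-aware loss, but the two ideas can be sketched minimally: score each disentangled attribute feature against its own semantic entry rather than one holistic class vector, and up-weight attributes whose true-class score barely beats the most confusable class. Both functions and the decay rate `gamma` are hypothetical illustrations, not the paper's formulation:

```python
import math

def attribute_alignment(attr_feats, attr_semantics):
    """Score each attribute feature against its own semantic entry
    (attribute-level alignment), returning one score per attribute."""
    return [sum(a * b for a, b in zip(f, s))
            for f, s in zip(attr_feats, attr_semantics)]

def confusion_aware_weights(scores_true, scores_confused, gamma=1.0):
    """Up-weight attributes with a small margin between the true class and
    its most confusable class: smaller margin -> larger weight."""
    return [math.exp(-gamma * (t - c))
            for t, c in zip(scores_true, scores_confused)]
```

The weights could then rescale a per-attribute alignment loss so that indistinguishable attributes dominate the gradient.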
Image categorisation, an active yet challenging research topic in computer vision, is the task of classifying images according to their semantic content. Recently, fine-grained object categorisation has attracted wide attention and remains difficult owing to feature inconsistency caused by smaller inter-class and larger intra-class variation, as well as widely varying poses. Most existing frameworks have focused on exploiting a more discriminative image representation or developing a more robust classification framework to mitigate these problems. Attention has recently been paid to discovering the dependency across fine-grained class labels using Convolutional Neural Networks. Encouraged by the success of semantic label embedding in discovering the correlation among fine-grained class labels, this paper exploits the misalignment between the visual feature space and the semantic label-embedding space and incorporates it as privileged information into a cost-sensitive learning framework. By capturing both the variation of the image feature representation and the label correlation in the semantic label-embedding space, such visual-semantic misalignment can be employed to reflect the importance of instances, which is more informative than conventional cost sensitivities. Experimental results demonstrate the effectiveness of the proposed framework on public fine-grained benchmarks, achieving performance superior to the state of the art.
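One plausible way to turn visual-semantic misalignment into instance importance, as the abstract describes, is to measure each instance's distance between its projected visual feature and its class's label embedding and convert that into a cost weight. The functions and the scaling factor `beta` below are illustrative assumptions, not the paper's actual scheme:

```python
import math

def misalignment(visual_proj, label_embed):
    """Euclidean distance between a projected visual feature and the
    semantic label embedding of its class."""
    return math.sqrt(sum((v - e) ** 2 for v, e in zip(visual_proj, label_embed)))

def instance_costs(visual_projs, label_embeds, beta=1.0):
    """Turn per-instance misalignment into an importance weight: badly
    aligned instances receive larger costs in the learning objective."""
    dists = [misalignment(v, e) for v, e in zip(visual_projs, label_embeds)]
    return [1.0 + beta * d for d in dists]
```

These costs would then rescale the per-instance terms of a classification loss, so hard (misaligned) examples contribute more than well-aligned ones.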