Aggregating information from multiple views is essential for accurately identifying similar objects. Nevertheless, existing datasets have limitations that hinder the development of practical multi-view object classification methods for real-world scenarios: they contain synthetic and coarse-grained objects, and they lack a validation split for standard hyperparameter tuning. This paper proposes a new dataset, MVP-N (multi-view, retail products, label noise), which contains 16k real captured views and 9k multi-view sets collected from 44 retail products. In MVP-N, each view is annotated with a human-perceived information quantity (HPIQ) for analyzing how views are utilized in information aggregation. Moreover, the fine-grained categorization of objects introduces inter-class view similarity and intra-class view variance, enabling research on learning from noisy labels in multi-view images. Finally, a new soft label scheme, HS-HPIQ, is proposed that accounts for the hidden stratification phenomenon in multi-view images and achieves superior performance. To assess the effectiveness of MVP-N and the proposed HS-HPIQ, this study reviews 50 recent multi-view-based methods regarding their practicality in real-world scenarios, and benchmarks six feature aggregation methods and twelve soft label methods on MVP-N with in-depth analysis. The dataset and code are publicly available at https://***/SMNUResearch/MVP-N.
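The abstract contrasts soft label schemes with standard one-hot training. As a minimal sketch of the general idea (not the paper's HS-HPIQ formulation, whose exact weighting is not given here), training against a soft target distribution simply replaces the one-hot vector inside the cross-entropy:

```python
import numpy as np

def soft_label_cross_entropy(logits, soft_targets):
    """Cross-entropy against a soft target distribution.

    Generic form only: how HS-HPIQ actually distributes target mass
    across classes is an assumption left out of this sketch.
    """
    # Numerically stable log-softmax.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -(soft_targets * log_probs).sum(axis=1).mean()

# A one-hot target recovers the standard cross-entropy; a soft target
# shares some probability mass with visually similar classes.
logits = np.array([[2.0, 0.5, 0.1]])
hard = np.array([[1.0, 0.0, 0.0]])
soft = np.array([[0.8, 0.15, 0.05]])
```

With a confident correct prediction, the soft target yields a larger loss than the one-hot target, since mass assigned to other classes is penalized by their lower log-probabilities.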
Existing multi-view object classification algorithms usually rely on sufficient labeled multi-view objects, which substantially restricts their scalability to novel classes with few annotated training samples in real-world applications. Aiming to go beyond these limitations, we explore a novel yet challenging task, few-shot multi-view object classification (FS-MVOC), which expects the network to build its classification ability efficiently from limited labeled multi-view objects. To this end, we design a dual augmentation network (DANet) to provide excellent performance on the under-explored FS-MVOC task. On the one hand, we employ an attention-guided multi-view representation augmentation (AMRA) strategy to help the model focus on salient features and suppress unnecessary ones across the multiple views of multi-view objects, resulting in more discriminative multi-view representations. On the other hand, during the meta-training stage, we adopt the category prototype augmentation (CPA) strategy to improve the class-representativeness of each prototype and increase the inter-prototype difference by injecting Gaussian noise in the deep feature space. Extensive experiments on the benchmark datasets (Meta-ModelNet and Meta-ShapeNet) indicate the effectiveness and robustness of DANet.
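The CPA strategy described above is concrete enough to sketch: in prototype-based few-shot learning, each class prototype is the mean of its support features, and CPA perturbs these prototypes with Gaussian noise in feature space. The noise scale and isotropic noise model below are illustrative assumptions, not values from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

def class_prototypes(features, labels, num_classes):
    """Standard prototype computation: mean support feature per class."""
    return np.stack([features[labels == c].mean(axis=0)
                     for c in range(num_classes)])

def augment_prototypes(prototypes, sigma=0.1):
    """CPA-style perturbation: inject Gaussian noise in feature space.

    sigma=0.1 and isotropic noise are assumptions for illustration.
    """
    return prototypes + rng.normal(0.0, sigma, size=prototypes.shape)

# Toy episode: 6 support features, 2 classes, 4-d embeddings.
features = rng.normal(size=(6, 4))
labels = np.array([0, 0, 0, 1, 1, 1])
protos = class_prototypes(features, labels, num_classes=2)
noisy = augment_prototypes(protos, sigma=0.1)
```

Training against the perturbed prototypes exposes the classifier to slightly shifted class centers, which is one way to read the claimed gain in class-representativeness and inter-prototype difference.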
ISBN:
(print) 9798350365474
3D object classification has emerged as a practical technology with applications in various domains, such as medical image analysis, automated driving, intelligent robots, and crowd surveillance. Among the different approaches, multi-view representations for 3D object classification have shown the most promising results, achieving state-of-the-art performance. However, there are certain limitations in current view-based 3D object classification methods. One observation is that using all captured views for classification can confuse the classifier and lead to misleading results for certain classes. Additionally, some views may contain more discriminative information for object classification than others. These observations motivate the development of smarter and more efficient selective multi-view classification models. In this work, we propose a Selective Multi-View Deep Model that extracts multi-view images from 3D data representations and selects the most influential view by assigning importance scores using the cosine similarity method based on visual features detected by a pre-trained CNN. The proposed method is evaluated on the ModelNet40 dataset for the task of 3D classification. The results demonstrate that the proposed model achieves an overall accuracy of 88.13% using only a single view when employing a shading technique for rendering the views, pre-trained ResNet152 as the backbone CNN for feature extraction, and a Fully Connected Network (FCN) as the classifier.
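The view-selection step described above can be sketched with one plausible reading of the scoring rule: rank each view by its mean cosine similarity to the other views' CNN features and keep the highest-scoring one. The exact scoring rule in the paper may differ, and the 2-d feature vectors below stand in for real CNN embeddings:

```python
import numpy as np

def select_view(view_features):
    """Pick the most influential view via cosine-similarity scores.

    view_features: (num_views, feature_dim) array of per-view CNN
    features (hypothetical stand-in for a pre-trained backbone's output).
    Returns the index of the best view and the per-view scores.
    """
    # L2-normalise each view's feature vector.
    norms = np.linalg.norm(view_features, axis=1, keepdims=True)
    unit = view_features / np.clip(norms, 1e-12, None)
    sim = unit @ unit.T                 # pairwise cosine similarities
    np.fill_diagonal(sim, 0.0)          # ignore self-similarity
    scores = sim.sum(axis=1) / (len(unit) - 1)
    return int(np.argmax(scores)), scores

# Toy features: views 0 and 1 agree, view 2 is an outlier, so the
# middle view (closest to both) scores highest.
views = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
best, scores = select_view(views)
```

Selecting by agreement with the other views is one way to discard the confusing views the abstract mentions while keeping a single representative view for the classifier.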