检索结果-内蒙古大学图书馆

Bag of shape descriptor using unsupervised deep learning for non-rigid shape recognition

SIGNAL PROCESSING-IMAGE COMMUNICATION 2021年 96卷 116297-116297页

作者： Yang, Linjie Wang, Luping Su, Yijing Gao, Yin Sun Yat Sen Univ Sch Elect & Commun Engn Guangzhou 510006 Peoples R China Chinese Acad Sci Quanzhou Inst Equipment Mfg Quanzhou 362200 Peoples R China

Highly discriminative feature expression for non-rigid shape recognition is an important and challenging task, which requires both abstract and robust shape descriptors. However, the majority of existing low-level descriptors are designed via hand-crafted, which are sensitive to local changes and larger deformation. To address this issue, this paper proposes a bag of shape descriptor based on unsupervised deep learning and Bag of Words (BoW) for shape recognition. Different from existing pipelines, our method is specially designed to learn high-level and hierarchical shape features from multi-scale context structures. It effectively overcomes obstacles, such as irregular topology, orientation ambiguity, and rigid or non-rigid transformation in the hierarchical learning of contour fragments. Specifically, by adopting an improved decomposing strategy, the shape can be decomposed to a series of valuable contour fragments, results in local to global feature learning. An unsupervised learning framework is also applied to the contour fragment for its feature expression based on the context structure and SSAE (Stack Sparse Auto Encode). In the process of shape representation, a high-level shape dictionary is learned by K-clustering to achieve discriminative feature coding. In addition, to achieve a compact and simplified shape representation, SPM (Spatial Pyramid Matching) is adopted by max-pooling, which effectively incorporates spatial layout information of the given shape. The experiments demonstrate that the proposed method achieves state-of-the-art performance on several public shape datasets comparing with the latest approaches. Our method also obtains high performance under the noisy and occlusion condition.

关键词： shape recognition Bag of shape descriptor Unsupervised deep learning High-level feature dictionary shape coding

来源：评论

学校读者我要写书评

暂无评论

Low-complexity MPEG-4 shape encoding towards realtime object-based applications

引用

ETRI JOURNAL 2004年第2期26卷 122-135页

作者： Jang, ES Hanyang Univ Coll Informat & Commun Software Div Seoul 133791 South Korea

Although frame-based MPEG-4 video services have been successfully deployed since 2000, MPEG-4 video coding is now facing great competition in becoming a dominant player in the market. Object-based coding is one of the key functionalities of MPEG-4 video coding. Real-time object-based video encoding is also important for multimedia broadcasting for the near future. Object-based video services using MPEG-4 have not yet made a successful debut due to several reasons. One of the critical problems is the coding complexity of object-based video coding over frame-based video coding. Since a video object is described with an arbitrary shape, the bitstream contains not only motion and texture data but also shape data. This has introduced additional complexity to the decoder side as well as to the encoder side. In this paper, we have analyzed the current MPEG-4 video encoding tools and proposed efficient coding technologies that reduce the complexity of the encoder. Using the proposed coding schemes, we have obtained a 56 percent reduction in shape-coding complexity over the MPEG-4 video reference software (Microsoft version, 2000 edition).

关键词： MPEG4 encoding complexity real-time video shape coding

来源：评论

学校读者我要写书评

暂无评论

An integrated scheme of arbitrarily shaped segmentation and motion estimation

引用

Systems and Computers in Japan 1000年第13期31卷 19-30页

作者： Ryoichi Kawada Atsushi Koike Shuichi Matsumoto KDD R&D Laboratories Saitama Japan 356-8502 KDD R&D Laboratories Saitama Japan 356-8502 KDD R&D Laboratories Saitama Japan 356-8502

In a rectangular-block division-based coding method, there is a problem that, in the blocks where different regions coexist, the coding efficiency decreases. In the segmentation-based coding method, which is promising as a way to avoid this problem, the objective of motion estimation and segmentation is to minimize the total generated entropy, that is, the sum of prediction error entropy, shape entropy, and motion vector entropy. The conventional motion estimation and segmentation methods based on processing such as split and merge, from this viewpoint, are not sufficiently optimized. Thus, in this paper, with the objective of optimizing segmentation-based motion-compensated prediction, the authors propose a scheme where motion estimation and segmentation are performed at the same time via dynamic programming. In the proposed scheme, first, for each motion vector and for each pattern of possible segment shapes, its generated entropy is estimated beforehand. Next, dynamic programming is applied to determine the segment for each motion vector that minimizes the total generated entropy. Computer simulation experiments using ordinary test video sequences show that this scheme outperforms the block matching method and conventional segmentation and motion estimation methods. The authors hope that this study will help to establish a general way to apply a segmentation-based coding method, where it has been pointed out that the efficiency is highly dependent on the kinds of pictures involved. © 2000 Scripta Technica, Syst Comp Jpn, 31(13): 19–30, 2000

关键词： video coding motion‐compensated prediction segmentation shape coding dynamic programming

来源：评论

学校读者我要写书评

暂无评论

High-level aftereffects reveal the role of statistical features in visual shape encoding

引用

CURRENT BIOLOGY 2024年第5期34卷 1098-1106.e5页

作者： Morgenstern, Yaniv Storrs, Katherine R. Schmidt, Filipp Hartmann, Frieder Tiedemann, Henning Wagemans, Johan Fleming, Roland W. Erasmus Univ Dept Psychol Burgemeester Oudlaan 50 NL-3062PA Rotterdam Netherlands Univ Leuven KU Leuven Brain & Cognit Tiensestr 102 B-3000 Leuven Belgium Justus Liebig Univ Giessen Dept Psychol Otto-Behaghel-Str 10 Giessen Germany Univ Auckland Sch Psychol 23 Symonds St Auckland 1010 New Zealand Univ Marburg Hans-Meerwein-Str 6 D-35032 Marburg Germany Justus Liebig Univ Giessen Ctr Mind Brain & Behav CMBB Hans-Meerwein-Str 6 D-35032 Marburg Germany

Visual shape perception is central to many everyday tasks, from object recognition to grasping and handling tools.1-10 Yet how shape is encoded in the visual system remains poorly understood. Here, we probed shape representations using visual aftereffects-perceptual distortions that occur following extended exposure to a stimulus.11-17 Such effects are thought to be caused by adaptation in neural populations that encode both simple, low-level stimulus characteristics17-20 and more abstract, high-level object features.21-23 To tease these two contributions apart, we used machine -learning methods to synthesize novel shapes in a multidimensional shape space, derived from a large database of natural shapes.24 Stimuli were carefully selected such that low-level and high-level adaptation models made distinct predictions about the shapes that observers would perceive following adaptation. We found that adaptation along vector trajectories in the high-level shape space predicted shape aftereffects better than simple low-level processes. Our findings reveal the central role of high-level statistical features in the visual representation of shape. The findings also hint that human vision is attuned to the distribution of shapes experienced in the natural environment.

关键词： aftereffects shape coding visual perception computational modeling

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：