检索结果-内蒙古大学图书馆

A two-level framework for place recognition with 3D LiDAR based on spatial relation graph

PATTERN RECOGNITION 2021年 120卷 108171-108171页

作者： Gong, Yansong Sun, Fengchi Yuan, Jing Zhu, Wenbin Sun, Qinxuan Nankai Univ Coll Software 38 Tongyan Rd Tianjin 300350 Peoples R China Nankai Univ Coll Artificial Intelligence 38 Tongyan Rd Tianjin 300350 Peoples R China

In the field of robotics, due to the complexity of real environments, place recognition using the 3D LiDAR is always a challenging problem. The spatial relations of internal structures underlying the LiDAR data from different places are distinguishable, which can be used to describe the environment. In this paper, we utilize the spatial relations of internal structures and propose a two-level framework for 3D LiDAR place recognition based on the spatial relation graph (SRG). At first, the proposed framework segments the point cloud into multiple clusters, then the features of the clusters and the spatial relation descriptors (SRDs) between the clusters are extracted, and the point cloud is represented by the SRG, which uses the clusters as the nodes and their spatial relations as the edges. After that, we propose a two-level matching model in which two different models are fused for accurately and efficiently matching the SRGs, including the upper-level searching model (U-LSM) and lower-level matching model (L-LMM). In the U-LSM, an incremental bag-of-words model is used to search for candidate SRGs through the distribution of the SRDs in the SRG. In the L-LMM, we utilize the improved spectral method to calculate similarities between the current SRG and the candidates. The experimental results demonstrate that our framework achieves good precision, recall and viewpoint robustness on both public benchmarks and self-built campus dataset. (c) 2021 Elsevier Ltd. All rights reserved.

关键词： Place recognition 3D LiDAR spatial relation graph Two-level framework

来源：评论

学校读者我要写书评

暂无评论

Video Captioning With Object-Aware Spatio-Temporal Correlation and Aggregation

引用

IEEE TRANSACTIONS ON IMAGE PROCESSING 2020年 29卷 6209-6222页

作者： Zhang, Junchao Peng, Yuxin Peking Univ Wangxuan Inst Comp Technol Beijing 100871 Peoples R China

Video captioning is a significant challenging task in computer vision and natural language processing, aiming to automatically describe video content by natural language sentences. Comprehensive understanding of video is the key for accurate video captioning, which needs to not only capture the global content and salient objects in video, but also understand the spatio-temporal relations of objects, including their temporal trajectories and spatial relationships. Thus, it is important for video captioning to capture the objects' relationships both within and across frames. Therefore, in this paper, we propose an object-aware spatio-temporal graph (OSTG) approach for video captioning. It constructs spatio-temporal graphs to depict objects with their relations, where the temporal graphs represent objects' inter-frame dynamics, and the spatial graphs represent objects' intra-frame interactive relationships. The main novelties and advantages are: (1) Bidirectional temporal alignment: Bidirectional temporal graph is constructed along and reversely along the temporal order to perform bidirectional temporal alignment for objects across different frames, which provides complementary clues to capture the inter-frame temporal trajectories for each salient object. (2) graph based spatial relation learning: spatial relation graph is constructed among objects in each frame by considering their relative spatial locations and semantic correlations, which is exploited to learn relation features that encode intra-frame relationships for salient objects. (3) Object-aware feature aggregation: Trainable VLAD (vector of locally aggregated descriptors) models are deployed to perform object-aware feature aggregation on objects' local features, which learn discriminative aggregated representations for better video captioning. A hierarchical attention mechanism is also developed to distinguish contributions of different object instances. Experiments on two widely-used datasets, MSR-VTT and

关键词： Video captioning spatio-temporal graph bidirectional temporal graph spatial relation graph object-aware feature aggregation

来源：评论

学校读者我要写书评

暂无评论

An online composite graphics recognition approach based on matching of spatial relation graphs

引用

International Journal on Document Analysis and Recognition 2004年第1期7卷 44-55页

作者： Xu, Xiaogang Sun, Zhengxing Peng, Binbin Jin, Xiangyu Liu, Wenyin State Key Lab for Novel Software Technology Nanjing University Nanjing 210093 China Department of Computer Science and Technology Nanjing University Nanjing 210093 China Department of Computer Science City University of Hong Kong Hong Kong Hong Kong

A spatial relation graph (SRG) and its partial matching method are proposed for online composite graphics representation and recognition. The SRG-based approach emphasizes three characteristics of online graphics recognition: partial, structural, and independent of stroke order and stroke number. A constrained partial permutation strategy is also proposed to reduce the computational cost of matching two SRGs, which is originally an NP-complete problem as is graph isomorphism. Experimental results show that our proposed SRG-based approach is both efficient and effective for online composite graphics recognition in our sketch-based graphics input system - SmartSketchpad. © Springer-Verlag Berlin/Heidelberg 2004.

关键词： graph matching Online graphics recognition Sketch-based user interface spatial relation graph

来源：评论

学校读者我要写书评

暂无评论

Matching spatial relation graphs using a constrained partial permutation strategy

引用

Journal of Southeast University(English Edition) 2003年第3期19卷 236-239页

作者：徐晓刚孙正兴刘文印南京大学计算机软件新技术国家重点实验室南京210093 香港城市大学计算机科学系

A constrained partial permutation strategy is proposed for matching spatial relation graph (SRG), which is used in our sketch input and recognition system Smart Sketchpad for representing the spatial relationship among the components of a graphic object. Using two kinds of matching constraints dynamically generated in the matching process, the proposed approach can prune most improper mappings between SRGs during the matching process. According to our theoretical analysis in this paper, the time complexity of our approach is O(n 2) in the best case, and O(n!) in the worst case, which occurs infrequently. The spatial complexity is always O(n) for all cases. Implemented in Smart Sketchpad, our proposed strategy is of good performance.

关键词： spatial relation graph graph matching constrained partial permutation graphics recognition

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：