检索结果-内蒙古大学图书馆

Thiruvananthapuram Aerial LidAR dataset (TALd): A Benchmark for Complex Urban Point Cloud

IEEE ACCESS 2025年 13卷 42350-42363页

作者： Vijaywargiya, Jayati Ramiya, Anandakumar M. Indian Inst Space Sci & Technol Dept Earth & Space Sci Thiruvananthapuram 695547 India

Urban heterogeneity is influenced by population density, cultural factors, and historical background. Three-dimensional mapping using LidAR is essential for capturing structural and spatial changes in complex urban areas. Machine learning-based algorithms for processing point clouds play vital role in transforming unprocessed LidAR data into relevant information suitable for urban applications such as object recognition, and 3d mapping. This underscores the importance of having range of benchmark LidAR datasets to enhance development, testing, and validation of algorithms tailored to various urban contexts. However, existing Airborne LidAR Scanned (ALS) datasets represent a limited range of global land cover diversity. To address this gap, we introduce Thiruvananthapuram Aerial LidAR dataset (TALd), a benchmark dataset covering 9 square kilometers from Thiruvananthapuram, Kerala, India. This South Indian region exhibits high-density mixed urban development integrating both built and vegetative elements. TALd, derived from ALS point clouds, has an average point density of 12 points/m(2) and includes colored LidAR points classified into buildings, trees, shrubs, and ground. The dataset is created through systematic pre-processing, classification using automated algorithms and manual corrections, and instance segmentation for noise removal. It includes X, Y, Z coordinates (UTM 43N), RGB values, return number, number of returns, scan angle rank, and class designation. The Land-cover diversity Index (LdI) is 1.49 for TALd, significantly higher than dALES (0.23) and ISPRS Vaihingen 3d (0.37), highlighting its focus on tropical urban environments with dense vegetation and complex infrastructure. TALd serves as a valuable benchmark for advancing point cloud processing, supporting urban mapping in challenging landscapes.

关键词： Point cloud compression Laser radar Benchmark testing Three-dimensional displays Urban areas Vegetation mapping Buildings Accuracy Complexity theory Land surface Point clouds airborne LidAR scan (ALS) urban mapping semantic segmentation 3d datasets landcover diversity index (LdI)

来源：评论

学校读者我要写书评

暂无评论

Symbolism and directivity of Joint Keypoints in Temporal and Spatial dimensions in Human Pose Prediction With GCN-Based Model

引用

IEEE ACCESS 2023年 11卷 146090-146102页

作者： Li, Jinhui Huang, Jianying Kang, Hoon Chung Ang Univ Sch Elect & Elect Engn Seoul 06974 South Korea

A wide variety of methods have been developed to predict the posture of the human body at a given point in time based on data on previous movements. More recently, prediction models based on deep learning have become a topic of active research and development. In this study, we adopt the strategy of separating spatial and temporal information based on an existing STGCN model to extract features effectively in both space and time, and we analyzed the effects of signed or unsigned and directed or undirected forecasts of the positions of human joints with this approach. We propose a method using an encoder based on a modified graph adjacency matrix in a graph convolutional network model and focus especially on the terms of the signs and directions of data on the locations of the joints in space and time. We also introduce a global residual block. The results of an experimental evaluation of our proposed method showed that we obtained better performance by applying the signed and directed features independently to the spatial and temporal adjacency matrices. The proposed model exhibited noticeable improvements in several aspects. In future research, we expect these features of the modified adjacency matrix to help learning models understand the correlation between symbols and directions for various actions and poses.

关键词： Graph convolutional networks (GCN) spatial temporal graph convolutional networks (STGCN) 3d datasets human pose prediction directional & symbolic method human joints key points

来源：评论

学校读者我要写书评

暂无评论

Learning 3d Semantic Segmentation with only 2d Image Supervision 9

Learning 3D Semantic Segmentation with only 2D Image Supervi...

引用

9th International Conference on 3d Vision (3dV)

作者： Genova, Kyle Yin, Xiaoqi Kundu, Abhijit Pantofaru, Caroline Cole, Forrester Sud, Avneesh Brewington, Brian Shucker, Brian Funkhouser, Thomas Google Res Mountain View CA 94043 USA Princeton Univ Princeton NJ 08544 USA

ISBN: (纸本)9781665426886

With the recent growth of urban mapping and autonomous driving efforts, there has been an explosion of raw 3d data collected from terrestrial platforms with lidar scanners and color cameras. However, due to high labeling costs, ground-truth 3d semantic segmentation annotations are limited in both quantity and geographic diversity, while also being difficult to transfer across sensors. In contrast, large image collections with ground-truth semantic segmentations are readily available for diverse sets of scenes. In this paper, we investigate how to use only those labeled 2d image collections to supervise training 3d semantic segmentation models. Our approach is to train a 3d model from pseudo-labels derived from 2d semantic image segmentations using multiview fusion. We address several novel issues with this approach, including how to select trusted pseudo-labels, how to sample 3d scenes with rare object categories, and how to decouple input features from 2d images from pseudo-labels during training. The proposed network architecture, 2d3dNet, achieves significantly better performance (+6.2-11.4 mIoU) than baselines during experiments on a new urban dataset with lidar and images captured in 20 cities across 5 continents.

关键词： 2d3dNet 3d datasets 3d Semantic Segmentation Cross modal Supervision Multi view Fusion Sparse Voxel Convolution

来源：评论

学校读者我要写书评

暂无评论

Selective coding with controlled quality decay for 2d and 3d images in a JPEG2000 framework

Selective coding with controlled quality decay for 2D and 3D...

引用

Conference on Visual Communications and Image Processing 2003

作者： Signoroni, A Lazzaroni, F Leonardi, R Univ Brescia DEA Signals & Commun Lab I-25123 Brescia Italy

ISBN: (纸本)0819450235

This paper presents some ideas which extend the functionality and the application fields of a spatially selective coding within a JPEG2000 framework. At first, the image quality drop between the Regions of Interest (ROI) and the background (BG) is considered. In a conventional approach, the reconstructed image quality steeply drops along the ROI boundary;however, this effect could be considered or perceived objectionable in some use cases. A simple quality decay management is proposed here, which makes use of concentric ROI with different scaling factors. This allows the technique to be perfectly consistent with the JPEG2000 part 2 ROI definition and description. Another considered issue is the extension of the selective ROI coding to a 3d Volume of Interest coding. This extension is currently under consideration for the part 10 of JPEG2000, JP3d. An easy and effective 2d to 3d extension for the VOI definition and description is proposed here: a VOI is defined by a set composition of ROI generated solids, where ROI are defined along one or more volume cutting direction, and is described by the relative set of ROI parameters. Moreover, the quality decay management can be applied to this extension. The proposed techniques could have a significant impact on the selective coding of medical images and volumes. Image quality issues are very important but very critical factors in that field, which also constitutes the dominant market for 3d applications. Therefore;some experiments are presented on medical images and volumes in order evaluate the benefits of the proposed approaches in terms of diagnostic quality improvement with respect to a conventional ROI coding usage.

关键词： selective ROI coding image quality wavelet coding 3d datasets JPEG2000 medical imaging

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：