Univariate time series is utilized in numerous scientific applications for a variety of purposes. They serve as the foundation for different statistical measurements and show how data evolves over time when viewed ind...
详细信息
The externalization of the state of one's mind, which people refer to as "mind reading"in science fiction, is currently being realized through brain decoding research. This field of study aims to deepen ...
详细信息
We proposed a new way to represent and reconstruct multidimensional MR images. Specifically, a representation capable of disentangling different types of features in high-dimensional images was learned via training an...
详细信息
Despite achieving impressive improvement in accuracy, most existing monocular 3D human mesh reconstruction methods require large-scale 2D/3D ground-truths for supervision, which limits their applications on unlabeled ...
详细信息
ISBN:
(纸本)1577358872
Despite achieving impressive improvement in accuracy, most existing monocular 3D human mesh reconstruction methods require large-scale 2D/3D ground-truths for supervision, which limits their applications on unlabeled in-the-wild data that is ubiquitous. To alleviate the reliance on 2D/3D ground-truths, we present a self-supervised 3D human pose and shape reconstruction framework that relies only on self-consistency between intermediate representations of images and projected 2D predictions. Specifically, we extract 2D joints and depth maps from monocular images as proxy inputs, which provides complementary clues to infer accurate 3D human meshes. Furthermore, to reduce the impacts from noisy and ambiguous inputs while better concentrate on the high-quality information, we design an uncertainty-aware module to automatically learn the reliability of the inputs at body-joint level based on the consistency between 2D joints and depth map. Experiments on benchmark datasets show that our approach outperforms other state-of-the-art methods at similar supervision levels.
The existence of burnt rock seriously affects the safety of coal mine production. In this paper, the airborne hyperspectral remote sensing image (CASI/SASI) was used as a main data resource in Rujigou Coalfield, Ningx...
详细信息
Light fields can simultaneously capture the intensity and direction of each light ray, offering enhanced information for depth estimation. Currently, accurately extracting the epipolar plane lines and handling occlusi...
详细信息
Terahertz computed tomography (THz CT) demonstrates its advantages in aspects of nonmetallic and nonpolar materials penetration, 3D internal structure visualization, etc. To perform satisfied reconstruction results, i...
详细信息
Terahertz computed tomography (THz CT) demonstrates its advantages in aspects of nonmetallic and nonpolar materials penetration, 3D internal structure visualization, etc. To perform satisfied reconstruction results, it is necessary to obtain complete measurements from many different views. However, this process is time-consuming and we usually obtain incomplete projections for THz CT in practice, which generates artifacts in the final reconstructed images. To address this issue, dictionary learning-based THz CT reconstruction (DLTR) model is proposed in this study. Especially, the image patches are extracted from other state-of-the-art reconstructed images to train the initial dictionary by using the K-SVD algorithm. Then, the dictionary can be adaptively updated during THz CT reconstruction. Finally, the updated dictionary is used for further updating reconstructed images. In order to verify the accuracy and quality of DLTR method, the filtered back-projection (FBP), simultaneous algebraic reconstruction technique (SART), and total variation (TV) reconstruction are chosen as comparisons. The experiment results show that the DLTR method has a good capability for noise suppression and structures preservation.
To successfully adhere to flight plans, aerial vehicles must keep track of their location in 3D space, which is usually reliant on external references such as GNSS which are susceptible to interference. To develop sel...
详细信息
ISBN:
(纸本)9781665476331
To successfully adhere to flight plans, aerial vehicles must keep track of their location in 3D space, which is usually reliant on external references such as GNSS which are susceptible to interference. To develop self-reliant onboard positional localization, a workflow using 360-degree panoramic images in an image-based localization system using a Deep Convolutional Neural Network is proposed. 360-degree panoramic images have the advantage that they take into account visual information from all angles. Model performance is also enhanced by generating synthetic datafrom a 3D model of the region of interest created via photogrammetry techniques. The performances of different training configurations are compared, and the configuration with mixed real and synthetic data exhibits the highest performance, an approximately 10 to 15 percent improvement over using solely real data. Additional image augmentations also further reduce the localization error by 8 to 15 percent.
Anomaly detection is an essential component of machine learning that renders the outcomes neutral to any category or class. Due to the wide range of anomalies that might exist in time-series data, it plays a crucial r...
详细信息
In various studies trees have been extracted and their conditions have been examined through different detection algorithms from two main data sources including (a) point cloud and (b) raster data. The output of tree ...
详细信息
In various studies trees have been extracted and their conditions have been examined through different detection algorithms from two main data sources including (a) point cloud and (b) raster data. The output of tree extraction is the input of the next processing steps, and the importance of these outputs is proved more than before. Tree Extraction (TE) has many applications in biomass estimation, CHM extraction, etc. All of which require high accuracy and the correct position of the trees. therefore, in this study, a comparison between tree extraction algorithms in two common sources of data has been conducted. As for the raster data, all bands are first co-registered. Afterward, the trees are separated from the background by using image processing techniques such as changing the image color space and weighted averaging on different bands. Finally, TE algorithms such as watershed segmentation, valley following, local maxima, and image binarization were applied. As for the point cloud data, TE can be conducted in the object space to compensate for the methods used in the raster space with object detection algorithms e.g., the coherence between the two trees, etc. which have been discussed in detail in this paper. In the object space, three algorithms, region-based, surface normal, and Euclidean segmentation, were implemented and discussed on the same raster data set in the photogrammetric point cloud. The results show the higher accuracy of the region-based algorithm in object-space by more than 26% in comparison with the valley following algorithm in image space.
暂无评论