MediVision revolutionizes medical imaging by integrating Augmented Reality (AR) and Virtual Reality (VR) with traditional 2D data. This platform transforms standard DICOM images into immersive 3D models, enhancing dia...
详细信息
This article proposes a rapid monitoring and visualization method for surface changes based on multi-temporal DEM and WebGis. Firstly, high-resolution surface images are obtained using drones and processed into a digi...
详细信息
visualization in scientific computing has greatly improved the speed and quality of scientific computation, achieved further modernization of scientific computation tools and environments, and brought fundamental chan...
详细信息
ISBN:
(纸本)9798400708305
visualization in scientific computing has greatly improved the speed and quality of scientific computation, achieved further modernization of scientific computation tools and environments, and brought fundamental changes to scientific research work. We have developed a general post-processing tool for computational fluid dynamics (CFD) based on VTK and QT, which is used to visually represent the output files generated by CFD solvers. The tool utilizes the VTK graphics rendering library to develop a general CFD file reading and visualization interface for data visualization, and implements general CFD post-processing algorithms such as extracting contours, extracting iso-surfaces, generating streamline, etc. In addition, some auxiliary display functions have been added, such as displaying bounding boxes, adjusting color mapping tables, rotating perspective, etc. Besides, a simple but effective graphical user interface has been developed using QT.
The proceedings contain 23 papers. The topics discussed include: simulating gait and structural effects of aging for improved diversity in virtual crowds;a combined linear and circular-arc approximation of curves for ...
ISBN:
(纸本)9780769550510
The proceedings contain 23 papers. The topics discussed include: simulating gait and structural effects of aging for improved diversity in virtual crowds;a combined linear and circular-arc approximation of curves for feedrate smoothing of CNC machining;real-time rendering of rough refraction under dynamically varying environmental lighting;conversion of rational Bezier curves into non-rational Bezier curves using progressive iterative approximation;an approach to Thai decorative pattern recognition using Bezier curve representation with progressive iterative approximation;interactive segmentation based on initial segmentation and region merging;wavelets-based smoothness metric for volume data;efficient handwritten curve approximation by a Bezier curve using Chebyshev polynomials;iterated graph cut integrating texture characterization for interactive image segmentation;and multi-touch multi-user interactive control system using mobile devices.
We present a new data-driven approach for extracting geometric and structural information from a single spherical panorama of an interior scene, and for using this information to render the scene from novel points of ...
详细信息
We present a new data-driven approach for extracting geometric and structural information from a single spherical panorama of an interior scene, and for using this information to render the scene from novel points of view, enhancing 3D immersion in VR applications. The approach copes with the inherent ambiguities of single-image geometry estimation and novel view synthesis by focusing on the very common case of Atlanta-world interiors, bounded by horizontal floors and ceilings and vertical walls. Based on this prior, we introduce a novel end-to-end deep learning approach to jointly estimate the depth and the underlying room structure of the scene. The prior guides the design of the network and of novel domain-specific loss functions, shifting the major computational load on a training phase that exploits available large-scale synthetic panoramic imagery. An extremely lightweight network uses geometric and structural information to infer novel panoramic views from translated positions at interactive rates, from which perspective views matching head rotations are produced and upsampled to the display size. As a result, our method automatically produces new poses around the original camera at interactive rates, within a working area suitable for producing depth cues for VR applications, especially when using head-mounted displays connected to graphics servers. The extracted floor plan and 3D wall structure can also be used to support room exploration. The experimental results demonstrate that our method provides low-latency performance and improves over current state-of-the-art solutions in prediction accuracy on available commonly used indoor panoramic benchmarks.
"Unknown unknowns" are instances predicted models assign incorrect labels with high confidence, greatly reducing the generalization ability of models. In practical applications, unknown unknowns may lead to ...
详细信息
ISBN:
(纸本)9783031500749;9783031500756
"Unknown unknowns" are instances predicted models assign incorrect labels with high confidence, greatly reducing the generalization ability of models. In practical applications, unknown unknowns may lead to significant decision-making mistakes and reduce the application value of models. As unknown unknowns are agnostic to models, it is extremely difficult to figure out why models would make highly confident but incorrect predictions. In this paper, based on identification of unknown unknowns, we investigate the interpretability of unknown unknowns arising from convolutional neural network models in image classification tasks by interpretable methods. We employ visualization methods to interpret prediction results on unknown unknowns, further understand predictive models and analyze the predictive basis of unknown unknowns. We focus the application scenario of interpretability of unknown unknowns on a clothes category recognition task (dress vs shorts) in e-commerce platforms, and observe some patterns of models making wrong classifications that lead to unknown unknowns, which indicates that a CNN model that lacks of common sense can make mistakes even for a large dataset. Besides, we observe some interesting phenomena: certain correct predictions of instances are unreliable due to wrongly identified features by CNNs.
3D scene visualization has a wide range of applications. With the rapid development of various technologies in the corresponding fields, large-scale and complex type of data become a challenge for 3D scene visualizati...
详细信息
visualization is an important tool for understanding complex phenomena and large-scale data, and it has been widely used in the field of natural science. Since the concept of visualization was put forward in 1986, the...
详细信息
Today, school students are first introduced to geom-etry and advanced mathematical concepts by plotting a graph in 2D and they struggle with 3D concepts since current technologies are either too difficult to grasp or ...
详细信息
One of the grand challenges in computer vision is to recover 3D poses and shapes of multiple human bodies with absolute scales from a single RGB image. The challenge stems from the inherent depth and scale ambiguity f...
详细信息
ISBN:
(纸本)9783031667428;9783031667435
One of the grand challenges in computer vision is to recover 3D poses and shapes of multiple human bodies with absolute scales from a single RGB image. The challenge stems from the inherent depth and scale ambiguity from a single view. The state of the art on 3D human pose and shape estimation mainly focuses on estimating the 3D joint locations relative to the root joint, defined as the pelvis joint. In this paper, a novel approach called Absolute-ROMP is proposed, which builds upon a one-stage multi-person 3D mesh predictor network, ROMP, to estimate multi-person 3D poses and shapes, but with absolute scales from a single RGB image. To achieve this, we introduce absolute root joint localization in the camera coordinate frame, which enables the estimation of 3D mesh coordinates of all persons in the image and their root joint locations normalized by the focal point. Moreover, a CNN and transformer hybrid network, called TransFocal, is proposed to predict the focal length of the image's camera. This enables Absolute-ROMP to obtain absolute depth information of all joints in the camera coordinate frame, further improving the accuracy of our proposed method. The Absolute-ROMP is evaluated on the root joint localization and root-relative 3D pose estimation tasks on publicly available multi-person 3D pose datasets, and TransFocal is evaluated on a dataset created from the Pano360 dataset. Our proposed approach achieves state-of-the-art results on these tasks, outperforming existing methods or has competitive performance. Due to its real-time performance, our method is applicable to in-the-wild images and videos.
暂无评论