This paper evaluates different state-of-the-art approaches for implementing an X-ray view in Augmented Reality (AR). Our focus is on approaches supporting a better scene understanding and in particular a better sense ...
详细信息
ISBN:
(纸本)9781450306539
This paper evaluates different state-of-the-art approaches for implementing an X-ray view in Augmented Reality (AR). Our focus is on approaches supporting a better scene understanding and in particular a better sense of depth order between physical objects and digital objects. One of the main goals of this work is to provide effective X-ray visualization techniques that work in unprepared outdoor environments. In order to achieve this goal, we focus on methods that automatically extract depth cues from video images. The extracted depth cues are combined in ghosting maps that are used to assign each video image pixel a transparency value to control the overlay in the AR view. Within our study, we analyze three different types of ghosting maps, 1) alpha-blending which uses a uniform alpha value within the ghosting map, 2) edge-based ghosting which is based on edge extraction and 3) image-based ghosting which incorporates perceptual grouping, saliency information, edges and texture details. Our study results demonstrate that the latter technique helps the user to understand the subsurface location of virtual objects better than using alpha-blending or the edge-based ghosting. Copyright 2014 ACM.
Designing an elegant 3D virtual garment model for a 3D virtual human model is labor-intensive, because most existing garment models are custom-made for a specific human model and cannot be easily reused for other indi...
详细信息
The current state of the art in visualization research places a strong emphasis on different techniques to derive insight from disparate types of data. However, little work has investigated the visualization process i...
详细信息
The current state of the art in visualization research places a strong emphasis on different techniques to derive insight from disparate types of data. However, little work has investigated the visualization process itself. The information content of the visualization process-the results, history, and relationships between those results-is addressed by this work. A characterization of the visualization process is discussed, leading to a general model of the visualization exploration process. The model, based upon a new parameter derivation calculus, can be used for automated reporting, analysis, or visualized directly. An XML-based language for expressing visualization sessions using the model is also described. These sessions can then be shared and reused by collaborators. The model, along with the XML representation, provides an effective means to utilize the information within the visualization process to further data exploration.
We discuss techniques for the visualization of medical volume data dedicated for their clinical use. We describe the need for rapid dynamic interaction facilities with such visualizations and discuss emphasis techniqu...
详细信息
An appropriate choice of the distance metric is a fundamental problem in pattern recognition, machine learning and cluster analysis. Some methods that based on the distance of samples, e.g, the k-means clustering algo...
An appropriate choice of the distance metric is a fundamental problem in pattern recognition, machine learning and cluster analysis. Some methods that based on the distance of samples, e.g, the k-means clustering algorithm and the k-nearest neighbor classifier, are crucially relied on the performance of the distance metric. In this paper, the property of translation invariance for the distance metric of images is especially emphasized. The consideration is twofold. Firstly, some of the commonly used distance metrics, such as the Euclidean and Minkowski distance, are independent of the training set and/or the domain-specific knowledge. Secondly, the translation invariance is a necessary property for any intuitively reasonable image metric. The image Euclidean distance (IMED) and generalized Euclidean distance (GED) are image metrics that take the spatial relationship between pixels into consideration. Sun et al.(IEEE Conference on computer Vision and Pattern Recognition, pp 1398–1405, 2009) showed that IMED is equivalent to a translation-invariant transform and proposed a metric learning algorithm based on the equivalency. In this paper, we provide a complete treatment on this topic and extend the equivalency to the discrete frequency domain. Based on the connection, we show that GED and IMED can be implemented as low-pass filters, which reduce the space and time complexities significantly. The transform domain metric learning proposed in (Sun et al. 2009) is also resembled as a translation-invariant counterpart of LDA. Experimental results demonstrate improvements in algorithm efficiency and performance boosts on the small sample size problems.
Ear recognition is a promising biometric measure, especially with the growing interest in multi-modal biometrics. Histogram of Oriented Gradients (HOG) have been effectively and efficiently used solving the problems o...
详细信息
Ear recognition is a promising biometric measure, especially with the growing interest in multi-modal biometrics. Histogram of Oriented Gradients (HOG) have been effectively and efficiently used solving the problems of object detection and recognition, especially when illumination variations are present. This work presents a robust approach for ear recognition using multi-scale dense HOG features as a descriptor of 2D ear images. The multi-scale features assure to capture the different and complicated structures of ear images. Dimensionality reduction was performed to avoid feature redundancy and provide a more efficient recognition process while being prone to over-fitting. Finally, a test was performed on a large and realistic database and the results were compared to the state of the art ear recognition approaches tested on the same dataset and under the same test procedure.
The two volume set LNCS 8887 and 8888 constitutes the refereed proceedings of the 10th International Symposium on Visual Computing, ISVC 2014, held in Las Vegas, NV, USA. The 74 revised full papers and 55 poster paper...
详细信息
ISBN:
(数字)9783319142494
ISBN:
(纸本)9783319142487
The two volume set LNCS 8887 and 8888 constitutes the refereed proceedings of the 10th International Symposium on Visual Computing, ISVC 2014, held in Las Vegas, NV, USA. The 74 revised full papers and 55 poster papers presented together with 39 special track papers were carefully reviewed and selected from more than 280 submissions. The papers are organized in topical sections: Part I (LNCS 8887) comprises computational bioimaging, computergraphics; motion, tracking, feature extraction and matching, segmentation, visualization, mapping, modeling and surface reconstruction, unmanned autonomous systems, medical imaging, tracking for human activity monitoring, intelligent transportation systems, visual perception and robotic systems. Part II (LNCS 8888) comprises topics such as computational bioimaging , recognition, computer vision, applications, face processing and recognition, virtual reality, and the poster sessions.
The two volume set LNCS 8887 and 8888 constitutes the refereed proceedings of the 10th International Symposium on Visual Computing, ISVC 2014, held in Las Vegas, NV, USA. The 74 revised full papers and 55 poster paper...
详细信息
ISBN:
(数字)9783319143644
ISBN:
(纸本)9783319143637
The two volume set LNCS 8887 and 8888 constitutes the refereed proceedings of the 10th International Symposium on Visual Computing, ISVC 2014, held in Las Vegas, NV, USA. The 74 revised full papers and 55 poster papers presented together with 39 special track papers were carefully reviewed and selected from more than 280 submissions. The papers are organized in topical sections: Part I (LNCS 8887) comprises computational bioimaging, computergraphics; motion, tracking, feature extraction and matching, segmentation, visualization, mapping, modeling and surface reconstruction, unmanned autonomous systems, medical imaging, tracking for human activity monitoring, intelligent transportation systems, visual perception and robotic systems. Part II (LNCS 8888) comprises topics such as computational bioimaging , recognition, computer vision, applications, face processing and recognition, virtual reality, and the poster sessions.
Automatic character recognition and image understanding of a given paper document are the main objectives of the computer vision field. For these problems, a basic step is to isolate characters and group words from th...
详细信息
暂无评论