Most imageprocessing algorithms are inherently parallel, so multithreading processors are suitable in such applications. In huge image databases, imageprocessing takes very long time for run on a single core process...
详细信息
Most imageprocessing algorithms are inherently parallel, so multithreading processors are suitable in such applications. In huge image databases, imageprocessing takes very long time for run on a single core processor because of single thread execution of algorithms. GPU is more common in most imageprocessing applications due to multithread execution of algorithms, programmability and low cost. In this paper we show how to implement the MPRG-7 Edge Histogram Descriptor in parallel using CUDA programming model on a GPU. The Edge Histogram Descriptor describes the distribution of various types of edges with a histogram that can be a tool for image matching. This feature is applied to search images from a database which are similar to a query image. We evaluated the retrieval of the proposed technique using recall, precision, and average precision measures. Experimental results showed that parallel implementation led to an average speed up of 14.74×over the serial implementation. The average precision and the average recall of presented method are 67.02% and 55.00% respectively.
In this paper we present how Intel's Single-Chip-Cloud processor behaves for parallel macro pipeline applications. Subsets of the SCC's available cores can be arranged as a pipeline where each core processes o...
详细信息
Among various image retrieval approaches, the use of sketches lets one express a precise visual query with simple and widespread means. The challenge consists in finding a content representation that allows you to eff...
详细信息
Among various image retrieval approaches, the use of sketches lets one express a precise visual query with simple and widespread means. The challenge consists in finding a content representation that allows you to effectively compare sketches and images, while supporting efficient retrieval in order to make the system scalable. We put forward a sketch-based image retrieval solution where sketches and natural image contours are represented and compared in the wavelet domain. The relevant information regarding query sketches and image content has, thus, a compact representation that can be readily employed by an efficient index for retrieval by similarity. Furthermore, with this solution, the balance between effectiveness and efficiency can be easily modified in order to adapt to the available resources. A comparative evaluation with a state-of-the-art method on the Paris dataset and a subset with 535K images of the image Net dataset shows that our solution can preserve effectiveness while being more than one order of magnitude faster.
There are many scenarios in which user interaction is essential for effective image segmentation. In this paper, we present a new interactive segmentation method based on the image Foresting Transform (IFT). The metho...
详细信息
There are many scenarios in which user interaction is essential for effective image segmentation. In this paper, we present a new interactive segmentation method based on the image Foresting Transform (IFT). The method over segments the input image, creates a graph based on these segments (super pixels), receives markers (labels) drawn by the user on some super pixels and organizes a competition to label every pixel in the image. Our method has several interesting properties: it is effective, efficient, capable of segmenting multiple objects in almost linear time on the number of super pixels, readily extendable through previously published techniques, and benefits from domain-specific feature extraction. We also present a comparison with another technique based on the IFT, which can be seen as its pixel-based counterpart. Another contribution of this paper is the description of automatic (robot) users. Given a ground truth image, these robots simulate interactive segmentation by trained and untrained users, reducing the costs and biases involved in comparing segmentation techniques.
This work presents a method to extract polygonal surfaces from volumetric models created by artists, proposing a way of using voxel modeling tools to build B-Rep models. The volumetric data created by voxel editors us...
详细信息
This work presents a method to extract polygonal surfaces from volumetric models created by artists, proposing a way of using voxel modeling tools to build B-Rep models. The volumetric data created by voxel editors usually contain topological features that do not describe solid structures. Hence, the main objective of this work is to solve the problem of extracting triangle meshes from volumes that contain these topological features. In order to extract surfaces successfully, a methodology was conceived to resample any volumetric model, in a way that it is possible to reconstruct a three-dimensional manifold that can be polygonized without generating a surface with gaps or topological problems. The meshes generated by this technique have good properties, satisfying some of the main criteria used to measure the quality of meshes, such as aspect ratio, smoothness and skewness.
Recent developments in touch and display technologies have laid the groundwork to combine touch-sensitive display systems with stereoscopic three-dimensional (3D) display. Although this combination provides a compelli...
详细信息
The swallowing process affects several aspects of one's welfare, as nutrition, hydration, respiration and hearing. Magnetic resonance imaging (MRI) has been a valuable tool to study swallowing, since it is a non-i...
详细信息
The swallowing process affects several aspects of one's welfare, as nutrition, hydration, respiration and hearing. Magnetic resonance imaging (MRI) has been a valuable tool to study swallowing, since it is a non-invasive procedure that can dynamically capture the shapes of the tongue and other elements involved in the process. The resolution enhancement of the MRI frames support directly diseases diagnoses, helping the visual analysis, or can be a pre-processing tool to segmentation, classification, recognition or modelling. MRI frames with better resolution, with less blurring or noise can be obtained changing the acquisition process, or using more powerful devices, but the cost of this solution is higher than applying computational Super-Resolution (SR) techniques. This paper studies a Bayesian approach to provide a Wiener filter to regularize the conjugate gradient solution, and promote an adaptation of an iterative SR method for non-rigid registration that can be generalized to other iterative SR methods.
computer simulation of realistic crowd behavior has been the focus of active research for more than two decades now. In crowd simulation, there is usually a trade-off between performance and realistic crowd behavior. ...
详细信息
computer simulation of realistic crowd behavior has been the focus of active research for more than two decades now. In crowd simulation, there is usually a trade-off between performance and realistic crowd behavior. In this paper, we propose a model, based on potential fields, that enables the introduction of many behaviors in crowd simulations, while keeping good performance. The model uses multiple groups to guide agents to various different goals in the environment, and combines potential fields and reciprocal velocity obstacles (RVO) approaches, where the first sets the preferred velocities of the agents according to their current goals, whereas the second makes the agents avoid collisions. We used three scenarios to demonstrate the capabilities of our model for simulating crowds in which the agents present greater variety of behaviors in real-time without using a complex architecture.
Dynamic queries continuously update the data that is visualized in accordance with the user actions. They are typically applied for visual information seeking. This paper proposes to introduce this interaction style f...
详细信息
Dynamic queries continuously update the data that is visualized in accordance with the user actions. They are typically applied for visual information seeking. This paper proposes to introduce this interaction style for exploring 3D medical neuroimages in its original form, enhancing visual seeking technology in a medical diagnostic procedure. More precisely, we present three dynamic query tools that allow the user to change the focus on-the-fly, while the surrounding tissue is preserved. They are a curvilinear cropper, a volumetric probe and a movable magnifying lens. Once information-preserving visualization is essential for accurate diagnosis and legal protection, the dataset is in its original form. The originality of our work relies on the input interface through which an expert can directly manipulate those tools on the raw data and the responsiveness of each displayed voxel by exploiting the power of GPUs. The proposed techniques have been integrated in a visualization prototype and were assessed by the neuroimaging experts, who were be able to identify subtle lesions in the brain.
The continuous creation of digital video has caused an exponential growth of digital video content. To increase the usability of such large volume of videos, a lot of research has been made. Video summarization has be...
详细信息
The continuous creation of digital video has caused an exponential growth of digital video content. To increase the usability of such large volume of videos, a lot of research has been made. Video summarization has been proposed to rapidly browse large video collections. To summarize any type of video, researchers have relied on visual features contained in frames. In order to extract these features, different techniques have used local or global descriptors. In this paper, we propose a method for static video summarization that can produce meaningful and informative video summaries. We perform an evaluation using over 100 videos in order to achieve a stronger position about the performance of local descriptors in semantic video summarization. Our experimental results show, with a confidence level of 99%, that our proposed method using local descriptors and temporal video segmentation produces better summaries than state of the art methods. We also demonstrate the importance of a more elaborate method for temporal video segmentation, improving the generation of summaries, achieving 10% improvement in accuracy. We also acknowledge a marginal importance of color information when using local descriptors to produce video summaries.
暂无评论