Evaluation of ill-posed problems like Intrinsic image Decomposition (IID) is challenging. IID involves decomposing an image into its constituent illumination-invariant Reflectance (R) and albedo-invariant Shading (S) ...
详细信息
The GPUs pack high computation power and a restricted architecture into easily available hardware today. They are now used as computation co-processors and come with programming models that treat them as standard para...
详细信息
The GPUs pack high computation power and a restricted architecture into easily available hardware today. They are now used as computation co-processors and come with programming models that treat them as standard parallel architectures. We explore the problem of realtime ray casting of large deformable models (over a million triangles) on large displays (a million pixels) on an off-the-shelf GPU in this paper. Ray casting is an inherently parallel and highly compute intensive operation. We build a GPU-efficient three-dimensional data structure for this purpose and a corresponding algorithm that uses it for fast ray casting. We also present fast methods to build the data structure on the SIMD GPUs, including a fast multi-split operation. We achieve real-time ray-casting of a million triangle model onto a million pixels on current Nvidia GPUs using the CUDA model. Results are presented on the data structure building and ray casting on a number of models. The ideas presented here are likely to extend to later models and architectures of the GPU as well as to other multi-core architectures.
Recent methods of bottom-up salient object detection have attempted to either: (i) obtain a probability map with a 'contrast rarity' based functional, formed using low level cues;or (ii) Minimize an objective ...
详细信息
ISBN:
(纸本)1595930361
Recent methods of bottom-up salient object detection have attempted to either: (i) obtain a probability map with a 'contrast rarity' based functional, formed using low level cues;or (ii) Minimize an objective function, to detect the object. Most of these methods fail for complex, natural scenes, such as the PASCAL-VOC challenge dataset which contains images with diverse appearances, illumination conditions, multiple distracting objects and varying scene environments. We thus formulate a novel multi-criteria objective function which captures many dependencies and the scene structure for correct spatial propagation of low-level priors to perform salient object segmentation, in such cases. Our proposed formulation is based on CRF modeling where the minimization is performed using graph cut and the optimal parameters of the objective function are learned using a max-margin framework from the training set, without the use of class labels. Hence the method proposed is unsu-pervised, and works efficiently when compared to the very recent state-of-the art methods of saliency map detection and object proposals. Results, compared using F-measure and intersection-over-union scores, show that the proposed method exhibits superior performance in case of the complex PASCAL-VOC 2012 object segmentation dataset as well as the traditional MSRA-B saliency dataset. Copyright 2014 ACM.
We present an improved mesh denoising method based on 3D geometric bilateral filtering. Its novelty is that it can preserve the details of the object as well as reduce the noise in an effective manner. The previous ap...
详细信息
We present an improved mesh denoising method based on 3D geometric bilateral filtering. Its novelty is that it can preserve the details of the object as well as reduce the noise in an effective manner. The previous approach of geometric bilateral filtering for 3D-scan points has a limitation that it reduces the point density, thereby losing the details present in the object. The approach proposed by us, on the contrary, works on the surface mesh obtained after triangulating the 3D-scan points without any data downsampling. Each vertex of the mesh is repositioned appropriately based on the estimated centroid of the vertices in its local neighborhood and a Gaussian weight function. Experimental results demonstrate its strength, efficiency, and robustness.
This book constitutes the refereed conferenceproceedings of the 8th International conference on Multi-disciplinary Trends in Artificial Intelligence, MIWAI 2014, held in Bangalore, India, in December 2014. The 22 rev...
详细信息
ISBN:
(数字)9783319133652
ISBN:
(纸本)9783319133645
This book constitutes the refereed conferenceproceedings of the 8th International conference on Multi-disciplinary Trends in Artificial Intelligence, MIWAI 2014, held in Bangalore, India, in December 2014.
The 22 revised full papers were carefully reviewed and selected from 44 submissions. The papers feature a wide range of topics covering both theory, methods and tools as well as their diverse applications in numerous domains.
This paper proposes a new method for using implicit user feedback from clickthrough data to provide personalized ranking of results in a video retrieval system. The annotation based search is complemented with a featu...
详细信息
ISBN:
(纸本)9781595937827
This paper proposes a new method for using implicit user feedback from clickthrough data to provide personalized ranking of results in a video retrieval system. The annotation based search is complemented with a feature based ranking in our approach. The ranking algorithm uses belief revision in a Bayesian Network, which is derived from a multimedia ontology that captures the probabilistic association of a concept with expected video features. We have developed a content model for videos using discrete feature states to enable Bayesian reasoning and to alleviate on-line feature processing overheads. We propose a reinforcement learning algorithm for the parameters of the Bayesian Network with the implicit feedback obtained from the clickthrough data. Copyright 2007 ACM.
Car insurance claims are rising in tandem with the rising tide of car users. Every insurance claim requires an engineer's manual assessment and a surveyor's actual examination. This procedure can last anywhere...
详细信息
Large space with many cameras require huge storage and computational power to process these data for surveillance applications. In this paper we propose a distributed camera and processing based face detection and rec...
详细信息
Large space with many cameras require huge storage and computational power to process these data for surveillance applications. In this paper we propose a distributed camera and processing based face detection and recognition system which can generate information for finding spatiotemporal movement pattern of individuals over a large monitored space. The system is built upon Hadoop Distributed File System using map reduce programming model. A novel key generation scheme using distance based hashing technique has been used for distribution of the face matching task. Experimental results have established effectiveness of the technique.
This paper proposes a transform domain data-hiding scheme for quality access control of images. The original image is decomposed into tiles by applying n-level lifting-based discrete wavelet transformation (DWT). A bi...
详细信息
This paper proposes a transform domain data-hiding scheme for quality access control of images. The original image is decomposed into tiles by applying n-level lifting-based discrete wavelet transformation (DWT). A binary watermark image (external information) is spatially dispersed using the sequence of number generated by a secret key. The encoded watermark bits are then embedded into all DWT-coefficients of n th -level and only in the high-high (HH) coefficients of the subsequent levels using dither modulation (DM) but without complete self-noise suppression. It is well known that due to insertion of external information, there will be degradation in visual quality of the host image (cover). The degree of deterioration depends on the amount of external data insertion as well as step size used for DM. If this insertion process is reverted, better quality of images can be accessed. To achieve that goal, watermark bits are detected using minimum distance decoder and the remaining self-noise due to information embedding is suppressed to provide better quality of image. The simulation results have shown the validity of this claim.
We target the problem of image Denoising using Gaussian Processes Regression (GPR). Being a non-parametric regression technique, GPR has received much attention in the recent past and here we further explore its versa...
详细信息
We target the problem of image Denoising using Gaussian Processes Regression (GPR). Being a non-parametric regression technique, GPR has received much attention in the recent past and here we further explore its versatility by applying it to a denoising problem. The focus is primarily on the design of a local gradient sensitive kernel that captures pixel similarity in the context of image denoising. This novel kernel formulation is used to shape the smoothness of the joint GP prior. We apply the GPR denoising technique to small patches and then stitch back these patches, this allows the priors to be local and relevant, also this helps us in dealing with GPR complexity. We demonstrate that our GPR based technique gives better PSNR values in comparison to existing popular denoising techniques.
暂无评论