Finding regions of interest (ROIs) is a fundamentally important problem in the area of computer vision and imageprocessing. Previous studies addressing this issue have mainly focused on investigating chromatic cues t...
详细信息
Finding regions of interest (ROIs) is a fundamentally important problem in the area of computer vision and imageprocessing. Previous studies addressing this issue have mainly focused on investigating chromatic cues to characterize visually salient image regions, while less attention has been devoted to monochromatic cues. The purpose of this paper is the study of monochromatic cues, which have the potential to complement chromatic cues, for the detection of ROIs in an image. This paper first presents a taxonomy of existing ROI detection approaches using monochromatic cues, ranging from well-known algorithms to the most recently published techniques. We then propose a novel monochromatic cue for ROI detection. Finally, a comparative evaluation has been conducted on large scale challenging test sets of real-world natural scenes. Experimental results demonstrate that the use of our proposed monochromatic cue yields a more accurate identification of ROIs. This paper serves as a benchmark for future research on this particular topic and a steppingstone for developers and practitioners interested in adopting monochromatic cues to ROI detection systems and methodologies. (C) 2014 Elsevier B.v. All rights reserved.
The main current directions of research in nanoscale images processing of a high degree of detail are shown in the paper. A system of algorithms for solving a wide range of tasks for the structural analysis of the ima...
详细信息
ISBN:
(纸本)9781479962211
The main current directions of research in nanoscale images processing of a high degree of detail are shown in the paper. A system of algorithms for solving a wide range of tasks for the structural analysis of the images of industrial materials has been developed. The experimental results of the developed algorithms have been presented in the article. The results of the imageprocessing of industrial materials have been shown.
Advances in computational methods and hardware platforms provide efficient processing of medical-imaging datasets for surgical planning. For neurosurgical interventions employing a straight access path, planning entai...
详细信息
Advances in computational methods and hardware platforms provide efficient processing of medical-imaging datasets for surgical planning. For neurosurgical interventions employing a straight access path, planning entails selecting a path from the scalp to the target area that's of minimal risk to the patient. A proposed GPU-accelerated method enables interactive quantitative estimation of the risk for a particular path. It exploits acceleration spatial data structures and efficient implementation of algorithms on GPUs. In evaluations of its computational efficiency and scalability, it achieved interactive rates even for high-resolution meshes. A user study and feedback from neurosurgeons identified this methods' potential benefits for preoperative planning and intraoperative replanning.
We study the efficacy of utilizing a powerful image descriptor, the curvelet transform, to learn a no-reference (NR) image quality assessment (IQA) model. A set of statistical features are extracted from a computed im...
详细信息
We study the efficacy of utilizing a powerful image descriptor, the curvelet transform, to learn a no-reference (NR) image quality assessment (IQA) model. A set of statistical features are extracted from a computed image curvelet representation, including the coordinates of the maxima of the log-histograms of the curvelet coefficients values, and the energy distributions of both orientation and scale in the curvelet domain. Our results indicate that these features are sensitive to the presence and severity of image distortion. Operating within a 2-stage framework of distortion classification followed by quality assessment, we train an image distortion and quality prediction engine using a support vector machine (SvM). The resulting algorithm, dubbed CurveletQA for short, was tested on the LIvE IQA database and compared to state-of-the-art NR/FR IQA algorithms. We found that CurveletQA correlates well with human subjective opinions of image quality, delivering performance that is competitive with popular full-reference (FR) IQA algorithms such as SSIM, and with top-performing NR IQA models. At the same time, CurveletQA has a relatively low complexity.(c) 2014 Elsevier B.v. All rights reserved.
Optical character recognition systems (OCR) have been effectively developed for the recognition of printed characters. One such application is the identifying engine number and chassis number which is engraved on mach...
详细信息
ISBN:
(纸本)9781467377591
Optical character recognition systems (OCR) have been effectively developed for the recognition of printed characters. One such application is the identifying engine number and chassis number which is engraved on machine parts. Manual logging of serial numbers in industries is very tedious and a time consuming affair. Our proposed system is robust under poor illumination conditions. Our overall system is efficient and can be applied in realtime applications. Since OCR is well-studied area where powerful algorithms like Zidouri algorithm for letter segmentation, Blob detection algorithm for removal of unwanted areas and character extraction, Hilditch algorithm for Arabic character recognition already exists, our OCR based engraved character recognition yields more accurate results up to 99.99% accuracy. The paper explains how optical character recognition technique along with computer vision can be applied to identify engine number and chassis number which are engraved on two and four wheeler vehicles.
Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of the most relevant proposed applications of it are in call-centers. In these scenarios the spee...
详细信息
Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of the most relevant proposed applications of it are in call-centers. In these scenarios the speech is distorted by compression algorithms. The effects of such distortion on the performance of systems for automatic recognition of emotions must be assessed. In this study these effects are evaluated independently of any other distortions generated by the communications channel. Several state-of-the-art codecs are used to compress the speech signals of two emotional speech databases. The databases used are the Berlin Database of Emotional Speech and the enterface05. The methodology considers voiced and unvoiced segments of the speech separately. Spectral, cepstral, noise and Non-Linear Dynamics (NLD) measures are used to characterize the segments. Finally, a classifier based on a Gaussian Mixture Model (GMM) is used to identify the emotion. The results indicate that voiced segments are less affected by the compression than unvoiced ones in terms in classification accuracy. They also show that the bandwidth of the analyzed signals is an important factor in the classification results.
Captured retina images enable important parts of the visual system to be analyzed. Automated retinal imageprocessing is becoming a primary screening tool for the detection of diseases such as diabetic retinopathy (DR...
详细信息
Captured retina images enable important parts of the visual system to be analyzed. Automated retinal imageprocessing is becoming a primary screening tool for the detection of diseases such as diabetic retinopathy (DR). An automated system reduces human error and also reduces the burden on ophthalmologists. The accurate detection of microaneurysms (MAs) is an important step for the early detection of DR. MAs appear as a first sign of DR and can be seen on retina images. This paper discusses some of the current techniques used to automatically detect MAs from retinal digital fundus images. This review outlines the general principle upon which retinal digital image analysis is based for the detection of MAs. The algorithms are categorized according to four processing steps (preprocessing, candidate MA detection, feature extraction, and classification). various gold standard or ground truth databases, data sample size, and the use of image databases are discussed. The variety of outcome measures and flaws in the literature are discussed. The challenges and future potential for research are discussed to provide guidance to algorithm designers of the early detection of DR.
Current vision systems are designed to perform in normal weather condition. However, no one can escape from severe weather conditions. Bad weather reduces scene contrast and visibility, which results in degradation in...
详细信息
ISBN:
(纸本)9781627055765
Current vision systems are designed to perform in normal weather condition. However, no one can escape from severe weather conditions. Bad weather reduces scene contrast and visibility, which results in degradation in the performance of various computer vision algorithms such as object tracking, segmentation and recognition. Thus, current vision systems must include some mechanisms that enable them to perform up to the mark in bad weather conditions such as rain and fog. Rain causes the spatial and temporal intensity variations in images or video frames. These intensity changes are due to the random distribution and high velocities of the raindrops. Fog causes low contrast and whiteness in the image and leads to a shift in the color. This book has studied rain and fog from the perspective of vision. The book has two main goals: 1) removal of rain from videos captured by a moving and static camera, 2) removal of the fog from images and videos captured by a moving single uncalibrated camera system. The book begins with a literature survey. Pros and cons of the selected prior art algorithms are described, and a general framework for the development of an efficient rain removal algorithm is explored. Temporal and spatiotemporal properties of rain pixels are analyzed and using these properties, two rain removal algorithms for the videos captured by a static camera are developed. For the removal of rain, temporal and spatiotemporal algorithms require fewer numbers of consecutive frames which reduces buffer size and delay. These algorithms do not assume the shape, size and velocity of raindrops which make it robust to different rain conditions (i.e., heavy rain, light rain and moderate rain). In a practical situation, there is no ground truth available for rain video. Thus, no reference quality metric is very useful in measuring the efficacy of the rain removal algorithms. Temporal variance and spatiotemporal variance are presented in this book as no reference quality metr
Micro Aerial vehicles (MAv's) are becoming ubiquitous with its ever increasing applications in defense, space and environmental sectors. In real time scenario, MAv's are expected to perform autonomously and de...
详细信息
Micro Aerial vehicles (MAv's) are becoming ubiquitous with its ever increasing applications in defense, space and environmental sectors. In real time scenario, MAv's are expected to perform autonomously and development of intelligent algorithms meant for pattern recognition and object tracking are most demanding. This work concentrates on the development of vision based navigation system for real time target tracking using MAvs. The target is identified based on its color feature and various color models namely RGB, Normalized RGB, HSI, YUv, YIQ, YCbCr, CIELAB and CIELUv are considered for thresholding analysis. The idea is to frame an effective imageprocessing algorithm concerning thresholding time and accuracy. In addition, the robustness of the color models for various noises such as fast fading, gaussian blur, jpeg, jp2k and white noise are also investigated. Simulation results suggests that, Y based color models exhibits less thresholding time, good accuracy and robust to noise. The target tracking algorithm is developed using optimum color model and justified through real time experimentation. A MATLAB (Matrix Laboratory) based navigation system is developed encompassing micro camera, A/v transmitter and receiver unit, flight controller, imageprocessing system and other interfacing circuits. The navigation system is successfully tested in our lab environments and it is proven to be a realizable and a cost effective solution.
The work addresses the problem of compensating the distortion effects induced by the translational motion of moving targets in Inverse Synthetic Aperture Radar (ISAR) imaging systems. The ISAR motion compensation is t...
详细信息
ISBN:
(纸本)9781479979301
The work addresses the problem of compensating the distortion effects induced by the translational motion of moving targets in Inverse Synthetic Aperture Radar (ISAR) imaging systems. The ISAR motion compensation is the most crucial step in the Autofocusing ISAR technique;this task is typically solved by implementing exhaustive search algorithms by adopting proper functionals based f.i. on image entropy or image contrast. In this work, we discuss an innovative and fast motion compensation procedure that is based on the estimation of two Doppler key Parameters: the Doppler Centroid and the Doppler Rate, which are related to the target motion parameters. The effectiveness of the proposed method is tested on real data acquired by a static Frequency Modulated Continuous Wave radar with an azimuth wide beamwidth;the radar is installed near the inner harbor of La Spezia (Italy) and it owned to the Centre for Maritime Research and Experimentation of the North Atlantic Treaty Organization (CMRE-NATO).
暂无评论