This article describes our recent study of a real-life face recognition problem using a hybrid architecture consisting of a very deep convolution neural network (CNN) and a support vector machine (SVM). The novel aspe...
详细信息
computervision and robotic assistance are increasingly being used to improve the quality of surgical interventions. Tool tracking becomes critical in interventions viz. endoscopy, laparoscopy and retinal microsurgery...
详细信息
Hand shredded content-less pages reassembly is a challenging task. This has applications in forensics and fun games. The process is even more tedious when the number of pages from which the fragments are obtained is u...
详细信息
Magnetic Resonance Imaging (MRI) system in recent times demands a high rate of acceleration in data acquisition to reduce the scanning time. The data acquisition rate can be accelerated to a significant order through ...
详细信息
Feature extraction is an essential step in many imageprocessing and computervision applications. It is quite desirable that the extracted features can effectively represent an image. Furthermore, the dominant inform...
详细信息
ISBN:
(纸本)9788132225386;9788132225379
Feature extraction is an essential step in many imageprocessing and computervision applications. It is quite desirable that the extracted features can effectively represent an image. Furthermore, the dominant information visually perceived by human beings should be efficiently represented by the extracted features. Over the last few decades, different algorithms are proposed to address the major issues of image representations by the efficient features. Gabor wavelet is one of the most widely used filters for image feature extraction. Existing Gabor wavelet-based feature extraction methodologies unnecessarily use both the real and the imaginary coefficients, which are subsequently processed by dimensionality reduction techniques such as PCA, LDA etc. This procedure ultimately affects the overall performance of the algorithm in terms of memory requirement and the computational complexity. To address this particular issue, we proposed a local image feature extraction method by using a Gabor wavelet. In our method, an image is divided into overlapping image blocks, and subsequently each of the image blocks are separately filtered out by Gabor wavelet. Finally, the extracted coefficients are concatenated to get the proposed local feature vector. The efficacy and effectiveness of the proposed feature extraction method is evaluated using the estimation of mean square error (MSE), peak signal-to-noise ratio (PSNR), and the correlation coefficient (CC) by reconstructing the original image using the extracted features, and compared it with the original input image. All these performance evaluation measures clearly show that real coefficients of the Gabor filter alone can effectively represent an image as compared to the methods which utilize either the imaginary coefficients or the both. The major novelty of our method lies on our claim-capability of the real coefficients of a Gabor filter for image representation.
Office automation is an active area of research. It involves archival and retrieval of official documents. For developing a system for this purpose, it is necessary to have an extensive benchmark dataset consisting va...
详细信息
Automatic License Plate Recognition (ALPR) has important applications in traffic surveillance. It is a challenging problem especially in countries like in India where the license plates have varying sizes, number of l...
详细信息
ISBN:
(纸本)9781450347532
Automatic License Plate Recognition (ALPR) has important applications in traffic surveillance. It is a challenging problem especially in countries like in India where the license plates have varying sizes, number of lines, fonts etc. The difficulty is all the more accentuated in traffic videos as the cameras are placed high and most plates appear skewed. This work aims to address ALPR using Deep CNN methods for real-time traffic videos. We first extract license plate candidates from each frame using edge information and geometrical properties, ensuring high recall. These proposals are fed to a CNN classifier for License Plate detection obtaining high precision. We then use a CNN classifier trained for individual characters along with a spatial transformer network (STN) for character recognition. Our system is evaluated on several traffic videos with vehicles having different license plate formats in terms of tilt, distances, colors, illumination, character size, thickness etc. Results demonstrate robustness to such variations and impressive performance in both the localization and recognition. We also make available the dataset for further research on this topic.
In this article, we propose a novel multimodal Medical image Fusion (MIF) method based on a neuro-fuzzy technique in the transform (Non-Subsampled Shearlet Transform (NSST)) domain for spatially registered, multi-moda...
详细信息
An image captured in dark environment usually has ambient illumination, but the image looks dark and noisy. However, the use of flash can introduce unwanted artifacts such as sharp shadows at silhouettes, red eyes, an...
详细信息
ISBN:
(纸本)9781467385640
An image captured in dark environment usually has ambient illumination, but the image looks dark and noisy. However, the use of flash can introduce unwanted artifacts such as sharp shadows at silhouettes, red eyes, and non-uniform brightness in the image. We propose a new framework to enhance photographs captured in dark environments by combining the best features from a flash and a no-flash image. We use sparse and redundant dictionary learning based approach to denoise the no-flash image. A weighted least squares framework is used to transfer sharp details from the flash image into the no-flash image. We show that our approach is simple and able to generate better images than that of the state-of-the-art flash/no-flash fusion method.
暂无评论