Super-resolution (SR) is a technique to improve the resolution of an image from a sequence of input images or from a single image. As SR is an ill-posed inverse problem, it leads to many suboptimal solutions. Since mo...
详细信息
Automatic License Plate Recognition (ALPR.) has important applications in traffic surveillance. It is a challenging problem especially in countries like in India where the license plates have varying sizes, number of ...
详细信息
ISBN:
(纸本)9781450347532
Automatic License Plate Recognition (ALPR.) has important applications in traffic surveillance. It is a challenging problem especially in countries like in India where the license plates have varying sizes, number of lines, fonts etc. the difficulty is all the more accentuated in traffic videos as the cameras are placed high and most plates appear skewed. this work aims to address ALPR. using Deep CNN methods for real-time traffic videos. We first extract license plate candidates from each frame using edge information and geometrical properties, ensuring high recall. these proposals are fed to a CNN classifier for License Plate detection obtaining high precision. We then use a CNN classifier trained for individual characters along with a spatial transformer network (STN) for character recognition. Our system is evaluated on several traffic videos with vehicles having different license plate formats in terms of tilt, distances, colors, illumination, character size, thickness etc. Results demonstrate robustness to such variations and impressive performance in boththe localization and recognition. We also make available the dataset for further research on this topic.
Face Recognition (FR) using Convolutional Neural Network (CNN) based models have achieved considerable success in constrained environments. they however fail to perform well in unconstrained scenarios, especially when...
详细信息
ISBN:
(纸本)9781450366151
Face Recognition (FR) using Convolutional Neural Network (CNN) based models have achieved considerable success in constrained environments. they however fail to perform well in unconstrained scenarios, especially when the images are captured using surveillance cameras. these probe samples suffer from degradations such as noise, poor illumination, low resolution, blur as well as aliasing, when compared to the rich training (gallery) set, comprising mostly of mugshot images captured in laboratory settings. these images in the training (gallery) set are crisp and have high contrast, compared to the probe samples. To cope withthis scenario, we propose a novel dual-pathway generative adversarial network (DP-GAN) which maps low resolution images captured using surveillance camera into their corresponding high resolution images, which are gallery-like, using a novel combination of multi-scale reconstruction and Jensen-Shannon divergence based loss. these images thus obtained are then used to train a deep domain adaptation (deep-DA) network to perform the task of FR. the proposed network achieves superior results (>90%) on four benchmark surveillance face datasets, evident from the rank-1 recognition rates when compared with recent state-of-the-art CNN-based techniques.
Brain mapping research is facilitated by first aligning digital images of mouse brain slices to standardized atlas framework such as the Allen Reference Atlas (ARA). However, conventional processing of these brain sli...
详细信息
ISBN:
(纸本)9781450347532
Brain mapping research is facilitated by first aligning digital images of mouse brain slices to standardized atlas framework such as the Allen Reference Atlas (ARA). However, conventional processing of these brain slices introduces many histological artifacts such as tears and missing regions in the tissue, which make the automatic alignment process extremely challenging. We present an end-to-end fully automatic registration pipeline for alignment of digital images of mouse brain slices that may have histological artifacts, to a standardized atlas space. We use a geometric approach where we first align the bounding box of convex hulls of brain slice contours and atlas template contours, which are extracted using a variant of Canny edge detector. We then detect the artifacts using Constrained Delaunay Triangulation (CDT) and remove them from the contours before performing global alignment of points using iterative closest point (ICP). this is followed by a final non-linear registration by solving the Laplace's equation with Dirichlet boundary conditions. We tested our algorithm on 200 mouse brain slice images including slices acquired from conventional processing techniques having major histological artifacts, and from serial two-photon tomography (STPT) with no major artifacts. We show significant improvement over other registration techniques, both qualitatively and quantitatively, in all slices especially on slices with significant histological artifacts.
Camera-based lane detection and vehicle tracking algorithms are one of the keystones for many autonomous systems. the navigational process of those systems is mainly focused on the output of detection algorithms. Howe...
详细信息
this book constitutes the refereed proceedings of the 6th International conference, ICISP 2014, held in June/July 2014 in Cherbourg, France. the 76 revised full papers were carefully reviewed and selected from 164 sub...
ISBN:
(纸本)9783319079998
this book constitutes the refereed proceedings of the 6th International conference, ICISP 2014, held in June/July 2014 in Cherbourg, France. the 76 revised full papers were carefully reviewed and selected from 164 submissions. the contributions are organized in topical sections on multispectral colour science, color imaging and applications, digital cultural heritage, document image analysis, graph-based representations, image filtering and representation, computervision and pattern recognition, computergraphics, biomedical, and signal processing.
Super-resolving a noisy image is a challenging problem, and needs special care as compared to the conventional super resolution approaches, when the power of noise is unknown. In this scenario, we propose an approach ...
详细信息
Content-aware applications in computational photography define the relative importance of objects or actions present in an image using a saliency map. Most saliency detection algorithms learn from the human visual sys...
详细信息
Bayesian Sparse Signal Recovery (SSR) for Multiple Measurement Vectors, when elements of each row of solution matrix are correlated, is addressed in the paper. We propose a standard linear Gaussian observation model a...
详细信息
ISBN:
(纸本)9781450366151
Bayesian Sparse Signal Recovery (SSR) for Multiple Measurement Vectors, when elements of each row of solution matrix are correlated, is addressed in the paper. We propose a standard linear Gaussian observation model and a three-level hierarchical estimation framework, based on Gaussian Scale Mixture (GSM) model with some random and deterministic parameters, to model each row of the unknown solution matrix. this hierarchical model induces heavy-tailed marginal distribution over each row which encompasses several choices of distributions viz. Laplace distribution, Student's t distribution and Jeffery prior. Automatic Relevance Determination (ARD) phenomenon introduces sparsity in the model. It is interesting to see that Block Sparse Bayesian Learning framework is a special case of the proposed framework when induced marginal is Jeffrey prior. Experimental results for synthetic signals are provided to demonstrate its effectiveness. We also explore the possibility of using Multiple Measurement Vectors to model Dynamic Hand Posture Database which consists of sequence of temporally correlated hand posture sequence. It can be seen that by exploiting temporal correlation information present in the successive image samples, the proposed framework can reconstruct the data with less linear random measurements with high fidelity.
this book constitutes the thoroughly refereed post-conference proceedings of the four workshops on Photographic Aesthetics and Non-Photorealistic Rendering (PAESNPR13), Geometric Properties from Incomplete Data (GPID)...
详细信息
ISBN:
(数字)9783642539268
ISBN:
(纸本)9783642539251
this book constitutes the thoroughly refereed post-conference proceedings of the four workshops on Photographic Aesthetics and Non-Photorealistic Rendering (PAESNPR13), Geometric Properties from Incomplete Data (GPID), Quality Assessment and Control by image and Video Analysis (QACIVA) and Geometric Computation for computervision (GCCV2013), held in conjunction withthe 6th Pacific-Rim Symposium on image and Video Technology (PSIVT) in Guanajuato, Mexico during October 28-November 1, 2013. the 38 revised full papers presented were carefully selected from numerous submissions and cover all aspects of Imaging and graphics Hardware and Visualization, image/Video Coding and Transmission; processing and Analysis; Retrieval and Scene Understanding, but also Applications of image and Video Technology, Biomedical imageprocessing and Analysis, Biometrics and image Forensics, Computational Photography and Arts, computer and Robot vision, Pattern Recognition and Video Surveillance.
暂无评论