this book constitutes the refereed proceedings of the 4th International conference on Pattern Recognition and Machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. the 65 revised papers presented...
ISBN:
(数字)9783642217869
ISBN:
(纸本)9783642217852
this book constitutes the refereed proceedings of the 4th International conference on Pattern Recognition and Machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. the 65 revised papers presented together with 5 invited talks were carefully reviewed and selected from 140 submissions. the papers are organized in topical sections on pattern recognition and machine learning; image analysis; image and video information retrieval; natural language processing and text and data mining; watermarking, steganography and biometrics; soft computing and applications; clustering and network analysis; bio and chemo analysis; and document imageprocessing.
Touching characters are major problem of achieving higher recognition rate in Optical Character Recognition (OCR). Present OCR systems do not perform well when adjacent characters touch. If characters are touched in g...
详细信息
ISBN:
(纸本)9781424442195
Touching characters are major problem of achieving higher recognition rate in Optical Character Recognition (OCR). Present OCR systems do not perform well when adjacent characters touch. If characters are touched in graphical documents (e.g. map) then such touching string recognition is more difficult because in such documents touching characters appear in multi-oriented direction. In this paper, we present a scheme towards the recognition of English two-character multi-oriented touching strings. When two or more characters touch, the), generate a big cavity region at the background portion and we used this background information in our scheme. To handle the background information, convex hull is used. In this scheme, at first, a set of initial segmentation points is predicted based on the concave residues of the convex hull of the touching characters. Next, based on the initial points, we select some candidate segmentation lines. Finally the recognition confidence of two sub-images of a touching string, obtained from each candidate segmentation line is computed. the candidate segmentation line from which we get optimum confidence is the actual segmentation line and the corresponding characters in favour of which the two segmentation parts show optimum confidence is the recognition result of the touching string. To compute the recognition confidence, SVM classifier is used. the features used in the SVM are invariant to character orientation. Circular ring and convex hull ring based approach has been used along with angular information of the contour pixels of the character to make the feature rotation invariant. From the experiment we obtained encouraging result.
Person re-identification addresses the problem of matching pedestrian images across disjoint camera views. Design of feature descriptor and distance metric learning are the two fundamental tasks in person re-identific...
详细信息
ISBN:
(纸本)9781450366151
Person re-identification addresses the problem of matching pedestrian images across disjoint camera views. Design of feature descriptor and distance metric learning are the two fundamental tasks in person re-identification. In this paper, we propose a metric learning framework for person re-identification, where the discriminative metric space is learned using Kernel Fisher Discriminant Analysis (KFDA), to simultaneously maximize the inter-class variance as well as minimize the intra-class variance. We derive a Mahalanobis metric induced by KFDA and argue that KFDA is efficient to be applied for metric learning in person re-identification. We also show how the efficiency of KFDA in metric learning can be further enhanced for person re-identification by using two simple yet efficient multiple kernel learning methods. We conduct extensive experiments on three benchmark datasets for person re-identification and demonstrate that the proposed approaches have competitive performance with state-of-the-art methods.
Nonlocal means (NLM) video denoising algorithm though provide very competitive results, suffer from high computational cost. We propose to reduce the computations through the concept of dimensionality reduction using ...
详细信息
We present an automatic two dimensional model based non-photorealistic painterly rendering method which uses automatic relative focus map extraction from the model image to produce a relevance-based multilayer paintin...
详细信息
the bilateral filter is an edge-preserving smoother that has diverse applications in imageprocessing, computervision, computergraphics, and computational photography. the filter uses a spatial kernel along with a r...
详细信息
ISBN:
(纸本)9781509017461
the bilateral filter is an edge-preserving smoother that has diverse applications in imageprocessing, computervision, computergraphics, and computational photography. the filter uses a spatial kernel along with a range kernel to perform edge-preserving smoothing. In this paper, we consider the Gaussian bilateral filter where boththe kernels are Gaussian. A direct implementation of the Gaussian bilateral filter requires O (sigma(2)(s)) operations per pixel, where sigma(s) is the standard deviation of the spatial Gaussian. In fact, it is well-known that the direct implementation is slow in practice. We present an approximation of the Gaussian bilateral filter, whereby we can cut down the number of operations to O (1) per pixel for any arbitrary sigma(s), and yet achieve very high-quality filtering that is almost indistinguishable from the output of the original filter. We demonstrate that the proposed approximation is few orders faster in practice compared to the direct implementation. We also demonstrate that the approximation is competitive with existing fast algorithms in terms of speed and accuracy.
this book constitutes the thoroughly refereed post-workshop proceedings of the International Workshop on Clinical image-based Procedures: From Planning to Intervention, CLIP 2012, held in Nice, France, in conjunction ...
详细信息
ISBN:
(数字)9783642380792
ISBN:
(纸本)9783642380785
this book constitutes the thoroughly refereed post-workshop proceedings of the International Workshop on Clinical image-based Procedures: From Planning to Intervention, CLIP 2012, held in Nice, France, in conjunction withthe 15th International conference on Medical image Computing and computer-Assisted Intervention, MICCAI 2012.
this successful workshop was a productive and exciting forum for the discussion and dissemination of clinically tested, state-of-the-art methods for image-based planning, monitoring and evaluation of medical procedures. the 16 papers presented in this volume were carefully reviewed and selected from 24 submissions.
In this paper we present a novel system for facilitating the creation of stylized view-dependent 3D animation. Our system harnesses the skill and intuition of a traditionally trained animator by providing a convivial ...
详细信息
In this paper we present a novel system for facilitating the creation of stylized view-dependent 3D animation. Our system harnesses the skill and intuition of a traditionally trained animator by providing a convivial sketch based 2D to 3D interface. A base mesh model of the character can be modified to match closely to an input sketch, with minimal user interaction. To do this, we recover the best camera from the intended view direction in the sketch using robust computervision techniques. this aligns the mesh model withthe sketch. We then deform the 3D character in two stages-first we reconstruct the best matching skeletal pose from the sketch and then we deform the mesh geometry. We introduce techniques to incorporate deformations in the view-dependent setting. this allows us to set up view-dependent models for animation.
the presence of haze within the atmospheric medium degrades the quality of videos captured by camera sensors. the expulsion of haze, referred to as dehazing, is typically performed subject to the physical degradation ...
详细信息
ISBN:
(纸本)9781450366151
the presence of haze within the atmospheric medium degrades the quality of videos captured by camera sensors. the expulsion of haze, referred to as dehazing, is typically performed subject to the physical degradation display, that involves an explanation of an ill-posed inverse drawback. A few efforts have been made for image dehazing, whereas, video dehazing still remains an unexplored area of research. this paper proposes an approach for video dehazing combining the concepts of single image dehazing, optical stream estimation and Markov Random Field (MRF). the proposed method enhances the temporal and spatial coherence of the hazy video. Assuming that the dark channel of the haze-free picture is zero, we acquire the raw transmission map. In the proposed approach, we focus on the raw transmission map obtained from the dark channel prior using guided filter. We assess the forward and reverse optical streams between the neighboring frames to locate individual pixels using Linear Discriminant Analysis. the color of the haze-free pixels in the frames is approximated by a few hundred discrete colors, which generate a fixed cluster in space and the directions of the pixel. the pixels at a given cluster are spread and can be determined by analyzing the forward and in reverse optical frames to predict its value after haze removal. Largest Margin Nearest Neighbor (LMNN) algorithm is applied to get the smooth transmission map of the foggy frames of the video to approximate the pixel value in the RGB space. the stream fields are utilized in an augmented MRF model on the transmission guide obtained to enhance the temporal and the spatial coherence of the transmission. the proposed method is compared against the state-of-the-art on both real and synthetic videos to preserve the information optimally.
Multiplayer Online Battle Arena (MOBA) is a computer game genre with increasing popularity and leaping revenues. the MOBA phenomenon is particularly interesting from an economic perspective since the majority of purch...
详细信息
暂无评论