AM-FM models analyze an image in terms of amplitude and frequency modulated sinusoids. Such models are well suited to describe the local non-stationarities of an image. In this paper the invariance of AM-FM features w...
详细信息
ISBN:
(纸本)9781424442195
AM-FM models analyze an image in terms of amplitude and frequency modulated sinusoids. Such models are well suited to describe the local non-stationarities of an image. In this paper the invariance of AM-FM features with respect to scaling, rotation and illumination changes is presented. These properties can be used in applications of AM-FM image models where features for affine transformed images need to be computed. The application of invariance properties in template tracking is illustrated using a particle filter based tracker that uses the AM-FM features.
Performance of an OCR system is badly affected due to presence of hand-drawn annotation lines in various forms, such as underlines, circular lines, and other text-surrounding curves. Such annotation lines are drawn by...
详细信息
ISBN:
(纸本)9781479915880
Performance of an OCR system is badly affected due to presence of hand-drawn annotation lines in various forms, such as underlines, circular lines, and other text-surrounding curves. Such annotation lines are drawn by a reader usually in free hand in order to summarize some text or to mark the keywords within a document page. In this paper, we propose a generalized scheme for detection and removal of these handdrawn annotations from a scanned document page. An underline drawn by hand is roughly horizontal or has a tolerable undulation, whereas for a hand-drawn curved line, the slope usually changes at a gradual pace. Based on this observation, we detect the cover of an annotation object-be it straight or curvedas a sequence of straight edge segments. The novelty of the proposed method lies in its ability to compute the exact cover of the annotation object, even when it touches or passes through any text character. After getting the annotation cover, an effective method of inpainting is used to quantify the regions where text reconstruction is needed. We have done our experimentation with various documents written in English, and some results are presented here to show the efficiency and robustness of the proposed method.
image registration is an essential step in many computervision applications which demands high accuracy for significantly random and complex deformations. In medical imageprocessing applications image registration i...
详细信息
ISBN:
(纸本)9781450366151
image registration is an essential step in many computervision applications which demands high accuracy for significantly random and complex deformations. In medical imageprocessing applications image registration is a basic preprocessing step to non-rigidly align images from different acquisition environments to an atlas image. We propose a novel non-rigid image registration method to get more reliable registration even under noisy conditions with manageable time complexity. The proposed method explores the the inherent multi-resolution capability of wavelets to perform nonrigid registration in a graph environment. The inherent time complexity of wavelet feature map calculation is avoided using Chebyshev Polynomial approximations for the wavelet operators.
This paper presents an efficient combination of two well-known tracking algorithms, Tracking-Learning-Detection (TLD) and Compressive Tracking (CT) to devise an algorithm which takes advantages of both and outperforms...
详细信息
ISBN:
(纸本)9781467385640
This paper presents an efficient combination of two well-known tracking algorithms, Tracking-Learning-Detection (TLD) and Compressive Tracking (CT) to devise an algorithm which takes advantages of both and outperforms them on their short-ends by virtue of other. TLD fails in cases including full out-of-plane rotation, fast motion and articulated object tracking. While CT fails in resuming tracking once the object leaves the frame and comes back. We propose a combining algorithm mentioned as Algorithm 1, which robustly handles all the tracking challenges. Different thresholds are set which can be varied to weigh each component as required. The proposed algorithm is tested on different test sequences involving challenging tracking scenarios such as fast motion and their success rates are calculated in Table I. The proposed algorithm works favourably against both algorithms in terms of robustness and success rate.
Underwater images suffer from non uniform contrast and poor visibility due to bad illumination and color cast in deep water. Such images have a hazy and color diminished appearance making underwater studies a difficul...
详细信息
ISBN:
(纸本)9781467385640
Underwater images suffer from non uniform contrast and poor visibility due to bad illumination and color cast in deep water. Such images have a hazy and color diminished appearance making underwater studies a difficult task. Researches in last decades performed color correction, assuming that underwater images have bluish color cast which is not always true. In this paper, a new image enhancement approach is proposed which modifies the gray world algorithm by finding the color cast using fuzzy logic and then removing the color cast by optimizing the correction method using Bacterial Foraging Optimisation (BFO). Proposed approach is adaptive in nature as it finds the intensity of color cast instead of assuming it which improves the quality of underwater images. Computed results have enhanced visual details, contrast and color performance.
Egocentric cameras are wearable cameras mounted on a person's head or shoulder. With their ability to have first person view, such cameras are spawning new set of exciting applications in computervision. Recognis...
详细信息
ISBN:
(纸本)9781467385640
Egocentric cameras are wearable cameras mounted on a person's head or shoulder. With their ability to have first person view, such cameras are spawning new set of exciting applications in computervision. Recognising activity of the wearer from an egocentric video is an important but challenging problem. The task is made especially difficult because of unavailability of wearer's pose as well as extreme camera shake due to motion of wearer's head. Solutions suggested so far for the problem, have either focussed on short term actions such as pour, stir etc. or long term activities such as walking, driving etc. The features used in both the styles are very different and the technique developed for one style often fail miserably on other kind. In this paper we propose a technique to identify if a long term or a short term action is present in an egocentric video segment. This allows us to have a generic first-person action recognition system where we can recognise both short term as well as long term actions of the wearer. We report an accuracy of 90.15% for our classifier on publicly available egocentric video dataset comprising 18 hours of video amounting to 1.9 million tested samples.
We present a novel algorithm to remove near regular, fence or wire like foreground patterns from an image. The fence detection or fence removal algorithms, developed so far, have poor performance in detecting the fenc...
详细信息
ISBN:
(纸本)9781450347532
We present a novel algorithm to remove near regular, fence or wire like foreground patterns from an image. The fence detection or fence removal algorithms, developed so far, have poor performance in detecting the fence. We use signal demixing to utilize the sparsity and regularity property of fences to detect them. Results demonstrate the effectiveness of our technique as compared to other state of the art techniques.
Bishnupur is an attractive tourist place in West Bengal, India and is known for its terracotta temples. The place is one of the prospective candidates to be included in the list of UNESCO World Heritage sites. We inte...
详细信息
ISBN:
(纸本)9781450347532
Bishnupur is an attractive tourist place in West Bengal, India and is known for its terracotta temples. The place is one of the prospective candidates to be included in the list of UNESCO World Heritage sites. We intend to preserve this heritage site digitally and also to present some virtual interaction for the tourist and researchers. In this paper, we present an image dataset of different temples (namely, Jor Bangla, Kalachand, Madan Mohan, Radha Madhav, Rasmancha, Shyamrai and Nandalal) in Bishnupur for evaluating different types of computervision and imageprocessing algorithms (like 3D reconstruction, image inpainting, texture classification and content specific image retrieval). The dataset is captured using four different cameras with different parameter settings. Some datasets are extracted and earmarked for certain applications such as texture classification, image inpainting and content specific image retrieval. Example results of baseline methods are also shown for these applications. Thus we evaluate the usefulness of this dataset. To the best of our knowledge, probably this is the first attempt of combined dataset for evaluating various types of problems for a heritage site in India.
Pomegranate is a fruit which grows with a very high yield in many states of India and one of the most profits gaining fruit in the market. But due to various conditions, the plants are infected by various diseases whi...
详细信息
ISBN:
(纸本)9781467385640
Pomegranate is a fruit which grows with a very high yield in many states of India and one of the most profits gaining fruit in the market. But due to various conditions, the plants are infected by various diseases which destroy the entire crop leaving very less product yield. So, the work proposes an imageprocessing and neural network methods to deal with the main issues of phytopathology i.e. disease detection and classification. The Pomegranate fruit as well as the leaves are affected by various diseases caused by fungus, bacteria and the climatic conditions. These diseases are like Bacterial Blight, Fruit Spot, Fruit rot and Leaf spot. The system uses some images for training, some for testing purpose and so on. The color images are pre-processed and undergo k-means clustering segmentation. The texture features are extracted using GLCM method, and given to the artificial neural network. The overall accuracy of this method is 90%. The results are proved to be accurate and satisfactory in contrast to manual grading and hopefully take a strong rise in establishing itself in the market as one of the most efficient process.
Face recognition under varying background and pose is challenging, and extracting background and pose invariant features is an effective approach to solve this problem. This paper proposes a skin detection-based appro...
详细信息
ISBN:
(纸本)9781479915880
Face recognition under varying background and pose is challenging, and extracting background and pose invariant features is an effective approach to solve this problem. This paper proposes a skin detection-based approach for enhancing the performance of a Face Recognition (FR) system, employing a unique combination of Skin based background removal, Discrete Wavelet Transform (DWT), Adaptive Multi-Level Threshold Binary Particle Swarm Optimization (ABPSO) and an Error Control Feedback (ECF) loop. Skin based background removal is used for efficient background removal and ABPSO-based feature selection algorithm is used to search the feature space for the optimal feature subset. The ECF loop is used to neutralize pose variations. Experimental results, obtained by applying the proposed algorithm on Color FERET and CMUPIE face databases, show that the proposed system outperforms other FR systems. A significant increase in the recognition rate and substantial reduction in the number of features are observed.
暂无评论