Everyday spatio-temporal reasoning is driven through qualitative abstractions over mental maps or 'diagrams'. Diagrammatic reasoning involves direct manipulation and inspection of diagrams as the primary means...
详细信息
ISBN:
(纸本)9781467385640
Everyday spatio-temporal reasoning is driven through qualitative abstractions over mental maps or 'diagrams'. Diagrammatic reasoning involves direct manipulation and inspection of diagrams as the primary means of inference. Diagrammatic representation offer computational advantage in problems where spatial relationships play a prominent role. In video, objects change spatial relationships over time. Therefore, combining diagrammatic reasoning with qualitative spatial and temporal reasoning holds promise. In this paper, we put forward a framework combining diagrammatic representation with qualitative spatial and temporal reasoning for motion event detection in video. Key frames with tracked objects for a given video are extracted. These frames in forward moving time are represented using specified;'diagrams'. A set of perception function is defined exploiting results from diagrammatic reasoning and qualitative spatial and temporal reasoning for determining spatial relations between objects of interest. Inter diagrammatic reasoning operator is used to combine sequence of diagrams for extracting spatio-temporal changes. Diagram modification function is defined exploiting results from qualitative reasoning to extract directional information. Considering extracted relative position and relative direction of displacement as features, we use supervised machine learning techniques to recognize motion events in video. The approach is tested in videos with few people/groups meeting, walking together and splitting up/fighting from the CAVIAR dataset.
Over 70% of software development effort is spent in software maintenance comprising bug fixes and version updates. These activities involve fast comprehension of large codebases authored by multiple developers. Develo...
详细信息
image inpainting is the process to reconstruct the missing or corrupted regions of the images. This paper presents context aware exemplar based image inpainting using adaptive image division technique. In previous inp...
详细信息
Every document holds some sorts of information. Paper documents especially old paper document provide a wealth of information about every aspect of the society and civilization. So these paper documents are being conv...
详细信息
ISBN:
(纸本)9781467374378
Every document holds some sorts of information. Paper documents especially old paper document provide a wealth of information about every aspect of the society and civilization. So these paper documents are being converted to electronic form for better storage, processing and analysis. The people, especially students while reading a book or documents have a tendency to mark important lines through the pen. To preserve these types of documents electronically, these lines need to be removed. This paper proposed a method to remove hand drawn lines from scanned paper document images. As there is no any benchmark database, we also proposed a dataset consisting of document images with hand-drawn lines (made it publicly available). The proposed method is tested on this dataset and the results are encouraging.
In recent days most digital imaging devices i.e. the high resolution images or videos are playing a critical role in the areas of imageprocessing and application. They too are becoming helpful in the areas of medical...
详细信息
The bilateral filter is a versatile non-linear filter that has found diverse applications in imageprocessing, computervision, computergraphics, and computational photography. A common form of the filter is the Gaus...
详细信息
ISBN:
(纸本)9781479983391
The bilateral filter is a versatile non-linear filter that has found diverse applications in imageprocessing, computervision, computergraphics, and computational photography. A common form of the filter is the Gaussian bilateral filter in which both the spatial and range kernels are Gaussian. A direct implementation of this filter requires O(sigma(2)) operations per pixel, where sigma is the standard deviation of the spatial Gaussian. In this paper, we propose an accurate approximation algorithm that can cut down the computational complexity to O(1) per pixel for any arbitrary sigma (constant-time implementation). This is based on the observation that the range kernel operates via the translations of a fixed Gaussian over the range space, and that these translated Gaussians can be accurately approximated using the so-called Gauss-polynomials. The overall algorithm emerging from this approximation involves a series of spatial Gaussian filtering, which can be efficiently implemented (in parallel) using separability and recursion. We present some preliminary results to demonstrate that the proposed algorithm compares favorably with some of the existing fast algorithms in terms of speed and accuracy.
This paper presents a real-time hands-free immersive image navigation system that can respond to various gestures and voice commands. We combine Microsoft Kinect 2.0 and Leap Motion Controller, on a single platform, t...
详细信息
This paper presents a system for unconstrained handwritten Odia text recognition using Hidden Markov Model (HMM) framework. Existing literature for Odia text recognition works primarily with individual isolated charac...
详细信息
This paper presents a system for unconstrained handwritten Odia text recognition using Hidden Markov Model (HMM) framework. Existing literature for Odia text recognition works primarily with individual isolated characters. In this study we introduce a Odia dataset of word samples collected from different professionals. Concavity feature from each word image is extracted in our approach. Next, the features are fed to HMM-based sequential classifier for recognition. The experiment has been performed on a large dataset consisting of 4000 words and results obtained are encouraging.
Stamps and logos are generally used for authenticating the source of a document. For automatic document processing, identification and segmentation of stamps and logos are essential. In the past, methods to detect sta...
详细信息
Stamps and logos are generally used for authenticating the source of a document. For automatic document processing, identification and segmentation of stamps and logos are essential. In the past, methods to detect stamps and logos were limited to specific shapes, colors, or training data. However, stamps and logos can be of any shape or color. In this paper, we have proposed a novel stamp and logo detection technique. Our approach is based on the fact that stamps and logos, in general, are not the primary contents of a document. This fact motivates us to propose an outlier detection technique for the same purpose in a feature space. Based on some geometric features, the detected outliers are classified as stamps and logos. Our method shows good performance in case of separating them from text. Moreover, this technique is capable of detecting logos as well as chromatic and achromatic stamps.
暂无评论