In this paper, we take the advantages of color contrast and color distribution to get high quality saliency maps. The overall procedure flow of our unified framework contains superpixel pre-segmentation, color contras...
详细信息
Multiple kernel learning (MKL) is a widely used kernel learning method, but how to select kernel is lack of theoretical guidance. The performance of MKL is depend on the users' experience, which is difficult to ch...
详细信息
In this paper we present a new content-based retrieval descriptor, density-based silhouette descriptor (DBS). It characterizes a 3D object with multivariate probability functions of its 2D silhouette features. The new...
详细信息
ISBN:
(纸本)9789898565419
In this paper we present a new content-based retrieval descriptor, density-based silhouette descriptor (DBS). It characterizes a 3D object with multivariate probability functions of its 2D silhouette features. The new descriptor is computationally efficient and induces a permutation property that guarantees invariance at the matching stage. Also, it is insensitive to small shape perturbations and mesh resolution. The retrieval performance on several 3D databases shows that the DBS provides state-of-art discrimination over a broad and heterogeneous set of shape categories.
Segmentation becomes a difficult task if the objects are not homogeneous and have overlapping characteristics. The Graph Cuts methods combined with Gaussian Mixture Model (GMM) for initialization label has been adopte...
详细信息
Segmentation becomes a difficult task if the objects are not homogeneous and have overlapping characteristics. The Graph Cuts methods combined with Gaussian Mixture Model (GMM) for initialization label has been adopted to detect cattle object in an image with complex background. The RGB colors and Gray Level Co-occurrence Matrix (GLCM) textures are used as the features set. This method can robustly segment the cattle beef image from its background. This segmentation method produces the average of accuracy value up to 90%.
In this paper, we propose a robust visual tracking algorithm based on online learning of a joint sparse dictionary. The joint sparse dictionary consists of positive and negative sub-dictionaries, which model foregroun...
详细信息
Texture feature extraction plays an important role in texture image classification. In this paper, we have proposed a texture feature extraction method by utilizing the Short-time Fourier Transform to provide local im...
详细信息
ISBN:
(纸本)9781479906505
Texture feature extraction plays an important role in texture image classification. In this paper, we have proposed a texture feature extraction method by utilizing the Short-time Fourier Transform to provide local image information, and for the global geometric correspondence we have proposed to use Spatial Pyramid Matching in frequency domain named as Short-time Fourier Transform with Spatial Pyramid Matching (STFT-SPM). The experiments are conducted on standard benchmark datasets for texture classification like Brodatz and KTH-TIPS2-a, shows that STFT-SPM can achieve significant improvement compared to the Local Phase Quantization, Weber local Descriptor and local Binary pattern methods.
It is well known that the backgrounds or the targets always change in real scenes, which weakens the effectiveness of classical tracking algorithms because of frequent model mismatches. In this paper, an object tracki...
详细信息
This paper presents an objective comparative evaluation of layout analysis and recognition methods for scanned historical books. It describes the competition (modus operandi, dataset and evaluation methodology) held i...
详细信息
This paper presents an objective comparative evaluation of layout analysis and recognition methods for scanned historical books. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2013 and the 2nd International Workshop on Historical Document Imaging and Processing (HIP2013), presenting the results of the evaluation of five methods - three submitted and two state-of-the-art systems (one commercial and one open-source). Three scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions, one evaluating segmentation and region classification (with a text extraction goal) and the other the whole pipeline including recognition. The results indicate that there is a convergence to a certain methodology, in terms of layout analysis, with some variations in the approach. However, there is still a considerable need to develop robust methods that deal with the idiosyncrasies of historical books, especially for OCR.
This paper presents an objective comparative evaluation of layout analysis methods for scanned historical newspapers. It describes the competition (modus operandi, dataset and evaluation methodology) held in the conte...
详细信息
This paper presents an objective comparative evaluation of layout analysis methods for scanned historical newspapers. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2013 and the 2nd International Workshop on Historical Document Imaging and Processing (HIP2013), presenting the results of the evaluation of five submitted methods. Two state-of-the-art systems, one commercial and one open-source, are also evaluated for comparison. Two scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and the other evaluating the whole pipeline of segmentation and region classification (with a text extraction goal). The results indicate that there is a convergence to a certain methodology with some variations in the approach. However, there is still a considerable need to develop robust methods that deal with the idiosyncrasies of historical newspapers.
Reading order detection and representation is an important task in many digitisation scenarios involving the preservation of the logical structure of a document. The corresponding need for the evaluation of reading or...
详细信息
ISBN:
(纸本)9781479901937
Reading order detection and representation is an important task in many digitisation scenarios involving the preservation of the logical structure of a document. The corresponding need for the evaluation of reading order results generated by layout analysis methods poses a particular challenge due to potential deviations between ground truth and actually detected segmentation of the page. To this end a novel evaluation approach that responds to this problem by incorporating region correspondence analysis is proposed. Furthermore, a sophisticated reading order representation scheme is presented and used by the system allowing the grouping of objects with ordered and/or unordered relations. This is a typical requirement for documents with complex layouts such as magazines and newspapers. The evaluation method has been validated using the results of two state-of-the-art OCR / layout analysis systems and a basic top-to-bottom reading order detection algorithm applied on representative samples from the PRImA contemporary and the IMPACT historical document datasets.
暂无评论