this paper proposes a new hybrid approach to verification aspect of a multibiometric system. this also gives a comparative analysis with traditional approaches such as multialgorithmic and multimodal versions of the s...
详细信息
In this paper, we convert multi class subjective ratings for lung nodules from radiologists to binary class problem and use that to classify. We also evaluate the difference in performance between homogenous and heter...
详细信息
this book constitutes the refereed proceedings of the 4th International conference on Pattern Recognition and Machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. the 65 revised papers presented...
ISBN:
(数字)9783642217869
ISBN:
(纸本)9783642217852
this book constitutes the refereed proceedings of the 4th International conference on Pattern Recognition and Machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. the 65 revised papers presented together with 5 invited talks were carefully reviewed and selected from 140 submissions. the papers are organized in topical sections on pattern recognition and machine learning; image analysis; image and video information retrieval; natural language processing and text and data mining; watermarking, steganography and biometrics; soft computing and applications; clustering and network analysis; bio and chemo analysis; and document imageprocessing.
A major preprocessing step in a multi-script OCR is to identify the script type of the test document image. the published papers on script identification usually assume that the test image is in correct i.e. 0 degrees...
详细信息
ISBN:
(纸本)9780769545202
A major preprocessing step in a multi-script OCR is to identify the script type of the test document image. the published papers on script identification usually assume that the test image is in correct i.e. 0 degrees orientation. But by mistake a document may be fed to the system in wrong orientation, say at an angle of nearly 180 degrees or +/- 90 degrees. In this method we propose a script identification method that works for unknown orientation for all 11 official indian scripts. Here, we first find the skew and counter-rotate the document by the skew angle. this will lead to correct (0 degrees) or upside down (180 degrees) orientation. then script identification is done by a multi-stage tree classifier using features invariant to 0 degrees/180 degrees orientation. Next we go to find the orientation of the image by a two class classifier for each script. Performance of the proposed method has been tested on a variety of documents and promising results have been obtained.
graphicsprocessing Units (GPUs) are becoming increasingly important in high performance computing. To maintain high quality solutions, programmers have to efficiently parallelize and map their algorithms. this task i...
详细信息
In many real world prediction problems the output is a structured object like a sequence or a tree or a graph. Such problems range from natural language processing to computational biology or computervision and have ...
详细信息
the digital rubbing is a novel approach to promote and pass on Chinese traditional arts, as well as a new idea to protect stone relics. Combined withcomputerimageprocessing technology, this work fully manipulates t...
详细信息
ISBN:
(纸本)9780889868243
the digital rubbing is a novel approach to promote and pass on Chinese traditional arts, as well as a new idea to protect stone relics. Combined withcomputerimageprocessing technology, this work fully manipulates the advantages of arts, proposes a new method to transform relief images into digital rubbings, and presents the solution to generate digital rubbing automatically. In addition, this paper explicitly analyzes the basis to establish key technologies of digital rubbings. It has proved that the proposed method is an effective means to make digital rubbing more quickly and conveniently, and it also produces better processed images.
this paper presents a two layer CODEC architecture for lossy high dynamic range image compression. the first layer contains the tone mapped image obtained using a conventional low dynamic range encoding approach, such...
详细信息
ISBN:
(纸本)9780889868243
this paper presents a two layer CODEC architecture for lossy high dynamic range image compression. the first layer contains the tone mapped image obtained using a conventional low dynamic range encoding approach, such as JPEG. the second layer contains the image difference, in perceptually uniform color space, between the result of inverse tone mapped low dynamic range content and the original image. We present techniques for efficient implementation and encoding of non-uniform tone mapping operators. We show that better de-correlation of the information in the layers can improve the coding efficiency.
In order to successfully locate and retrieve document images such as technical articles and newspapers, a text localization technique must be employed. the proposed method detects and extracts homogeneous text areas i...
详细信息
ISBN:
(纸本)9780889868243
In order to successfully locate and retrieve document images such as technical articles and newspapers, a text localization technique must be employed. the proposed method detects and extracts homogeneous text areas in document images indifferent to font types and size by using connected components analysis to detect blocks of foreground objects. Next, a descriptor that consists of a set of structural features is extracted from the merged blocks and used as input to a trained Support Vector Machines (SVM). Finally, the output of the SVM classifies the block as text or not.
this paper introduces an approach for dense 3D reconstruction from unregistered Internet-scale photo collections with about 3 million images within the span of a clay on a single PC ("cloudless"). Our method...
详细信息
ISBN:
(纸本)9783642155604
this paper introduces an approach for dense 3D reconstruction from unregistered Internet-scale photo collections with about 3 million images within the span of a clay on a single PC ("cloudless"). Our method advances image clustering, stereo, stereo fusion and structure from motion to achieve high computational performance. We leverage geometric and appearance constraints to obtain a highly parallel implementation on modern graphics processors and multi-core architectures. this leads to two orders of magnitude higher performance on an order of magnitude larger dataset than competing state-of-the-art approaches.
暂无评论