This paper presents a new semi-supervised clustering framework to the recognition of heavily degraded characters in historical typewritten documents, where off-the-shelf OCR typically fails. The constraints are genera...
详细信息
This paper presents a new semi-supervised clustering framework to the recognition of heavily degraded characters in historical typewritten documents, where off-the-shelf OCR typically fails. The constraints are generated using typographical (collection-independent) domain knowledge and are used to guide both sample (glyph set) partitioning and metric learning. Experimental results using simple features provide encouraging evidence that this approach can lead to significantly improved clustering results compared to simple K-means clustering, as well as to clustering using a state-of-the art OCR engine.
The proceedings contain 44 papers. The special focus in this conference is on General Problems of H-CSI, Disabled Persons Helping, Medical H-CSI Applications, Psychological and Linguistic Aspects of H-CSI, Robots, Tra...
ISBN:
(纸本)9783642032011
The proceedings contain 44 papers. The special focus in this conference is on General Problems of H-CSI, Disabled Persons Helping, Medical H-CSI Applications, Psychological and Linguistic Aspects of H-CSI, Robots, Training Systems and Various H-CSI Applications. The topics include: From research on the decision-making in ill-structured situation control and the problem of risks;emulating the perceptual system of the brain for the purpose of sensor fusion;knowledge acquisition in conceptual ontological artificial intelligence system;a dialogue-based interaction system for human-computer interfaces;image annotation based on semantic rules;eye-mouse for disabled;eye-blink controlled human-computer interface for the disabled;machine learning of melanocytic skin lesion images;an application of detection function for the eye blinking detection;emotion recognition from facial expression using neural networks;emotion eliciting and decision making by psychodynamic appraisal mechanism;toward daydreaming machines;VoiceXML platform for minority languages;a web-oriented Java3D talking head;from research on the virtual reality installation;biologically reasoned point-of-interest image compression for mobile robots;graphical human-machine interface for QB systems;visualization of two parameters in a three-dimensional environment;enterprsise ontology for knowledge-based system;a new and improved skin detection method using mixed color space;a formal model for supporting the adaptive access to virtual museums;3D molecular interactive modeling and diagnosis based on fuzzy IF-THEN rules and genetic algorithms.
Over the last two decades, in the field of secure communication and financial on-line applications, we have witnessed an explosive growth in biometric personal authentication systems which are covenant with a measurab...
详细信息
Over the last two decades, in the field of secure communication and financial on-line applications, we have witnessed an explosive growth in biometric personal authentication systems which are covenant with a measurable physical characteristic or behavioral trait. On-line refers to making use of the time functions of the signing process. Verification of on-line signature as a biometric modality still is challenging fields of research, since the number of pointer based devices emerges as input devices for many e-commerce and m-commerce applications. The deployment of automatic handwritten signature verification with technology still remains open for novel methods due to inter-class and intra-class variations of signature. Since the area is currently one of the most on the go and the bulk of research is very large, this survey paper covers some of the examples of the ways.
Template matching is used for many applications in image processing. Cross correlation is the basic statistical approach to image registration. It is used for template matching or patternrecognition. Template can be ...
详细信息
Template matching is used for many applications in image processing. Cross correlation is the basic statistical approach to image registration. It is used for template matching or patternrecognition. Template can be considered a sub-image from the reference image, and the image can be considered as a sensed image. The objective is to establish the correspondence between the reference image and sensed image. It gives the measure of the degree of similarity between an image and template. This paper describes medical image registration by template matching based on Normalized Cross-Correlation (NCC) using Cauchy-Schwartz inequality. The algorithm for template matching using NCC is implemented in MATLAB. The algorithm does the template matching and uses the Cauchy-Schwartz's inequality to simplify the procedure. The developed algorithm is robust for similarity measure. An experimental result with medical images registration with noise and without noise is shown in the results section.
Wavelet packet decomposition not only has the decompose effect at low-frequency by using wavelet decomposition, but also has the decompose effect at high-frequency where can not do by using wavelet decomposition. In t...
详细信息
Wavelet packet decomposition not only has the decompose effect at low-frequency by using wavelet decomposition, but also has the decompose effect at high-frequency where can not do by using wavelet decomposition. In this paper, the wavelet packet decomposition algorithm was proposed and applied to glass-image recognition. Compared with other feature extracting technologies such as Zernikepsilas moments and wavelet transformation, the experiments proved that the wavelet packet decomposition was the best on both precision and efficiency.
The purpose of fingerprint matching is to compare two fingerprint images and return a similarity score that represents to the probability of match between the two fingerprints. The performance of an automatic fingerpr...
详细信息
The purpose of fingerprint matching is to compare two fingerprint images and return a similarity score that represents to the probability of match between the two fingerprints. The performance of an automatic fingerprint identification system is greatly determined by its fingerprint matching algorithm. In this paper, a full fingerprint matching algorithm is presented. The main contribution of this paper is that a novel algorithm based on convex hulls for eliminating spurious matching in fingerprint matching is proposed. It is very important for improving the performance of fingerprint matching algorithm to reducing spurious matching. The algorithm is tested on a public database FVC2002 DB1_A. Experimental results show that the proposed algorithm is performed well compared with other ones.
Artificial Neural Networks (ANN) embed shallow knowledge through learning. Used in diagnosis and decision support, ANN are immediate computational models for effects and causes as from human experience but keep out fr...
详细信息
Artificial Neural Networks (ANN) embed shallow knowledge through learning. Used in diagnosis and decision support, ANN are immediate computational models for effects and causes as from human experience but keep out from the deep knowledge of them. The paper presents a way of embedding logical processing over the numerical ones in ldquoneural logical sitesrdquo for the classical ANN paradigms, then proposes a way of structuring deep knowledge in the network for all types of abduction problems in a unified way, which is compared with similar attempt. The approach may be spread in any diagnosis and decision support applications involving deep and shallow knowledge.
Coping with differences in the expression of emotions is a challenging task not only for a machine, but also for humans. Since individualism in the expression of emotions may occur at various stages of the emotion gen...
详细信息
Coping with differences in the expression of emotions is a challenging task not only for a machine, but also for humans. Since individualism in the expression of emotions may occur at various stages of the emotion generation process, human beings may react quite differently to the same stimulus. Consequently, it comes as no surprise that recognition rates reported for a user-dependent system are significantly higher than recognition rates for a user-independent system. Based on empirical data we obtained in our earlier work on the recognition of emotions from biosignals, speech and their combination, we discuss which consequences arise from individual user differences for automated recognition systems and outline how these systems could be adapted to particular user groups.
We present Antelogue, a novel pronoun resolution architecture for dialogues based on efficient filtering of potential antecedents through a simple look-up of information using existing resources (gender, number, NER, ...
详细信息
ISBN:
(纸本)9781424449620
We present Antelogue, a novel pronoun resolution architecture for dialogues based on efficient filtering of potential antecedents through a simple look-up of information using existing resources (gender, number, NER, etc). Our system does not require large labelled datasets for training or complex handcrafted rules. We will demo the system's real time performance on dialogues extracted from the screenplays of the popular TV series Lost. Antelogue achieves accuracy (90.4%) for first, second and third person singular pronouns.
Edge detection is one of the most commonly used operations in image processing and patternrecognition, the reason for this is that edges form the outline of an object. An edge is the boundary between an object and th...
详细信息
ISBN:
(纸本)9781424451043
Edge detection is one of the most commonly used operations in image processing and patternrecognition, the reason for this is that edges form the outline of an object. An edge is the boundary between an object and the background, and indicates the boundary between overlapping objects. This means that if the edges in an image can be identified accurately, all of the objects can be located and basic properties such as area, perimeter, and shape can be measured. Since computer vision involves the identification and classification of objects in an image, edge detection is an essential tool. Efficient and accurate edge detection will lead to increase the performance of subsequent image processing techniques, including image segmentation, object-based image coding, and image retrieval. A color image edge detection algorithm is proposed in this paper. Average maximum color difference value is used to predict the optimum threshold value for a color image and thinning technique is applied to extract proper edges. The proposed method is applied over large database of color images both synthetic and real life images and performance of the algorithm is evident from the results and is comparable with other edge detection algorithms.
暂无评论