A new partition criterion for pairwise clustering is proposed naturally in the probabilistic analysis framework. Its connection to normal K-means algorithm, is explained in two different views which also builds its re...
详细信息
ISBN:
(纸本)0780374886
A new partition criterion for pairwise clustering is proposed naturally in the probabilistic analysis framework. Its connection to normal K-means algorithm, is explained in two different views which also builds its relationship withthe kernel approach introduced by Vapnik. Both synthetic examples and the challenging task of planar shape analysis problem have been given to show its efficiency of unsupervised pairwise clustering application.
One of the important reasons for poor recognition rate in optical character recognition (OCR) system is the error in character segmentation. Existence of touching characters in the scanned documents is a major problem...
详细信息
ISBN:
(纸本)0769512631
One of the important reasons for poor recognition rate in optical character recognition (OCR) system is the error in character segmentation. Existence of touching characters in the scanned documents is a major problem to design an effective character segmentation procedure. In this paper, a new technique is presented for identification and segmentation of touching characters. the technique is based on fuzzy multifactorial analysis. A predictive algorithm is developed for effectively selecting possible cut columns for segmenting the touching characters. the proposed method has been applied to printed documents in Devnagari and Bangla: the two most popular scripts of the Indian sub-continent. the results obtained from a test-set of considerable size show that a reasonable improvement in recognition rate can be achieved with a modest increase in computations.
this paper presents a very fast mufti-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particu...
详细信息
ISBN:
(纸本)0769516564
this paper presents a very fast mufti-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin characters where no commercial OCR systems are available. the approach used normalises isolated characters for size and extracts an image signature based on the number of black pixels in the rows and columns of the character and compares these values to a set of signatures for typical characters of the set. this technique identifies not only the closet match but gives the closeness of match to all other characters in the set, which is expressed in a triangular Confusion Matrix.
Most of the Prototype Reduction Schemes (PRS), which have been reported in the literature, process the data in its entirety to yield a subset of prototpyes that are useful in nearest-neighbourlike classification. Fore...
详细信息
A new Voice Activity Detector (VAD) algorithm using Support Vector Machines (SVM) is proposed in the paper, and the new VAD effectiveness is validated. Sequential Minimal Optimization (SMO) algorithm for fast training...
详细信息
ISBN:
(纸本)0780374886
A new Voice Activity Detector (VAD) algorithm using Support Vector Machines (SVM) is proposed in the paper, and the new VAD effectiveness is validated. Sequential Minimal Optimization (SMO) algorithm for fast training support vector machines is adopted. the proposed VAD algorithm via SVM (SVM-VAD) also uses the characteristic parameters set used by 6729 Annex B (G.729B) VAD. Comparing SVM-VAD with G.729B VAD shows that it is effective for applying SVM to VAD. the new proposed VAD algorithm is integrated with G.729B instead of G.729B VAD, informal listening tests show that the integrated speech coding system has a little better efficiency over the G.729B VAD in perceptivity.
On the basis of DBF nets proposed by Wang Shoujue, the model and implement of DBF neural network were discussed in this paper. When applied in patternrecognition, the algorithm and implement on hardware were presente...
详细信息
the proceedings contain 33 papers. the special focus in this conference is on Technical Drawings, Validation, Symbol Segmentation, recognition and Perceptual Organization. the topics include: 3D reconstruction of pape...
ISBN:
(纸本)3540440666
the proceedings contain 33 papers. the special focus in this conference is on Technical Drawings, Validation, Symbol Segmentation, recognition and Perceptual Organization. the topics include: 3D reconstruction of paper based assembly drawings;interpretation of low-level CAD data for knowledge extraction in early design stages;an efficient form classification method;robust frame extraction and removal for processing form documents;issues in ground-truthing graphic documents;sketch-based user interface for inputting graphic objects on small screen devices;experimental evaluation of a trainable scribble recognizer for calligraphic interfaces;an error-correction graph grammar to recognize texture symbols;perceptual organization as a foundation for graphics recognition;exploiting perceptual grouping for map analysis, understanding and generalization;extraction of contextual information existing among component elements of origami books;semantic analysis and recognition of raster-scanned color cartographic images;structure based interpretation of unstructured vector maps;generating logic descriptions for the automated interpretation of topographic maps;using software component algebra for intelligent document generation;interpreting sloppy stick figures by graph rectification and constraint-based matching;using a generic document recognition method for mathematical formulae recognition;smoothing and compression of lines obtained by raster-to-vector conversion;a scale and rotation parameters estimator application to technical document interpretation;structural rectification of non-planar document images;an effective vector extraction method on architectural imaging using drawing characteristics;a recognition method of matrices by using variable block pattern elements generating rectangular area and extended summary of the arc segmentation contest.
the proceedings contain 24 papers. the special focus in this conference is on Implementation and Application of Automata. the topics include: Using finite state technology in natural language processing of Basque;subm...
ISBN:
(纸本)3540004009
the proceedings contain 24 papers. the special focus in this conference is on Implementation and Application of Automata. the topics include: Using finite state technology in natural language processing of Basque;submodule construction and supervisory control;counting the solutions of presburger equations without enumerating them;finite automata for compact representation of language models in NLP;scheduling hard sporadic tasks by means of finite automata and generating functions;bounded-graph construction for noncanonical discriminating-reverse parsers;finite-state transducer cascade to extract proper names in texts;compilation methods of minimal acyclic finite-state automata for large dictionaries;improving raster image run-length encoding using data order;enhancements of partitioning techniques for image compression u sing weighted finite automata;on the size of deterministic finite automata;minimal adaptive pattern-matching automata for efficient term rewriting;typographical nearest-neighbor search in a finite-state lexicon and its application to spelling correction;on the software design of cellular automata simulators for ecological modeling and supernondeterministic finite automata.
In this paper according to the concept of Generalized Fishr Disrminnt(GFD) presented by D.H. Foley and J.W. Sammon, Generalized Kemel Function Fisher Discriminant(GKFD) is investigated and proved based on Linear Fishe...
详细信息
In this paper, we consider the general problem of technical document interpretation, applied to the documents of the French Telephonic Operator, France Telecom. At GREC.99, we presented a new set of features, based on...
详细信息
暂无评论