In a multi-lingual country like India, a document page may contain more than one script form. Under the three-language formula, the document may be printed in English, Devnagari and one of the other official indian la...
详细信息
In a multi-lingual country like India, a document page may contain more than one script form. Under the three-language formula, the document may be printed in English, Devnagari and one of the other official indian languages. For OCR of such a document page, it is necessary to separate these three script forms before feeding them to the OCRs of individual scripts. In this paper, an automatic technique of separating the text lines using script characteristics and shape based features is presented. At present, the system has an overall accuracy of about 98.5%.
Over the last decade or so, remarkable developments in computer technology have given a major impetus to research in the field of multimedia. With the proliferation of the Internet and the increasingly widespread use ...
详细信息
Over the last decade or so, remarkable developments in computer technology have given a major impetus to research in the field of multimedia. With the proliferation of the Internet and the increasingly widespread use of sophisticated computers, the multimedia revolution has arrived in India as well. It is therefore time to take stock of the situation: to evaluate how existing techniques can be used in the indian context and to determine what new methods have to be developed. This paper summarizes the current state of multimedia technology in India and points to directions for further work. As more and more people in India begin to use computers and the Internet, multimedia capabilities will start playing a vital role in solving problems in many different areas. Education is probably one of the most important areas where multimedia technology can have a major impact. Already, multimedia educational systems are being developed in indian languages. Several interactive encyclopaedia-like environments are also being marketed on CD-ROMs, and cover topics ranging from indian classical music to indian history, using text, images and sound. Some of the other possible applications of multimedia technology are: the development of digital libraries, news and information dissemination services, medicine, business and commerce, and the entertainment industry. Multimedia information technology is thus poised to become an exciting area for research and development activities in India.
There are many types of documents where machine-printed and hand-written texts appear intermixed. Since the optical character recognition (OCR) methodologies for machine-printed and hand-written texts are different, i...
详细信息
There are many types of documents where machine-printed and hand-written texts appear intermixed. Since the optical character recognition (OCR) methodologies for machine-printed and hand-written texts are different, it is necessary to separate these two types of text before feeding them to the respective OCR systems. In this paper, we present such a scheme for both Bangla and Devnagari characters. The scheme is based on the structural and statistical features of the machine-printed and hand-written text lines. The classification scheme has an accuracy of about 98.3%.
We propose simple and fast algorithms for detection of italic, bold and all-capital words without doing actual character recognition. We present a statistical study which reveals that the detection of such words may p...
详细信息
We propose simple and fast algorithms for detection of italic, bold and all-capital words without doing actual character recognition. We present a statistical study which reveals that the detection of such words may play a key role in automatic information retrieval from documents. Moreover, detection of italic words can be used to improve the recognition accuracy of a text recognition system. Considerable number of document images have been tested and our algorithms give accurate results on all the tested images, and the algorithms are very easy to implement.
Skew angle detection of scanned documents containing most popular indian scripts (Devnagari and Bangla) is considered. Most characters in these scripts have horizontal lines at the top, called headlines. The character...
详细信息
Skew angle detection of scanned documents containing most popular indian scripts (Devnagari and Bangla) is considered. Most characters in these scripts have horizontal lines at the top, called headlines. The character head lines mostly join one another in a word and the word appears as a single component. In the proposed method the components are at first labeled. The upper envelope of a component is found by columnwise scanning from an imaginary line above the component. Portions of upper envelope satisfying the properties of digital straight line are detected. They are clustered as belonging to single text lines. Estimates from individual clusters are combined to get the skew angle. Apart from accuracy and efficiency, an advantage of the method is that character segmentation and zone detection can be readily done from head line information, which is useful in Optical Character recognition approaches of these scripts.
Clustering techniques such as K-means and Forgy as well as their improved version ISODATA group data around one seed point for each cluster. It is well known that these methods do not work well if the shape of the clu...
详细信息
Clustering techniques such as K-means and Forgy as well as their improved version ISODATA group data around one seed point for each cluster. It is well known that these methods do not work well if the shape of the cluster is elongated or nonconvex. We argue that for a elongated or nonconvex shaped cluster, more than one seed is needed, In this paper a multiseed clustering algorithm is proposed. A density based representative point selection algorithm is used to choose the initial seed points. To assign several seed points to one cluster, a minimal spanning tree guided novel technique is proposed. Also, a border point detection algorithm is proposed for the detection of shape of the cluster. This border in turn signifies whether the cluster is elongated or not. Experimental results show the efficiency of this clustering technique.
In this paper we propose a neural network model to synthesise texture images. The model is based on a continuous Hopfield-like network where each pixel of the image is occupied by a neuron that is eight-connected to i...
详细信息
In this paper we propose a neural network model to synthesise texture images. The model is based on a continuous Hopfield-like network where each pixel of the image is occupied by a neuron that is eight-connected to its neighbours. A state of the neuron denotes a certain grey level of the corresponding pixel. The firing of the neuron changes its state, and hence the grey level of the corresponding pixel. Different two-tone and grey-tone texture images can be synthesised by manipulating the connection weights and by varying the algorithm iteration number. For grey-tone texture synthesis, a Markov chain principle has been employed to decide on the multiple state transition of a neuron. The model can be employed for texture propagation with the advantage that it allows propagation without showing any blocky effect.
This paper presents an application of backpropagation neural network for the detection of linear structures in remote-sensing images. The purpose of the approach is two-fold. First, to exploit the advantages of a neur...
详细信息
This paper presents an application of backpropagation neural network for the detection of linear structures in remote-sensing images. The purpose of the approach is two-fold. First, to exploit the advantages of a neural network classifier over the tranditional ones. Second, to avoid the strategic phases of enhancement and thresholding. Once the network is learnt, the classification scheme is real-time. Two critical issues in the present approach an the selection of the network architecture and the rate of convergence of learning. Solutions to these two problems are proposed. Experimental results on IRS and SPOT images are presented. Satisfactory classification results have been obtained using the network.
In a multi-lingual country like India, a document may contain more than one script forms. For such a document it is necessary to separate different script forms before feeding them to OCRs of individual script. In thi...
详细信息
In a multi-lingual country like India, a document may contain more than one script forms. For such a document it is necessary to separate different script forms before feeding them to OCRs of individual script. In this paper an automatic word segmentation approach is described which can separate Roman, Bangla and Devnagari scripts present in a single document. The approach has a tree structure where at first Roman script words are separated using the 'headline' feature. The headline is common in Bangla and Devnagari but absent in Roman. Next, Bangla and Devnagari words are separated using some finer characteristics of the character set although recognition of individual character is avoided. At present, the system has an overall accuracy of 96.09%.
An OCR system is proposed that can read two indian language scripts: Bangla and Devnagari (Hindi), the most popular ones in the indian subcontinent. These scripts, having the same origin in ancient Brahmi script, have...
详细信息
An OCR system is proposed that can read two indian language scripts: Bangla and Devnagari (Hindi), the most popular ones in the indian subcontinent. These scripts, having the same origin in ancient Brahmi script, have many features in common and hence a single system can be modeled to recognize them. In the proposed model, document digitization, skew detection, text line segmentation and zone separation, word and character segmentation, character grouping into basic, modifier and compound character category are done for both scripts by the same set of algorithms. The feature sets and classification tree as well as the knowledge base required for error correction (such as lexicon) differ for Bangla and Devnagari. The system shows a good performance for single font scripts printed on clear documents.
暂无评论