Normally images obtained from satellites are of low-contrast type which hides major information carried by the image. Hence, image restoration is necessary in the image processing domain to extract all the information...
详细信息
Tamil is a classical south Indian language in India that contains more letters among other Indian languages. The motivation behind our work was to develop an efficient character recognition technique for analyzing the...
详细信息
ISBN:
(纸本)9781450329088
Tamil is a classical south Indian language in India that contains more letters among other Indian languages. The motivation behind our work was to develop an efficient character recognition technique for analyzing the scanned Tamil documents. Initially character recognition was done with the help of frequently used Tamil characters. Based on that, we have created a new datasets by the process of segmentation using levelsets, binarization, and skew correction. In this paper we have recognized the Tamil characters by generating the random features using Random Kitchen Sinks. It gives the accuracy about 98.7%. Copyright 2014 ACM.
The paper proposes the design of linear phase filters using the optimization techniques based on methods for sparse signal restoration. The filter response is treated as weighted sum of the phase shifted versions of t...
详细信息
A short text gets updated every now and then. With the global upswing of such micro posts, the need to retrieve information from them also seems to be incumbent. This work focuses on the knowledge extraction from the ...
详细信息
The progression of social media contents, similar like Twitter and Facebook messages and blog post, has created, many new opportunities for language technology. The user generated contents such as tweets and blogs in ...
详细信息
The progression of social media contents, similar like Twitter and Facebook messages and blog post, has created, many new opportunities for language technology. The user generated contents such as tweets and blogs in most of the languages are written using Roman script due to distinct social culture and technology. Some of them using own language script and mixed script. The primary challenges in process the short message is identifying languages. Therefore, the language identification is not restricted to a language but also to multiple languages. The task is to label the words with the following categories L1, L2, Named Entities, Mixed, Punctuation and Others This paper presents the AmritaCen-NLP team participation in FIRE2015-Shared Task on Mixed Script Information Retrieval Subtask 1: Query Word Labeling on language identification of each word in text, Named Entities, Mixed, Punctuation and Others which uses sequence level query labelling with Support Vector Machine.
This paper aims at implementing Named Entity Recognition (NER) for four languages such as English, Tamil, Hindi and Malayalam. The results obtained from this work are submitted to a research evaluation workshop Forum ...
详细信息
The present work is done as part of shared task in Sentiment Analysis in Indian Languages (SAIL 2015), under constrained category. The task is to classify the twitter data into three polarity categories such as positi...
详细信息
Non-native English writers often make preposition errors in English language. The most commonly occurring preposition errors are preposition replacement, preposition missing and unwanted preposition. So, in this metho...
详细信息
This paper is an attempt to show that a dependency parser for Malayalam language can be produced using Integer Linear Programming Approach. We describe a process for developing a parser by incorporating the Paninian G...
详细信息
Video watermarking is relatively a new technology to ensure protection of intellectual property rights and to stop video piracy. The ownership information or watermark is normally hidden in the video sequences. In thi...
详细信息
暂无评论