Keyword-based search algorithms may sometimes yield weird results despite the fact that they usually do their jobs well. Despite of the fact that the web is the largest database, comparing to relational databases, the...
详细信息
ISBN:
(纸本)9781479914210
Keyword-based search algorithms may sometimes yield weird results despite the fact that they usually do their jobs well. Despite of the fact that the web is the largest database, comparing to relational databases, the set of search operations for the web is still primitive. this paper proposes two ways to remedy this: first, advancedinformation sources should be created. the role of advancedinformation sources of the web is analogous to views of relational databases. Second, we propose several data processing tools based on the concept of advancedinformation sources. Withthese two mechanisms, the researcher tries to distinguish the data-centric view from the presentation view of the web.
In this paper, we suggest a process called Listener that support so that client can manage easily information retrieval system(IRS) and apply this to KRISTAL-information Retrieval and Management System(IRMS). Usual v,...
详细信息
ISBN:
(纸本)9780769529301
In this paper, we suggest a process called Listener that support so that client can manage easily information retrieval system(IRS) and apply this to KRISTAL-information Retrieval and Management System(IRMS). Usual v, in relational database management svstem(RDBMS) such as Oracle, there are many applications and tools that client may manage system and its data. On the other hand, in information retrieval system applications for management has been performed in server-side due to the structure and purpose of the IR systems, it is difficult to manage the system. Using the proposed Listener, administrator can achieve control of search daemon process, database structure alteration, data management on client computer. this process has been designed based on XML considering scalability and readability and supports the client API by C and Java and make developer can do easily maintenance of the application for management database.
Relevance analysis is a regular and important task in many technical fields. We can get the relevance score by measuring (or quantifying) the result of relevance analysis. In this paper we have reviewed two main tools...
详细信息
ISBN:
(纸本)9780769529301
Relevance analysis is a regular and important task in many technical fields. We can get the relevance score by measuring (or quantifying) the result of relevance analysis. In this paper we have reviewed two main tools for relevance measure, which are the covariance and the mutual information, and we have discussed that there may be some problems in relevance measure if we use the above two methods, then we give the definition on Partial Condition Entropy(PCE) based on the informationtheory and presented a new method for relevance measure by using the PCE. there are mainly three advantages for relevance measure by using our method: (1) the relevance degree can be compared more easy than other methods because the score of relevance calculation is equal to a numeral between 0 and 1;(2) By using the method, we can not only know whether there is relevance between the considered events but also get a special score that represents the relevance degree of these events;(3) When we calculate the PCE, we needn't know all the conditional probability density, so our method is more flexible than the calculation of mutual information. To demonstrate the usefulness of our method for relevance measure, we apply it to the sentence relevance analysis in Natural languageprocessing(NLP). We find that our result of relevance measure is a more truly reflection on the relationship between the sentences.
this research paper presents the development step of our Arab Gloss proposed system [1][2] mades for the ArabSTS (Arab Sign language Translating System). ArabSTS aims to translate the Arabic text to Arabic Sign Langua...
详细信息
ISBN:
(纸本)9781538613528;9781538644607
this research paper presents the development step of our Arab Gloss proposed system [1][2] mades for the ArabSTS (Arab Sign language Translating System). ArabSTS aims to translate the Arabic text to Arabic Sign language (ArbSL). this work is a part of a big project developed in Latice Laboratory which is the webSign project [3]. webSign is a web Application taking as input a natural sentence aiming to translate this sentence in a visual and body language in real time. the Animation level is based on the technology of the Avatar. Our work focuses on the Arabic language as the text in the input, which needs many treatments due to the particularity of this language. ArabSTS starts from the linguistic treatment of the Arabic sentence, passing through the definition and the development of the Arabic Annotation Gloss system, which is the main topic of this paper, and coming finally to the generation of an animated sentence using the avatar technology.
Query-directed multi-document summarization aims to provide a more effective characterization of a document set accounting to the user's information need when generating a summary. In this paper, we propose a prac...
详细信息
the traditional English text chunking approach identifies phrases by using only one model and phrases withthe same types of features. It has been shown that the limitations of using only one model are that: the use o...
详细信息
ISBN:
(纸本)9780769529301
the traditional English text chunking approach identifies phrases by using only one model and phrases withthe same types of features. It has been shown that the limitations of using only one model are that: the use of the same types of features is not suitable for all phrases, and data sparseness may also result. In this paper, a divide-conquer strategy is proposed and applied in the identification of English phrases. And then, this strategy is rapid transplanted to Chinese text chunking. this strategy divides the task of chunking into several sub-tasks according to sensitive features of each phrase and identifies different phrases in parallel. then, a two-stage decreasing conflict strategy is used to synthesize each sub-task's answer, where the main features are: one, each phrase uses its own sensitive features;two, avoidance of data sparseness. through testing on public corpus (English) and Chinese Penn Treebank (Chinese), F score of English chunking achieves to 95.14% and that of Chinese chunking is 95.23%. these results are state of the art withthe best results that have been reported..
Ontologies are increasingly used in many domains, such as knowledge management, information extraction, the semantic web and so on. More and more ontologies have been published on the web, with different scope and qua...
详细信息
ISBN:
(纸本)9780769532738
Ontologies are increasingly used in many domains, such as knowledge management, information extraction, the semantic web and so on. More and more ontologies have been published on the web, with different scope and quality. Ontology evaluation has been proposed as one strategy for quality assurance of ontologies. In this paper, we propose a democratic ranking system analogous to that used by *** and eBay, with an innovation that we separate the reviewers into deferent groups of domain experts, ontology researchers, and common users. By using the server side Java technologies and Ajax technology, we developed a prototype system which can evaluate ontologies uploaded by users in both an objective and a subjective way.
Recently, many researches of various fields that used a ubiquitous sensor network are going on. Specially, various experiment had a function of monitoring without human is preceded in the environmental fields where ac...
详细信息
ISBN:
(纸本)9780769532738
Recently, many researches of various fields that used a ubiquitous sensor network are going on. Specially, various experiment had a function of monitoring without human is preceded in the environmental fields where access of a person is difficult. For example, in Korea, various pilot u-city projects are driven by nation. this paper designed a service as it used a wireless sensor network to measure and monitor environmental information in the campus.
the 3D video is the core technology of next generation multimedia service for providing the best quality service to the users. this paper propose the structure of the ObjectDescriptor(OD) steam which can provide the m...
详细信息
An Improved TBL based post-processing approach is proposed for Japanese named entity recognition (NER) in this paper. Firstly, tuning rules are automatically acquired from the results of Japanese NER by error driven l...
详细信息
ISBN:
(纸本)9780769532738
An Improved TBL based post-processing approach is proposed for Japanese named entity recognition (NER) in this paper. Firstly, tuning rules are automatically acquired from the results of Japanese NER by error driven learning. And then, the tuning rules are optimized according to given threshold conditions. After filtered, the rules are used to revise the results of Japanese NER. Above all, this approach could be used in special domains perfectly for its learning domain linguistic knowledge automatically, the learnt rules could not go over fit as well. the experimental results show that a high result can be achieved in precision for Japanese NER.
暂无评论