Temporal information carries information about changes and time of the changes. Consider a company investing in another company. The former may choose to inject the money gradually with the amount and frequency depend...
The architecture of learned software for searching of semantics in text documents is proposed. In a basis of performance and the recognition of NIL semantics the following fundamental principles are proposed: 1. Orien...
详细信息
ISBN:
(纸本)8978686184
The architecture of learned software for searching of semantics in text documents is proposed. In a basis of performance and the recognition of NIL semantics the following fundamental principles are proposed: 1. Orientation to a recognition of semantics with minimum usage of knowledge about syntax of the language, 2. Creation of hierarhies from concepts with horizontal (associative) links between nodes of these hierarhies as result of processing of text documents, 3. Recognition of words and collocations on maximum similar with usage of neural algorithms. The main algorithms of learning of software and searching of documents are considered. Also the features of learning (creation of knowledge base) of proposed software are analyzed. Now research prototype of software with this architecture is implemented.
A lexical knowledge base is an important component of any intelligent information processing system. The WordNet developed at the Cognitive Systems Laboratories at Princeton has served as a lexical reference system fo...
详细信息
naturallanguageprocessing is a technique that includes both naturallanguage understanding and naturallanguage generation. Translating one naturallanguage into another becomes complex due to structural difference,...
详细信息
naturallanguageprocessing (NLP) techniques have been explored to enhance the performance of Information Retrieval (IR) methods with varied results. Most efforts in using NLP techniques have been to identify better i...
详细信息
naturallanguageprocessing (NLP) techniques have been explored to enhance the performance of Information Retrieval (IR) methods with varied results. Most efforts in using NLP techniques have been to identify better index terms for representing documents. This use in the indexing phase of IR has implicit effect on retrieval performance. However, the explicit use of NLP techniques during the retrieval or information seeking phase has been restricted to interactive or dialogue systems. Recent advances in IR are based on using Statistical language Models (SLM) to represent documents and ranking them based on their model generating a given user query. This paper presents a novel method for using NLP techniques on user queries, specifically, a syntactic parse of a query, in the statistical language modeling approach to IR. In the proposed method, named Concept language Models, a query is viewed as a sequence of concepts and a concept as a sequence terms. The paper presents different approximations to estimate the concept and term probabilities and compute the query likelihood estimate for documents. Some empirical results on TREC test collections comparing Concept language Models with smoothed N-gram language models are presented. Copyright 2003 ACM.
A modern system extracting the significant information (objects with attributes and links, groups of objects composing the events) from free text in naturallanguage is considered. This information is represented in t...
详细信息
ISBN:
(纸本)1932415114
A modern system extracting the significant information (objects with attributes and links, groups of objects composing the events) from free text in naturallanguage is considered. This information is represented in the knowledge base (KB) in the form of semantic networks and is processed at the level of networks. System uses KB for analytical processing of texts and fuzzy search. For discovering in texts the significant and analytical information the system uses special semantic filters. Methods of discovery and of analytical processing are considered. The system has been applied for the logical-analytical tasks of accident reports processing. The system can be tuned to another application by changing a linguistic knowledge to indicate the significant objects, links and contexts. The system was tuned to texts in Russian about commercial banks to extract significant information about them and to determine the bank range. Another application is connected with DB. The system can read free texts and fill the empty fields of DB.
The knowledge management is becoming more and more important in organizations, either over the intranet or Internet. In this paper we present an ontology-based web knowledge management (KM) framework based on web onto...
详细信息
We describe a system for extracting mentions of terms such as company and product names, in a large and noisy corpus of documents, such as the World Wide Web. Since naturallanguage terms are highly ambiguous, a signi...
详细信息
We describe a system for extracting mentions of terms such as company and product names, in a large and noisy corpus of documents, such as the World Wide Web. Since naturallanguage terms are highly ambiguous, a significant challenge in this task is disambiguating which occurrences of each term are truly related to the right meaning, and which are not. We describe our approach for disambiguation, and show that it achieves very high accuracy with only limited training. This serves as a necessary first step for applications that strive to do analytics on term mentions. Copyright 2003 ACM.
This paper presents an iterative algorithm for logic form identification in English texts. The advantage of the iterative approach is simplicity of the derivation engine while providing a performance of more than 90% ...
详细信息
ISBN:
(纸本)0889863962
This paper presents an iterative algorithm for logic form identification in English texts. The advantage of the iterative approach is simplicity of the derivation engine while providing a performance of more than 90% accuracy for noun and adverb dictionary definitions.
Large amounts of technical documentation axe available in machine readable form, however there is a lack of effective ways to access them. In this paper we propose an approach based on linguistic techniques, geared to...
详细信息
ISBN:
(纸本)3540408037
Large amounts of technical documentation axe available in machine readable form, however there is a lack of effective ways to access them. In this paper we propose an approach based on linguistic techniques, geared towards the creation of a domain-specific knowledge Base, starting from the available technical documentation. We then discuss an effective way to access the information encoded in the knowledge Base. Given a user question phrased in naturallanguage the system is capable of retrieving the encoded semantic information that most closely matches the user input, and present it by highlighting the textual elements that were used to deduct it.
暂无评论