In this paper we review the main intermediate forms proposed in text mining, and we briey study some fuzzy counterparts. The concept of intermediate form applies to any knowledge rep- resentation employed to represent...
详细信息
ISBN:
(纸本)8476538723
In this paper we review the main intermediate forms proposed in text mining, and we briey study some fuzzy counterparts. The concept of intermediate form applies to any knowledge rep- resentation employed to represent in a structured way the semantic content of a text corpus. In- termediate forms play a central role in the text mining process since it is necessary to transform plain text into a form in order to apply mining techniques. Since the semantics of text use to be imprecise, the use of fuzzy intermediate forms seems to be a natural solution in many cases. We discuss about fuzzy intermediate forms and the corresponding fuzzy text mining techniques that may be applicable on them.
This paper analyzes the role that membrane dissolution rules play in order to characterize (in the framework of recognizer P systems with membrane creation) the tractability of decision problems -that is, the efficien...
详细信息
This paper describes a functioning system that associates semantic annotations (in the form of a table of triples of identifiers) with digital video. The first part describes a subsystem (in existence since 2002, desk...
详细信息
This paper describes a functioning system that associates semantic annotations (in the form of a table of triples of identifiers) with digital video. The first part describes a subsystem (in existence since 2002, desktop version 1995) of adding annotations to digital video. The second part builds on that subsystem, creating additional semantic annotations that enable semantic-web retrieval. The key component of the second subsystem is a glossary for the text being indexed, created by the user in interaction with Princeton's WordNet. The resulting glossary is a microformat within an XHTML document, which we validate using our microformat validator. Additional techniques for harvesting metadata from the domain expert are described.
Generic web search is designed to serve all users, independent of the individual needs and without any adaptation to personal requirements. We propose a novel technique1 that performs post-categorization to the result...
详细信息
ISBN:
(纸本)0769522742
Generic web search is designed to serve all users, independent of the individual needs and without any adaptation to personal requirements. We propose a novel technique1 that performs post-categorization to the results of popular search engines at the client's side. A user profile is built based on user's choices from a category hierarchy (explicitly given requirements) and user's search history (implicitly logged choices). Caching is utilized in order to provide improved responses. An experimental prototype has been implemented based on results coming from a popular search engine. The experimental results indicate strongly that the proposed mechanism is both effective and efficient.
In this report, different fiber-chip coupling concepts are presented. Automated coupling process, as well as different modules for laser welding, gluing and mechanical fixation are also discussed. Thermal stress tests...
详细信息
In this report, different fiber-chip coupling concepts are presented. Automated coupling process, as well as different modules for laser welding, gluing and mechanical fixation are also discussed. Thermal stress tests are reported
We present a browser extension to dynamically learn to filter unwanted images (such as advertisements or flashy graphics) based on minimal user feedback. To do so, we apply the weighted majority algorithm using pieces...
详细信息
ISBN:
(纸本)1595930515
We present a browser extension to dynamically learn to filter unwanted images (such as advertisements or flashy graphics) based on minimal user feedback. To do so, we apply the weighted majority algorithm using pieces of the Uniform Resource Locators of such images as predictors. Experimental results tend to confirm that the accuracy of the predictions converges quickly to very high levels.
Due to the convenience of pervasive information environment, many people use various computing devices to perform plenty kinds of tasks. In the field of education, there are various applications to facilitate learner,...
详细信息
ISBN:
(纸本)0769522459
Due to the convenience of pervasive information environment, many people use various computing devices to perform plenty kinds of tasks. In the field of education, there are various applications to facilitate learner, especially for e-learning. However, some computing devices suffer from the limited resources and can not accept rich information content. Therefore, the information content has to be tailored into different kinds of presentation depending on the types of computing devices. Context sensitivity is an application software system's ability to sense and analyze context from various sources. In this paper, we aim to customize static documents using context-sensitive middleware (CSM) to sense the computing device, and then using the agent-based parser to provide suitable content representation dynamically.
Building decentralised Information Retrieval Systems is one of the key challenges for the future Internet. These systems will most probably take the shape of open multiagent systems - societies of Information Agents t...
详细信息
ISBN:
(纸本)0780385667
Building decentralised Information Retrieval Systems is one of the key challenges for the future Internet. These systems will most probably take the shape of open multiagent systems - societies of Information Agents that coordinate in a peer-to-peer fashion. In this paper we first analyse current IR methods so as to identify shortcomings in scalability and personalization of the information services. We then draw upon ideas from peer-to-peer technologies, to determine the requirements, and sketch the structure, of future societies of Information Agents.
We describe a method to automatically discover translation collocations from a bilingual corpus and how these improve a machine translation system. The process of inference of collocations is iterative: An alignment i...
详细信息
ISBN:
(纸本)9781586034528
We describe a method to automatically discover translation collocations from a bilingual corpus and how these improve a machine translation system. The process of inference of collocations is iterative: An alignment is used to derive an initial set of collocations, these are used in turn to improve the alignment and this new alignment is used to generate new collocations. This process is repeated until no more collocations are found. The final alignment and the set of collocations are used to train a translation model. We use a model that is based on finite state transducers and word clusters and has been modified to work with collocations in addition to single words. We present experiments in which we show that automatic collocations improve translation quality without prior linguistic information.
We study the quality of LP-based approximation methods for pure combinatorial problems. We found that the quality of the LPrelaxation is a direct function of the underlying constrainedness of the combinatorial problem...
详细信息
暂无评论