Translation techniques are often employed by cross-lingual ontology mapping (CLOM) approaches to turn a cross-lingual mapping problem into a monolingual mapping problem which can then be solved by state of the art mon...
详细信息
We present a language independent approach for conflation that does not depend on predefined rules or prior knowledge of the target language. The proposed unsupervised method is based on an enhancement of the pure n-g...
详细信息
Integration of disparate information resources has long been a significant research topic. Semantic approaches can help by allowing expression of concepts divorced from syntax and allowing rich, structured meta-data t...
详细信息
The majority of studies in Personalized Information Retrieval (PIR) literature have focused on monolingual IR, and only relatively little work has been done concerning multilingual IR. In this paper we propose a novel...
详细信息
The explosive growth of the Internet has seen it exceed over two billion users in 2010. However an analysis of the demography of this user base indicates an ever growing diversity. Currently only 38.8% of internet use...
详细信息
ISBN:
(纸本)9781450308977
The explosive growth of the Internet has seen it exceed over two billion users in 2010. However an analysis of the demography of this user base indicates an ever growing diversity. Currently only 38.8% of internet users originate from the countries such as Europe, America and Australia whereas 61.2% internet users come from the Africa, Asia and Middle East1. Moreover, these figures are changing even farther in favour of Africa, Asia and Middle East countries since their current internet penetration levels are relatively low e.g. the penetration of the internet in China/Asia is only at 21%, and Africa is only 10%. It is clear that the diversity of the user base of the web is growing rapidly. Moreover research is showing that each individual uses the WWW in different ways that suit their own personal needs, preferences. However, it is also clear that these differences extends far beyond just the appropriateness of content selection, and encompasses many dimensions e.g. tasks & activities, cultural preferences, language and social interaction etc. From a language diversity perspective, this growing diversity of internet users is increasingly apparent with English only accounting for 27% of all languages on the Internet in 2010. Other evidence of user diversity is demonstrated in social networking sites such as Facebook where in 2007 it supported 50M users in only one language (English) whilst by 2010 it had grown to 600M users and supported 77 different languages. By 2010 55% (approximately 13.75 Billion) tweets on Twitter were non-English. The expansion of the internet is not just in user number but has also resulted in vast quantities and great diversity of WWW accessible content where user generated content has for some time exceeded traditional web hosted content. In 2011, mobile access to the Internet and WWW has exceeded that accessed from desktop computers. Increasingly digital content on the internet is reaching users, not just through traditional web queries b
Community mining is a prominent approach for identifying (user) communities in social and ubiquitous contexts. While there are a variety of methods for community mining and detection, the effective evaluation and vali...
详细信息
With the increased popularity of Web 2.0 services in the last years data privacy has become a major concern for users. The more personal data users reveal, the more difficult it becomes to control its disclosure in th...
详细信息
ISBN:
(纸本)9781450307321
With the increased popularity of Web 2.0 services in the last years data privacy has become a major concern for users. The more personal data users reveal, the more difficult it becomes to control its disclosure in the web. However, for Web 2.0 service providers, the data provided by users is a valuable source for offering effective, personalised data mining services. One major application is the detection of spam in social bookmarking systems: in order to prevent a decrease of content quality, providers need to distinguish spammers and exclude them from the system. They thereby experience a conflict of interests: on the one hand, they need to identify spammers based on the information they collect about users, on the other hand, they need to respect privacy concerns and process as few personal data as possible. It would therefore be of tremendous help for system developers and users to know which personal data are needed for spam detection and which can be ignored. In this paper we address these questions by presenting a data privacy aware feature engineering approach. It consists of the design of features for spam classification which are evaluated according to both, performance and privacy conditions. Experiments using data from the social bookmarking system BibSonomy show that both conditions must not exclude each other.
The heterogeneity of learner models in structure, syntax and semantics makes sharing them a significant challenge for existing educational web systems. Creating mappings between the different types of learner models i...
详细信息
ISBN:
(纸本)9789898425515
The heterogeneity of learner models in structure, syntax and semantics makes sharing them a significant challenge for existing educational web systems. Creating mappings between the different types of learner models is one technique that is used when attempting to overcome these issues. This paper presents an overview of research currently being conducted in the area of learner model exchange and defines a categorization, derived from existing educational web systems, of the different mapping types that are required for learner model mapping. Following this, a framework is presented that supports the creation and validation of these different mapping types and the exchange of learner information between multiple heterogeneous educational web systems.
We propose a radial user interface which supports phrasing and interactive visual refinement of vague queries in order to search and explore large document sets. The core idea is to provide an integrated view of queri...
详细信息
We propose a radial user interface which supports phrasing and interactive visual refinement of vague queries in order to search and explore large document sets. The core idea is to provide an integrated view of queries and related results, where both queries and results can be interactively manipulated and changes are immediately visualized. Furthermore, the relevance of queries and results can be gradually changed and thus it is possible for a user to explore effects even of slight query changes. Besides the interface itself, we present results of a first user study. The proposed interface can be applied in many interactive text retrieval scenarios. However, it can also be used to support decision making processes where an exploration and interpretation of complex data sets is required.
Federated policy systems are required to support the emergent complexity and organizational heterogeneity of modern Internet service delivery. This paper presents a distributed policy management approach which utilize...
详细信息
暂无评论