The majority of studies in Personalized Information Retrieval (PIR) literature have focused on monolingual IR, and only relatively little work has been done concerning multilingual IR. In this paper we propose a novel...
详细信息
Nowadays, there are many events reported by News Media everyday, which contains a massive number of news. People are getting more and more interested in understanding how an event evolves after it happens. News relate...
详细信息
ISBN:
(纸本)9781450327657
Nowadays, there are many events reported by News Media everyday, which contains a massive number of news. People are getting more and more interested in understanding how an event evolves after it happens. News related to the same event or similar events usually has more common entities and stronger topic correlations, which is a new perspective to study news event. Due to the complexity of event evolving process, event visualization has been a big challenge for a long time. In this paper, we design a novel four-phase framework NEI(News Event Insight) that focuses on visualizing a news event properly and clearly, namely (1)Entity Topic Modeling. We extract topics and entities through timeline. (2)Temporal Topic Correlation Analysis. Based on the topic modeling result, we design two methods to select hot topics and build links for them. (3)Keyword Extraction. Specially, we combine string frequency with syntax features and use language models to acquire candidate keywords for representing topics. (4)Visualization. Visualization demonstrates the quantifying properties of topics related to a certain event. A case study shows our framework achieves promising results on both single event and similar events. Copyright 2014 ACM.
We consider k mobile agents initially located at distinct nodes of an undirected graph (on n nodes, with edge lengths). The agents have to deliver a single item from a given source node s to a given target node t. The...
详细信息
The explosive growth of the Internet has seen it exceed over two billion users in 2010. However an analysis of the demography of this user base indicates an ever growing diversity. Currently only 38.8% of internet use...
详细信息
ISBN:
(纸本)9781450308977
The explosive growth of the Internet has seen it exceed over two billion users in 2010. However an analysis of the demography of this user base indicates an ever growing diversity. Currently only 38.8% of internet users originate from the countries such as Europe, America and Australia whereas 61.2% internet users come from the Africa, Asia and Middle East1. Moreover, these figures are changing even farther in favour of Africa, Asia and Middle East countries since their current internet penetration levels are relatively low e.g. the penetration of the internet in China/Asia is only at 21%, and Africa is only 10%. It is clear that the diversity of the user base of the web is growing rapidly. Moreover research is showing that each individual uses the WWW in different ways that suit their own personal needs, preferences. However, it is also clear that these differences extends far beyond just the appropriateness of content selection, and encompasses many dimensions e.g. tasks & activities, cultural preferences, language and social interaction etc. From a language diversity perspective, this growing diversity of internet users is increasingly apparent with English only accounting for 27% of all languages on the Internet in 2010. Other evidence of user diversity is demonstrated in social networking sites such as Facebook where in 2007 it supported 50M users in only one language (English) whilst by 2010 it had grown to 600M users and supported 77 different languages. By 2010 55% (approximately 13.75 Billion) tweets on Twitter were non-English. The expansion of the internet is not just in user number but has also resulted in vast quantities and great diversity of WWW accessible content where user generated content has for some time exceeded traditional web hosted content. In 2011, mobile access to the Internet and WWW has exceeded that accessed from desktop computers. Increasingly digital content on the internet is reaching users, not just through traditional web queries b
Despite the significant research over the last ten years, commercial ubiquitous computing environments and pervasive applications remain thin on the ground. This paper looks at the explosion in application creativity ...
详细信息
In our approach, we applied a few modifications to the 50-layered Residual Network. Our preliminary experiments with the Plant-CLEF 2016 dataset showed that the modifications improved classification performance. We ha...
详细信息
In our approach, we applied a few modifications to the 50-layered Residual Network. Our preliminary experiments with the Plant-CLEF 2016 dataset showed that the modifications improved classification performance. We have trained three models based on the modified Residual Network configuration with different combinations of trusted and noisy PlantCLEF 2017 datasets. Using confidence scores extracted from the three models, we have submitted four runs and our methods showed competitive classification performance.
This work addresses the instability in asynchronous data parallel optimization. It does so by introducing a novel distributed optimizer which is able to efficiently optimize a centralized model under communication con...
详细信息
This paper presents an undertaken research work about the development of an Adaptive Tourism Modeling System which attempts to correctly model a tourism web application user profile. This paper will follow the methodo...
详细信息
Multi-Constrained Graph Pattern Matching (MC-GPM) aims to match a pattern graph with multiple attribute constraints on its nodes and edges, and has garnered significant interest in various fields, including social-bas...
Multi-Constrained Graph Pattern Matching (MC-GPM) aims to match a pattern graph with multiple attribute constraints on its nodes and edges, and has garnered significant interest in various fields, including social-based e-commerce and trust-based group discovery. However, the existing MC-GPM methods do not consider situations where the number of each node in the pattern graph needs to be fixed, such as finding experts group with expert quantities and relations specified. In this paper, a Multi-Constrained Strong Simulation with the Fixed Number of Nodes (MCSS-FNN) matching model is proposed, and then a Trust-oriented Optimal Multi-constrained Path (TOMP) matching algorithm is designed for solving it. Additionally, two heuristic optimization strategies are designed, one for combinatorial testing and the other for edge matching, to enhance the efficiency of the TOMP algorithm. Empirical experiments are conducted on four real social network datasets, and the results demonstrate the effectiveness and efficiency of the proposed algorithm and optimization strategies.
Similarity calculation has many applications, such as information retrieval, and collaborative filtering, among many others. It has been shown that link-based similarity measure, such as SimRank, is very effective in ...
详细信息
ISBN:
(纸本)9781424452422
Similarity calculation has many applications, such as information retrieval, and collaborative filtering, among many others. It has been shown that link-based similarity measure, such as SimRank, is very effective in characterizing the object similarities in networks, such as the Web, by exploiting the object-to-object relationship. Unfortunately, it is prohibitively expensive to compute the link-based similarity in a relatively large graph. In this paper, based on the observation that link-based similarity scores of real world graphs follow the power-law distribution, we propose a new approximate algorithm, namely Power-SimRank, with guaranteed error bound to efficiently compute link-based similarity measure. We also prove the convergence of the proposed algorithm. Extensive experiments conducted on real world datasets and synthetic datasets show that the proposed algorithm outperforms SimRank by four-five times in terms of efficiency while the error generated by the approximation is small.
暂无评论