It is insufficient to search temporal text by only focusing on either time attribute or keywords today as we pay close attention to the evolution of event with time. Both temporal and textual constraints need to be co...
详细信息
ISBN:
(纸本)9781450325981
It is insufficient to search temporal text by only focusing on either time attribute or keywords today as we pay close attention to the evolution of event with time. Both temporal and textual constraints need to be considered in one single query, called Top-k Interval Keyword Query (TIKQ). In this paper, we presents a cloud-based system named INK that supports efficient execution of TIKQs with appropriate effectiveness on Hadoop and HBase. In INK, an Adaptive Index Selector (AIS) is devised to choose the better execution plan for various TIKQs adaptively based on the proposed cost model, and leverage two novel hybrid index modules (TriI and IS-Tree) to combine keyword and interval filtration seamlessly.
Finding credible pages is a challenging problem on the Web. Our key observation in this paper is that credible pages usually link to credible content-related pages, which is different from a normal page usually links ...
详细信息
Schema summarization on large-scale databases is a challenge. In a typical large database schema, a great proportion of the tables are closely connected through a few high degree tables. It is thus dificult to separat...
详细信息
knowledge management (KM) technologies and applications are fundamental to the success of KM programs. This paper aims to find out linkages between newly-developed KM theories and various technologies and their releva...
详细信息
data-centric storage is an effective and important technique in the wireless sensor networks. It stores the sensing data according to their values by mapping them to some point in the network in order to avoid routing...
详细信息
ISBN:
(纸本)9781595939111
data-centric storage is an effective and important technique in the wireless sensor networks. It stores the sensing data according to their values by mapping them to some point in the network in order to avoid routing all the values outside the network and flooding the queries. However, in most data-centric storage schemes, there is a "hotspot" problem due to the skewness of data and randomness of the mapping functions. Large number of sensor readings (events) may be routed to the same point by the predefined hashed function. In this paper, we propose a new Dynamic BAlanced data-centric Storage (DBAS) scheme, a cooperative strategy between the base station and the in-network processing in wireless sensor network. Our scheme, which utilizes the rich resources in the base station and is aware of the data distributions of the network, dynamically adjusts the mappings from readings to the storage points to balance the storage and workload in the network, as well as to reduce the cost of storing these readings. Moreover, it takes advantage of perimeter routing algorithm of the GPSR routing protocol to store multiple copies of readings to improve the robustness of the network with little overhead. Simulation results show that DBAS is more balanced and energy efficient than the traditional data-centric storage mechanism in wireless sensor network.
knowledge base question generation (KBQG) aims to generate natural language questions from a set of triplet facts extracted from KB. Existing methods have significantly boosted the performance of KBQG via pre-trained ...
详细信息
Journal discriminative capacity refers to the degree of difference between the journals in research subjects, and is of great significance for detecting the level of journal differentiation. Current research on journa...
详细信息
Big graph data is different from traditional data and they usually contain complex relationships and multiple attributes. With the help of graph pattern matching, a pattern graph can be designed, satisfying special pe...
详细信息
In the era of big data, data redundancy has become an obstacle to deep reading. The objective of linked data as a new data organization model is to transform data into structured data following unified standards. The ...
详细信息
Purpose:To reveal the research hotpots and relationship among three research hot topics in b iomedicine,namely CRISPR,iPS(induced Pluripotent Stem)cell and Synthetic ***/methodology/approach:We set up their keyword co...
详细信息
Purpose:To reveal the research hotpots and relationship among three research hot topics in b iomedicine,namely CRISPR,iPS(induced Pluripotent Stem)cell and Synthetic ***/methodology/approach:We set up their keyword co-occurrence networks with using three indicators and information visualization for metric ***:The results reveal the main research hotspots in the three topics are different,but the overlapping keywords in the three topics indicate that they are mutually integrated and interacted each *** limitations:All analyses use keywords,without any other *** implications:We try to find the information distribution and structure of these three hot topics for revealing their research status and interactions,and for promoting biomedical ***/value:We chose the core keywords in three research hot topics in biomedicine by using h-index.
暂无评论