Link Detection (abbr. LDT) is to determine whether two stories discuss the same topic in Topic Detection and Tracking (abbr. TDT) track. The key issue is to correctly measure the relevance between two stories. Most re...
详细信息
Link Detection (abbr. LDT) is to determine whether two stories discuss the same topic in Topic Detection and Tracking (abbr. TDT) track. The key issue is to correctly measure the relevance between two stories. Most researches on LDT use a series of independent words to describe stories (each story is a text specially discussing news), and the relevance between two stories is determined based on the percentage and weight of overlapping words between them. Although substantial improvement has been achieved, inadequate descriptions of word sense and semantics still have negative influences on the accuracy of LDT. In this paper we propose an online semantic tree, which is hierarchically constructed by the most relevant words extracted from previous story streams. In online semantic tree, word sense is described by a series of words in a sense closed-loop, and semantic relation among words is measured by depth and width of level that words locate in. In LDT, online semantic tree is built for each story, and the relevance between two stories is determined by measuring the KL divergence between their online semantic trees. The method performs quite well on TDT4 corpus. The Min Norm CDet of the method in testing is 0.2274 lower than that of the baseline.
PLSA(Probabilistic Latent Semantic Analysis) is a popular topic modeling technique for exploring document collections. Due to the increasing prevalence of large datasets, there is a need to improve the scalab.lity of ...
详细信息
We propose a cascaded linear model for joint Chinese word segmentation and partof- speech tagging. With a character-based perceptron as the core, combined with realvalued features such as language models, the cascaded...
详细信息
Among syntax-based translation models, the tree-based approach, which takes as input a parse tree of the source sentence, is a promising direction being faster and simpler than its string-based counterpart. However, c...
详细信息
Utility services provided by cloud computing rely on virtual customer communities forming spontaneously and evolving continuously. Clarifying the explicit boundaries of these communities is thus essential to the quali...
详细信息
The precise prediction of bus routes or the arrival time of buses for a traveler can enhance the quality of bus service. However, many social factors influence people's preferences for taking buses. These social f...
详细信息
Due to the limitation of hardware resources, the traditional people flow monitoring system based on computer vision in public places can't meet different crowd-scale scenarios. Therefore, a people flow monitoring ...
详细信息
The mobile robot adapts to the more complicated indoor and outdoor environments, and can expand its scope of application. In order to reduce the influence of the cumulative error caused by navigation in complex enviro...
详细信息
作者:
Tong, ZhanWu, ZhanYang, YangMao, WeilongWang, ShijieLi, YinshengChen, YangSoutheast University
Laboratory of Image Science and Technology Nanjing210096 China Southeast University
Ministry of Education Key Laboratory of Computer Network and Information Integration Nanjing210096 China Chinese Academy of Sciences
Research Center for Medical Artificial Intelligence Shenzhen Institutes of Advanced Technology Shenzhen518055 China School of Computer Science and Engineering
Key Lab. of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications Jiangsu Provincial Joint International Research Laboratory of Medical Information Processing The Laboratory of Image Science and Technology Nanjing210096 China
Computed Tomography (CT) is an imaging technique widely used in clinical diagnosis. However, high-attenuation metallic implants result in the obstruction of low-energy Xrays and further lead to metal artifacts in the ...
详细信息
Recent research has shown that keyword search is a friendly and potentially effective way to retrieve information of interest over relational databases. Existing work has generally focused on implementing keyword sear...
详细信息
暂无评论