This paper proposes a strategy for Chinese multi-document summarization based on clustering and sentence extraction. It adopts the term vector to represent the linguistic unit in Chinese document, which obtains higher...
详细信息
This paper proposes a strategy for Chinese multi-document summarization based on clustering and sentence extraction. It adopts the term vector to represent the linguistic unit in Chinese document, which obtains higher representation quality than traditional word-based vector space model in a certain extent. As for clustering, we propose two heuristics to automatically detect the proper number of clusters: the first one makes full use of the summary length fixed by the user; the second is a stability method, which has been applied to other unsupervised learning problems. We also discuss a global searching method for sentence selection from the clusters. To evaluate our summarization strategy, an extrinsic evaluation method based on classification task is adopted. Experimental results on news document set show that the new strategy can significantly enhance the performance of Chinese multi-document summarization
Recently, many methods based on fuzzy rough sets are proposed to reduce fuzzy attributes. The common characteristic of these methods is that all of them are based on fuzzy equivalence relation. In other words, the und...
详细信息
Recently, many methods based on fuzzy rough sets are proposed to reduce fuzzy attributes. The common characteristic of these methods is that all of them are based on fuzzy equivalence relation. In other words, the underlying concept of rough sets, indispensability relation, is generalized to fuzzy equivalence relation. Here fuzzy equivalence relation is the binary relation, which is reflexive, symmetric and transitive. This paper tries to generalize the fuzzy equivalence relation to fuzzy similarity relation, which is more helpful to keeping the fuzzy information of initial data than fuzzy equivalence relation. Based on the fuzzy similarity relation, fuzzy matrix computation for information system is proposed which can be used to reduce fuzzy attributes. Firstly, fuzzy similarity relation who is isomorphic with the fuzzy similarity matrix is given as fuzzy indispensability relation. Then all the information of initial data, such as the similarity among objects and fuzzy inconsistence degree between two objects, can be represented by fuzzy similarity matrix. Secondly, by considering that the small perturbation of the fuzzy similarity matrix can be ignorable, we propose some basic concepts of knowledge reduction such as fuzzy attributes reduct, core and fuzzy significance of attributes etc in this paper. Thirdly, a heuristic algorithm based on the fuzzy significance of attributes is proposed to find close-to-minimal fuzzy attributes reduct. Finally, experimental comparisons with other methods of attributes reduction are given. The experimental results show that our method is feasible and effective
The multi-document summarizer using genetic algorithm-based sentence extraction (MSBGA) regards summarization process as an optimization problem where the optimal summary is chosen among a set of summaries formed by t...
详细信息
The multi-document summarizer using genetic algorithm-based sentence extraction (MSBGA) regards summarization process as an optimization problem where the optimal summary is chosen among a set of summaries formed by the conjunction of the original articles sentences. To solve the NP hard optimization problem, MSBGA adopts genetic algorithm, which can choose the optimal summary on global aspect. The evaluation function employs four features according to the criteria of a good summary: satisfied length, high coverage, high informativeness and low redundancy. To improve the accuracy of term frequency, MSBGA employs a novel method TFS, which takes word sense into account while calculating term frequency. The experiments on DUC04 data show that our strategy is effective and the ROUGE-1 score is only 0.55% lower than the best participant in DUC04
Wireless grids bring more challenging issues of resource allocation and task scheduling, and mobility and power management are major additional concerns in a wireless grid. Based on a proxy-based architecture, a hiera...
详细信息
Wireless grids bring more challenging issues of resource allocation and task scheduling, and mobility and power management are major additional concerns in a wireless grid. Based on a proxy-based architecture, a hierarchical scheduling model is proposed to efficiently utilize the energy of wireless nodes with respect to quality of service (QoS). The first level scheduler is responsible for mapping tasks among proxy-nodes and other fixed grid nodes; the second will conduct scheduling in each proxy-centric wireless domain. Under the guidance of four principles suitable for the intermittence, the first level performs overall scheduling based on the FIFS algorithm (first input first service). After modeling a consumed power objective function, a revised min-min heuristic algorithm is enforced to the second level in order to efficiently map tasks to wireless devices. In the latter algorithm, mobile node selection is targeted to minimize the energy consumed due to communication and computation, and its last solution is evaluated to guarantee requirement for the task deadline. This research simulation suggests that the power-aware hierarchical scheduling improves energy utilization of the overall system, and also decreases the ratio of failure of scheduled tasks in wireless grids
Dynamic Bayesian Networks (DBNs) are directed graphical models of stochastic processes, How to learn the structure of DBNs from data is a hot problem of research. In this paper the author presents an Immune evolutiona...
详细信息
A good Intrusion Detection System (IDS) should have high precision on detecting attacks and low false alarm rates. machinelearning techniques for IDS usually yield high false alarm rate. In this work, we propose to c...
详细信息
This paper is mainly to study a covering model of granular computing in a set-theoretic setting. The model is based on the assumption that a finite set of universe is granulated through a family of overlapping granule...
详细信息
Intelligibility, a vital concern of a speech transmission channel, is quantified using Speech Transmission Index (STI). The standard STI method relies on noisy test signals and thus hinders in-use measurements. Altern...
详细信息
This paper presents a new large margin learning approach, namely structured large margin machine (SLMM), which incorporates both merits of "structured" learning models and advantages of large margin learning...
详细信息
This paper investigates convergence theorems of (HNNMDs).(The discrete Hopfield network with multiple delays). We have demonstrated how to transform a network with multiple delays to a new network with a single delay....
详细信息
暂无评论