The Traditional Chinese Medicine's ancient literature recorded the massive medical theories and abundant medical experiences. To better understand and utilize, the knowledge from the literature, the Acupuncture an...
详细信息
Semi-supervised clustering improves learning performance as long as it uses a small number of labeled samples to assist un-tagged samples for *** paper implements and compares unsupervised and semi-supervised clusteri...
详细信息
Semi-supervised clustering improves learning performance as long as it uses a small number of labeled samples to assist un-tagged samples for *** paper implements and compares unsupervised and semi-supervised clustering analysis of BOA-Argo ocean text *** K-Means and Affinity Propagation(AP)are two classical clustering *** Election-AP algorithm is proposed to handle the final cluster number in AP clustering as it has proved to be difficult to control in a suitable ***-supervised samples thermocline data in the BOA-Argo dataset according to the thermocline standard definition,and use this data for semi-supervised cluster *** semi-supervised clustering algorithms were chosen for comparison of learning performance:Constrained-K-Means,Seeded-K-Means,SAP(Semi-supervised Affinity Propagation),LSAP(Loose Seed AP)and CSAP(Compact Seed AP).In order to adapt the single label,this paper improves the above algorithms to SCKM(improved Constrained-K-Means),SSKM(improved Seeded-K-Means),and SSAP(improved Semi-supervised Affinity Propagationg)to perform semi-supervised clustering analysis on the data.A DSAP(Double Seed AP)semi-supervised clustering algorithm based on compact seeds is proposed as the experimental data shows that DSAP has a better clustering *** unsupervised and semi-supervised clustering results are used to analyze the potential patterns of marine data.
Most of the existing methods for community discovery only deal with social network with a fixed structure, so they can not effectively deal with dynamic social network. This paper proposes a Multi-agent system method ...
详细信息
The core idea of clustering algorithm is the division of data into groups of similar objects. Some clustering algorithms are proven good performance on document clustering, such as k-means and UPGMA etc. However, few ...
详细信息
User-specified trust relations are often very sparse and dynamic, making them difficult to accurately predict from online social media. In addition, trust relations are usually unavailable for most social media *** is...
详细信息
User-specified trust relations are often very sparse and dynamic, making them difficult to accurately predict from online social media. In addition, trust relations are usually unavailable for most social media *** issues pose a great challenge for predicting trust relations and further building trust networks. In this study,we investigate whether we can predict trust relations via a sparse learning model, and propose to build a trust network without trust relations using only pervasively available interaction data and homophily effect in an online world. In particular, we analyze the reliability of predicting trust relations by interaction behaviors, and provide a principled way to mathematically incorporate interaction behaviors and homophily effect in a novel framework,b Trust. Results of experiments on real-world datasets from Epinions and Ciao demonstrated the effectiveness of the proposed framework. Further experiments were conducted to understand the importance of interaction behaviors and homophily effect in building trust networks.
The speech interaction in-vehicle was mainly realized by the speech recognition. The human-machine interaction around was usually disturbed by the noise, and the speech received by the receiver was not the original pu...
详细信息
In recent years, neural topic modeling has increasingly raised extensive attention due to its capacity on generating coherent topics and flexible deep neural structures. However, the widely used Dirichlet distribution...
详细信息
The quantitative understanding of human behavior is a central question of modern science. Because of the complexity of human behavior, it is almost impossible to seek regularities in human dynamics. It is assumed that...
详细信息
The quantitative understanding of human behavior is a central question of modern science. Because of the complexity of human behavior, it is almost impossible to seek regularities in human dynamics. It is assumed that human actions are randomly distributed in time in current models for human dynamics. While the characteristics of human behavior combined with the queue model are considered as model for human dynamics based on habit to explain bursts and heavy tails in human dynamics more exactly. Normal distribution is used to simulate intervals of succession of events, and random parameters are set as unexpected events disturbing habit behaviors. Moreover, duration of events are proposed to imitate continual attention to some events in human behaviors.
Structural similarity computation plays a crucial role in many applications such as in searching similar documents, in comparing chemical compounds, in finding genetic similarities, etc. We propose in this paper to us...
详细信息
ISBN:
(纸本)9781424401956
Structural similarity computation plays a crucial role in many applications such as in searching similar documents, in comparing chemical compounds, in finding genetic similarities, etc. We propose in this paper to use structural information content (SIC) for measuring structural information, considering both the nodes and edges of trees. We utilize a binary encoding approach for assigning the weights of different layer nodes and determining if some tree is a subtree of another tree. By defining a fast kernel and recursively computing SICs, we evaluate the structural information similarities of data trees to pattern trees. In the paper, we present the algorithm for calculating SICs with computation complexity of O(n), and use simple examples to instantiate the performance of the proposed method..
Identification of Transcription Factor Binding Sites (TFBS) from the upstream region of genes remains a highly important and unsolved problem particularly in higher eukaryotic genomes. In this paper, we propose a nove...
Identification of Transcription Factor Binding Sites (TFBS) from the upstream region of genes remains a highly important and unsolved problem particularly in higher eukaryotic genomes. In this paper, we propose a novel approach to identify transcription factor binding sites. This approach combines greedy method and genetic algorithm (CGGA) to search conserved segment in the given sequence set. A new greedy method which can efficiently search a local optimal result is proposed. In order to solve the high complexity of this algorithm, we also give an effective improvement for this method. Then, we describe how to combine genetic algorithm with this greedy method to find the more optimal results. Greedy method is combined to the fitness function of the genetic algorithm. We apply this approach on two different TFBS sets and the results show that it can find correct result both effective and efficient, and for CRP binding sites, it get a more accurate result than Gibbs Sampler, AlignACE and MDGA.
暂无评论