An important and widespread topic in cloud computing is text *** often use topic model which is a popular and effective technology to deal with related *** all the topic models,sLDA is acknowledged as a popular superv...
详细信息
ISBN:
(纸本)9781509012473
An important and widespread topic in cloud computing is text *** often use topic model which is a popular and effective technology to deal with related *** all the topic models,sLDA is acknowledged as a popular supervised topic model,which adds a response variable or category label with each document,so that the model can uncover the latent structure of a text dataset as well as retains the predictive power for supervised ***,sLDA needs to process all the documents at each iteration in the training *** the size of dataset increases to the volume that one node cannot deal with,sLDA will no longer be *** this paper we propose a novel model named *** which extends sLDA with stochastic variational inference(SVI) and *** can reduce the computational burden of sLDA and MapReduce extends the algorithm with *** makes the training become more efficient and the training method can be easily implemented in a large computer cluster or cloud *** results show that our approach has an efficient training process,and similar accuracy with sLDA.
The problem of community detection in networks has received wide attention and proves to be computationally challenging. In recent years, with the surge of signed networks with positive links and negative links, to fi...
详细信息
The problem of community detection in networks has received wide attention and proves to be computationally challenging. In recent years, with the surge of signed networks with positive links and negative links, to find community structure in such signed networks has become a research focus in the area of network science. Although many methods have been proposed to address the problem, their performance seriously depends on the predefined optimization objectives or heuristics which are usually difficult to accurately describe the intrinsic structure of community. In this study, we present a statistical inference method for community detection in signed networks, in which a probabilistic model is proposed to model signed networks and the expectation-maximization–based parameter estimation method is deduced to find communities in signed networks. In addition, to efficiently analyze signed networks without any a priori information, a model selection criterion is also proposed to automatically determine the number of communities. In our experiments, the proposed method is tested in the synthetic and real-word signed networks and compared with current methods. The experimental results show the proposed method can more efficiently and accurately find the communities in signed networks than current methods. Notably, the proposed method is a mathematically principled method.
Partially observable Markov decision processes (POMDPs) provide a rich mathematical framework for planning tasks in partially observable stochastic environments. The notion of the covering number, a metric of captur...
详细信息
Partially observable Markov decision processes (POMDPs) provide a rich mathematical framework for planning tasks in partially observable stochastic environments. The notion of the covering number, a metric of capturing the search space size of a POMDP planning problem, has been proposed as a complexity measure of approximate POMDP planning. Existing theoretical results are based on POMDPs with finite and discrete state spaces and measured in the l1- metric space. When considering heuristics, they are assumed to be always admissible. This paper extends the theoretical results on the covering numbers of different search spaces, including the newly defined space reachable under inadmissible heuristics, to the ln-metric spaces. We provide a simple but scalable algorithm for estimating covering numbers. Experimentally, we provide estimated covering numbers of the search spaces reachable by following different policies on several benchmark problems, and analyze their abilities to predict the runtime of POMDP planning algorithms.
To overcome the drawbacks of traditional convex evidence, in this paper we proposed a modified convex evidence theory model, we presented the modified combination function and use it to combine mass function of ordere...
详细信息
ISBN:
(纸本)9781509045006
To overcome the drawbacks of traditional convex evidence, in this paper we proposed a modified convex evidence theory model, we presented the modified combination function and use it to combine mass function of ordered propositions, we present the calculation of the parameters of the proposed combination function, and proposed a more accurate method to find the proposition which is most likely true. The theoretical analysis and experimental results demonstrate that the proposed method has higher accuracy than traditional convex evidence.
With the rapid development of information technology, semantic web data present features of massiveness and complexity. As the data-centric science, social computing have great influence in collecting and analyzing se...
详细信息
Gaussian LDA integrates topic modeling with word embeddings by replacing discrete topic distribution over word types with multivariate Gaussian distribution on the embedding space. This can take semantic information o...
详细信息
Predicting epidemic dynamics is of great value in understanding and controlling diffusion processes, such as infectious disease spread and information propagation. This task is intractable, especially when surveillanc...
详细信息
Peak search for three-dimensional rotation electron diffraction image is almost the most important step in crystal structure determination. The difference of Gaussian method is the traditional approach for this task w...
详细信息
ISBN:
(纸本)9781509055227
Peak search for three-dimensional rotation electron diffraction image is almost the most important step in crystal structure determination. The difference of Gaussian method is the traditional approach for this task with the disadvantage that values of three tunable parameters need to be determined by users. To address this drawback, this paper presents a local gradient based peak search algorithm that needs only one tunable parameter. Experiments show that our proposed method is as effective as the difference of Gaussian method in peak detection task, but has obvious advantages in speed and convenience.
暂无评论