sequential pattern mining is aimed at extracting correlations among temporal data. Many different methods were proposed to either enumerate sequences of set valued data (i.e., itemsets) or sequences containing dimensi...
详细信息
sequential pattern mining is aimed at extracting correlations among temporal data. Many different methods were proposed to either enumerate sequences of set valued data (i.e., itemsets) or sequences containing dimensional items. However, in real-world scenarios, data sequences are described as combination of both multidimensional items and itemsets. These heterogeneous descriptions cannot be handled by traditional approaches. In this paper we propose a new approach called MMISP (Mining multidimensional Itemset sequentialpatterns) to extract patterns from complex sequential database including both multidimensional items and itemsets. The novelties of the proposal lies in: (i) the way in which the data are efficiently compressed;(ii) the ability to reuse and adopt sequential pattern mining algorithms and (iii) the extraction of new kind of patterns. We introduce a case-study on real-world data from a regional healthcare system and we point out the usefulness of the extracted patterns. Additional experiments on synthetic data highlights the efficiency and scalability of the approach MMISP.
A scalable and effective algorithm called AMGMSP (Approximate Mining of Global multidimensional sequential patterns) is proposed to solve the problem of mining the multidimensional sequential patterns for large databa...
详细信息
ISBN:
(纸本)9781424409723
A scalable and effective algorithm called AMGMSP (Approximate Mining of Global multidimensional sequential patterns) is proposed to solve the problem of mining the multidimensional sequential patterns for large databases in the distributed environment. First, the multidimensional information is embedded into the corresponding sequences in order to convert the mining on the multidimensional sequential patterns to sequentialpatterns. Then the sequences are clustered, summarized, and analyzed on the distributed sites, and the local patterns could be obtained by the effective approximate sequential pattern mining method. Finally, the global multidimensional sequential patterns could be mined by high vote sequentialpatterns after collecting all the local patterns on one site. Both the theories and the experiments indicate that this method could simplify the problem of mining the multidimensional sequential patterns and avoid mining the redundant information. The global sequentialpatterns could be obtained effectively by the scalable method after reducing the cost of communication.
A scalable and effective algorithm called AMGMSP (Approximate Mining of Global multidimensional sequential patterns) is proposed to solve the problem of mining the multidimensional sequential patterns for large databa...
详细信息
A scalable and effective algorithm called AMGMSP (Approximate Mining of Global multidimensional sequential patterns) is proposed to solve the problem of mining the multidimensional sequential patterns for large databases in the distributed environment First, the multidimensional information is embedded into the corresponding sequences in order to convert the mining on the multidimensional sequential patterns to sequential *** the sequences are clustered, summarized, and analyzed on the distributed sites, and the local patterns could be obtained by the effective approximate sequential pattern mining ***, the global multidimensional sequential patterns could be mined by high vote sequentialpatterns after collecting all the local patterns on one *** the theories and the experiments indicate that this method could simplify the problem of mining the multidimensional sequential patterns and avoid mining the redundant *** global sequentialpatterns could be obtained effectively by the scalable method after reducing the cost of communication.
暂无评论