咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >On-the-Fly Output Compression ... 收藏

On-the-Fly Output Compression for Join-Based Graph Mining Algorithms

作     者:Rasel, Mostofa Kamal Lee, Young-Koo 

作者机构:Kyung Hee Univ Dept Comp Sci & Engn Seoul South Korea 

出 版 物:《IEEE ACCESS》 (IEEE Access)

年 卷 期:2018年第6卷

页      面:64008-64022页

核心收录:

基  金:Korea Government (MEST) through the National Research Foundation of Korea [2018R1A2A2A05023669] 

主  题:Graph mining data compression encoding scheme on-the-fly-compression 

摘      要:Many join-based graph mining (JGM) algorithms, such as triangle listing and clique enumeration, typically output data of such a large size that it often dominates the mining cost. Despite the significance of the I/O cost, only a few research focused on reducing the size of the output data. However, those techniques have limitation that they are highly specific to their corresponding graph mining algorithms. In this paper, through careful observations of output patterns, we propose a general compression solution that can be applied to an arbitrary JGM algorithm. The proposed technique first categorizes the nodes of a set of cliques into two lists, which contain overlapping and non-overlapping nodes, respectively. Then, we remove the redundant nodes from multiple sets of cliques by creating a Clique Constructor Tree using the overlapping and non-overlapping nodes. Finally, we compress the clique constructor trees by applying a novel flag-aligned word encoding technique. Our proposed technique performs compression on-the-fly and can easily be adopted by various JGM algorithms. Experiments on real datasets show that our proposed technique, which is adopted in a triangle listing algorithm, reduces the size of the output data and running time by seven times and about four times, respectively. Experimental results also show that a maximal clique listing algorithm can reduce the size of the output data by a factor of three.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分