检索结果-内蒙古大学图书馆

Parallel Implementation of apriori algorithm Based on MapReduce

INTERNATIONAL JOURNAL OF NETWORKED AND DISTRIBUTED COMPUTING 2013年第2期1卷 89-96页

作者： Li, Ning Zeng, Li He, Qing Shi, Zhongzhi Chinese Acad Sci Inst Comp Technol Key Lab Intelligent Informat Proc Beijing 100190 Peoples R China Grad Univ Chinese Acad Sci Beijing 100139 Peoples R China Hebei Univ Coll Math & Comp Sci Key Lab Machine Learning & Computat Intelligence Baoding 071002 Hebei Peoples R China

Searching frequent patterns in transactional databases is considered as one of the most important data mining problems and apriori is one of the typical algorithms for this task. Developing fast and efficient algorithms that can handle large volumes of data becomes a challenging task due to the large databases. In this paper, we implement a parallel apriori algorithm based on MapReduce, which is a framework for processing huge datasets on certain kinds of distributable problems using a large number of computers (nodes). The experimental results demonstrate that the proposed algorithm can scale well and efficiently process large datasets on commodity hardware.

关键词： apriori algorithm Frequent itemsets MapReduce Parallel implementation Large database

来源：评论

学校读者我要写书评

暂无评论

Packet Signature Mining for Application Identification Using an Improved apriori algorithm 3

Packet Signature Mining for Application Identification Using...

引用

3rd IEEE International Conference on Progress in Informatcs and Computing (IEEE PIC)

作者： Tao, Linhui Liu, Guangjie Liu, Weiwei Dai, Yuewei Nanjing Univ Sci & Technol Sch Automat Nanjing Jiangsu Peoples R China Jiangsu Univ Sci & Technol Sch Elect & Informat Zhenjiang Peoples R China

ISBN: (纸本)9781467390880

Extracting packet signatures automatically and accurately are the foundation of traffic identification for most network monitoring and forensics application. The apriori algorithm is a common and useful method to fulfill the task. For huge amount Internet traffic, the traditional apriori algorithm, produce huge candidate itemsets and will occupy large I/O costs in scanning database. An improvement method is proposed in this paper. Based on the pruning to the candidate and the public signature database, it dynamically reduced the number of the scanning itemsets to make the scanning efficient. The experiment proved that the proposed algorithm can also effectively improve the mining rate.

关键词： apriori algorithm deep packet inspecting signatures of protocols introduction traffic identification

来源：评论

学校读者我要写书评

暂无评论

Mining Medical Data to Identify Frequent Diseases using apriori algorithm

Mining Medical Data to Identify Frequent Diseases using Apri...

引用

International Conference on Pattern Recognition, Informatics and Mobile Engineering (PRIME)

作者： Ilayaraja, M. Meyyappan, T. Alagappa Univ Dept Comp Sci & Engn Karaikkudi Tamil Nadu India

ISBN: (纸本)9781467358453

The data mining is a process of analyzing a huge data from different perspectives and summarizing it into useful information. The information can be converted into knowledge about historical patterns and future trends. Data mining plays a significant role in the field of information technology. Health care industry today generates large amounts of complex data about patients, hospitals resources, diseases, diagnosis methods, electronic patients records, etc,. The data mining techniques are very useful to make medicinal decisions in curing diseases. The healthcare industry collects huge amount of healthcare data which, unfortunately, are not "mined" to discover hidden information for effective decision making. The discovered knowledge can be used by the healthcare administrators to improve the quality of service. In this paper, authors developed a method to identify frequency of diseases in particular geographical area at given time period with the aid of association rule based apriori data mining technique.

关键词： Frequent Diseases Data Mining Medical Data Association Rule apriori algorithm

来源：评论

学校读者我要写书评

暂无评论

Research on parallelization of apriori algorithm in association rule mining 10

Research on parallelization of Apriori algorithm in associat...

引用

10th Annual International Conference of Information and Communication Technology (ICICT)

作者： Wang, Huan-Bin Gao, Yang-Jun Airforce Engn Univ Coll Equipment Management & Unmanned Aerial Vehic Xian 710051 Shanxi Peoples R China

Aiming at the performance bottleneck of traditional apriori algorithm when the data set is slightly large, this paper adopts the idea of parallelization and improves the apriori algorithm based on MapReduce model. Firstly, the local frequent itemsets on each sub node in the cluster are calculated, then all the local frequent itemsets are merged into the global candidate itemsets, and finally, the frequent itemsets that meet the conditions are filtered according to the minimum support threshold. The advantage of the improved algorithm is that it only needs to scan the transaction database twice and calculate the frequent item set in parallel, which improves the efficiency of the algorithm. (C) 2021 The Authors. Published by Elsevier B.V.

关键词： Association rules apriori algorithm MapReduce Parallelization

来源：评论

学校读者我要写书评

暂无评论

Reliable frequent itemsets mining with actor-based apriori algorithm

Reliable frequent itemsets mining with actor-based Apriori a...

引用

44th WILGA Symposium on Photonics Applications and Web Engineering

作者： Puscian, Marek Warsaw Univ Technol Inst Comp Sci Nowowiejska 15-19 PL-00665 Warsaw Poland

ISBN: (纸本)9781510630666

This paper presents an actor-based apriori algorithm enhanced with fault tolerance mechanism. All phases of the algorithm including candidate generation and support counting operations are performed by asynchronous actors. When an error occurs during the execution of the algorithm, calculations are interrupted locally for specific actors. The actor state is restored from the snapshot and the operations that caused the failure are either repeated or skipped. Other actors progress with their current tasks. The algorithm can be executed in parallel and distributed environments. Proposed enhancements have been successfully implemented using JAVA and Akka library. This paper discusses the results of the performance of actor-based apriori algorithm against different datasets. The presented approach has been illustrated with many experiments and measurements performed using multiprocessor and multithreaded computer.

关键词： apriori algorithm frequent items mining fault tolerance algorithm parallelization

来源：评论

学校读者我要写书评

暂无评论

AN IMPLEMENTATION OF IMPROVED apriori algorithm

AN IMPLEMENTATION OF IMPROVED APRIORI ALGORITHM

引用

International Conference on Machine Learning and Cybernetics

作者： Yang, Gang Zhao, Hong Wang, Lei Liu, Ying Hebei Univ Coll Math & Comp Sci Baoding 071002 Peoples R China Univ Agr Coll Informat Sci & Technol Baoding 071002 Peoples R China Hebei Software Inst Coll Network Engn Baoding 071002 Peoples R China

ISBN: (纸本)9781424447053

Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner. Association rules are highly popular data mining method. Association rules show attributes value conditions that occur frequently together in a given dataset And apriori is an efficient association rule mining algorithm.

关键词： Data mining Association rules apriori algorithm

来源：评论

学校读者我要写书评

暂无评论

The Optimization of apriori algorithm Based on Directed Network

The Optimization of Apriori Algorithm Based on Directed Netw...

引用

3rd International Symposium on Intelligent Information Technology Application

作者： Wang, Yan-hua Feng, Xia Civil Aviat Univ China Sch Comp Sci & Technol Tianjin Peoples R China

ISBN: (纸本)9780769538594

Association rule mining is an important topic in data mining field. On the basis of the association rule mining and apriori algorithm, this paper proposes an improved algorithm based on the directed network It reduces consumption and improve the efficiency of algorithms by reduce scanning datasets and improving the efficiency of the pruning step. Finally, this paper gives an experiment to analyze and compare the difference between the two algorithms and the result shows that the improved algorithm promotes the efficiency of computing.

关键词： association rule apriori algorithm Directed Network Distance Matrix Path

来源：评论

学校读者我要写书评

暂无评论

The Java Implementation of apriori algorithm Based on Agile Design Principles

The Java Implementation of Apriori algorithm Based on Agile ...

引用

3rd IEEE International Conference on Computer Science and Information Technology (ICCSIT)

作者： Li, Yong Xinjiang Normal Univ Coll Comp Sci & Technol Urumqi Peoples R China

ISBN: (纸本)9781424455379

Association rules model is widely used in data mining and the apriori is the most famous association rule mining algorithm. Based on the classic apriori association rules algorithm, this paper gives the UML class design diagram based on the agile design principles and selects the popular OOP language Java to achieve. In practical applications, it can be used in a variety of data application.

关键词： apriori algorithm UML Java Implementation Agile Design Principles

来源：评论

学校读者我要写书评

暂无评论

Transformer Defect Correlation Analysis Based on apriori algorithm 5

Transformer Defect Correlation Analysis Based on Apriori Alg...

引用

IEEE International Conference on High Voltage Engineering and Application (ICHVE)

作者： Chen, Yufeng Du, Xiuming Zhou, Liwei Shandong Elect Power Co Shandong Elect Power Res Inst Jinan Shandong Peoples R China Chongqing Univ Sch Elect Engn State Key Lab Power Transmiss Equipment & Syst Se Chongqing Peoples R China

ISBN: (纸本)9781509004966

The association rule from data mining technology was applied into transformer defect analysis so that the frequent pattern, the dependency and the causality between classification and decision attributes could be found based on data of defects. As a result, correlation properties among grid fault elements were seized macroscopically. In this paper which focused on the frequent item mining algorithm research for transformer defect correlation analysis, the definitions related to the association rule were introduced. Specific to weaknesses of traditional apriori algorithm, an efficient analogous frequent item set mining algorithm was presented. With regard to the instance, association rule analysis was carried out for data of transformer defect in Shandong. Relevant results indicated that diverse attribute items were undoubtedly associated with each other to different degrees;in addition, the correlation obtained was adopted to perform operational maintenance for auxiliary equipment and parts, etc. that are vulnerable to defects.

关键词： transformer defect correlation analysis data mining apriori algorithm

来源：评论

学校读者我要写书评

暂无评论

Associating IDS Alerts by an Improved apriori algorithm

Associating IDS Alerts by an Improved Apriori Algorithm

引用

3rd International Symposium on Intelligent Information Technology and Security Informatics

作者： Wang Taihua Guo Fan Jiangxi Normal Univ Sch Comp & Informat Engn Nanchang 330022 Jiangxi Peoples R China

ISBN: (纸本)9780769540207

Among a large number of association rule mining algorithms, apriori algorithm is the most classic one, but the apriori algorithm has three deficiencies, namely: the need for scanning databases many times, generating a large number of Candidate Anthology, as well as frequent itemsets iteratively. The paper presents a method that solves the maximal frequent itemsets through one intersection operation. The degree of support is obtained through the times of intersection without having to scan the transaction database, by numbering some of the properties to reduce memory space and search the candidate set list easily, thereby enhancing the efficiency of the algorithm. Finally, it can generate association rules for Intrusion Detection System. Experimental results show that the optimized algorithm can effectively improve the efficiency of mining association rules.

关键词： data mining association rules apriori algorithm itemsets

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：