Search Results - Inner Mongolia University Library

Performance Evaluation of Distributed Association Rule Mining Algorithms

Procedia Computer Science, 2016, vol. 79, pp. 127-134

Authors: Vinaya Sawant; Ketan Shah. Affiliations: Assistant Professor, IT Department, D. J. Sanghvi College of Engineering, Mumbai, India; Professor, IT Department, MPSTME, Mumbai, India

Association Rule Mining (ARM) is a popular and well-researched method for discovering interesting relations between variables in large databases. It is intended to identify strong rules discovered in databases using different measures of interestingness. Most ARM algorithms focus on a sequential or centralized environment where no external communication is required. Distributed ARM (DARM) algorithms aim to generate rules from different data sets spread over various geographical sites; hence, they require external communication throughout the entire process. DARM algorithm efficiency is highly dependent on data distribution. The classical algorithms used in DARM are the Count Distribution Algorithm (CDA), the Fast Distributed Mining (FDM) algorithm and the Optimized Distributed Association Mining (ODAM) algorithm. This paper presents the implementation details and experimental results of the above-mentioned algorithms. The paper also highlights the issue of message exchange size in current DARM algorithms, which can affect communication costs in a distributed environment.

Keywords: Association Rule mining; distributed data mining
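As a rough illustration of the count-exchange idea behind Count Distribution mentioned in the abstract above, here is a minimal Python sketch. It is not the authors' implementation: the function names, the in-process list of `sites` standing in for networked sites, and the summation standing in for the message exchange are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def local_counts(transactions, candidates):
    """Count, at one site, how many local transactions contain each candidate."""
    counts = Counter()
    for t in transactions:
        items = set(t)
        for c in candidates:
            if c <= items:
                counts[c] += 1
    return counts


def count_distribution(sites, min_support):
    """Level-wise mining in which sites share candidate counts, not transactions."""
    total = sum(len(s) for s in sites)
    threshold = min_support * total
    # Level-1 candidates: every single item seen at any site.
    candidates = {frozenset([i]) for s in sites for t in s for i in t}
    frequent = {}
    k = 1
    while candidates:
        # Each site counts locally; summing stands in for the count exchange.
        global_counts = Counter()
        for s in sites:
            global_counts.update(local_counts(s, candidates))
        level = {c: n for c, n in global_counts.items() if n >= threshold}
        frequent.update(level)
        # Join frequent k-itemsets into (k+1)-candidates, then prune any
        # candidate that has an infrequent k-subset (Apriori property).
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        candidates = {
            c for c in joined
            if all(frozenset(sub) in level for sub in combinations(c, k - 1))
        }
    return frequent


def confidence(frequent, antecedent, consequent):
    """Confidence of antecedent -> consequent from global support counts."""
    return frequent[antecedent | consequent] / frequent[antecedent]
```

In this sketch the only data that crosses site boundaries is one count per candidate per level, which is why the size of the candidate set drives the message volume the abstract refers to.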

Study on Distributed Data Mining Model in Wireless Sensor Networks

2010 International Conference on Intelligent Computing and Integrated Systems

Authors: Hong Yuehua; Xu Shuang; Wu Huajian. Affiliation: Yulin Normal University, Yulin, P.R. China

Aiming at the severe energy and computing resource constraints of Wireless Sensor Networks (WSN), based on rough set theory and the ART2 network, a distributed data mining model for WSN is *** model poses a three-layer MLP for data aggregation in the clustered sensor network. The input-layer and first-layer neurons are located in every cluster member, while the second-layer and output-layer neurons are located in every cluster head. The features of the training samples were extracted to build the decision table; rough set theory was applied to reduce the decision ***, the reduced decision attributes were used to construct the ART2 neural network classification data. The constructed data mining algorithm can be integrated in each sensor network *** results prove that data dimension is reduced and data redundancy is eliminated after the raw data is processed by the data mining algorithm, and that communication traffic is decreased and the life of the WSN is extended.

Keywords: wireless sensor network; distributed data mining; rough set theory; ART2 neural network

Parallelization of Data Mining Algorithms for Multicore Processors

4th Mediterranean Conference on Embedded Computing (MECO)

Authors: Kholod, Ivan; Kuprianov, Mikhail; Shorov, Andrey. Affiliation: St Petersburg Electrotech Univ LETI, Fac Comp Sci & Technol, St Petersburg, Russia

ISBN: (print) 9781479989997

The article describes an approach to parallelizing data mining algorithms for execution on multicore processors of various architectures. The suggested method presents an algorithm as a sequence of pure functions with unified interfaces. For parallel execution, additional functions are introduced to share data and models between the parallel threads. In addition, such functions make it possible to obtain various parallel algorithm structures and to implement various execution strategies for different environment conditions. Application of the described method is illustrated with the Naive Bayes algorithm.

Keywords: data mining; parallel data mining; data mining algorithms; distributed data mining; parallel algorithms
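The idea of expressing an algorithm as pure functions plus extra functions that combine models across threads can be sketched for the counting stage of Naive Bayes. This is only an illustration under assumptions: the names `add_example`, `merge` and `train_parallel`, the dictionary-of-counters model, and the use of a thread pool are not taken from the paper.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def empty_model():
    """Initial model: no class counts and no feature/class counts."""
    return {"classes": Counter(), "features": Counter()}


def add_example(model, example):
    """Pure step: return a new model with one (features, label) example counted."""
    features, label = example
    classes = model["classes"] + Counter({label: 1})
    feats = model["features"] + Counter(
        {(i, v, label): 1 for i, v in enumerate(features)}
    )
    return {"classes": classes, "features": feats}


def train(examples):
    """Sequential algorithm: fold the pure step over the data."""
    return reduce(add_example, examples, empty_model())


def merge(a, b):
    """Extra function for parallel runs: combine two partial models."""
    return {
        "classes": a["classes"] + b["classes"],
        "features": a["features"] + b["features"],
    }


def train_parallel(examples, workers=2):
    """Split the data, train one partial model per thread, then merge them."""
    chunks = [examples[i::workers] for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial = list(pool.map(train, chunks))
    return reduce(merge, partial, empty_model())
```

Because counting is commutative, `train_parallel` yields the same counts as `train` however the data is split. In standard CPython builds the global interpreter lock limits the speed-up from threads for this kind of work, so the sketch shows the structure rather than the performance.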

Privacy-Preserving Naive Bayes Classification

8th International Conference on Knowledge Science, Engineering and Management (KSEM)

Authors: Huai, Mengdi; Huang, Liusheng; Yang, Wei; Li, Lu; Qi, Mingyu. Affiliations: Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China; Univ Sci & Technol China, Suzhou Inst Adv Study, Suzhou 215123, Peoples R China

ISBN: (print) 9783319251592; 9783319251585

In this paper, we propose differentially private protocols for Naive Bayes classification over distributed data. Compared with existing works, the privacy and security models in the proposed protocols are stronger: firstly, both the miner and the parties can be arbitrarily malicious and can collude with each other to violate the remaining honest parties' privacy; secondly, all communication channels between them can be assumed to be insecure. Specifically, we build a guarantee of differential privacy into the cryptographic construction so that the proposed protocols can tolerate collusions and resist eavesdropping attacks caused by insecure communication channels. Additionally, the proposed protocols can be implemented at lower computation and communication costs, and some extensions to our protocols (e.g. supporting parties' dynamic joins or leaves) are also proposed in this paper. Both theoretical analysis and simulation results show that the proposed privacy-preserving protocols for Naive Bayes have strong security and better classification performance than the standard one.

Keywords: distributed data mining; Naive Bayes; Differential privacy

A Windowing based GPU optimized strategy for the induction of Decision Trees in JaCa-DDM

18th International Conference of the Catalan-Association-for-Artificial-Intelligence (CCIA)

Authors: Limon, Xavier; Guerra-Hernandez, Alejandro; Cruz-Ramirez, Nicandro; Acosta-Mesa, Hector-Gabriel; Grimaldo, Francisco. Affiliations: Univ Veracruzana, Ctr Invest Inteligencia Artificial, Sebastian Camacho 5, Xalapa 91000, Veracruz, Mexico; Univ Valencia, Dept Informat, Burjassot 46100, Spain

ISBN: (print) 9781614995784; 9781614995777

When inducing Decision Trees, Windowing consists of selecting a random subset of the available training instances (the window) to induce a tree, and then enhancing it by adding counter examples, i.e., instances not covered by the tree, to the window for inducing a new tree. The process iterates until all instances are well classified or no accuracy is gained. In favorable domains, the technique is known to speed up the induction process and to enhance the accuracy of the induced tree while reducing the number of training instances used. In this paper, a Windowing based strategy exploiting an optimized search for counter examples through the use of GPUs is introduced to cope with distributed data mining (DDM) scenarios. The strategy is defined and implemented in JaCa-DDM, a novel system founded on the Agents & Artifacts paradigm. Our approach is well suited for DDM problems generating large amounts of training instances. Some experiments in diverse domains compare our strategy with the traditional centralized approach, including an exploratory case study on pixel-based segmentation for the detection of precancerous cervical lesions on colposcopic images.

Keywords: Windowing; Decision Trees; GPU computation; Multi-Agent Systems; distributed data mining
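The Windowing loop described in the abstract above can be sketched as follows. This is a simplified illustration, not the JaCa-DDM strategy: the names `windowing`, `accuracy` and `learn_stump` are hypothetical, a one-split stump on a single numeric feature stands in for a real decision tree inducer, and the counter-example search that the paper accelerates on GPUs is a plain list comprehension here.

```python
import random


def accuracy(model, instances):
    """Fraction of (x, y) instances that the model classifies correctly."""
    return sum(model(x) == y for x, y in instances) / len(instances)


def learn_stump(window):
    """Toy stand-in for a tree inducer: predict True when x >= threshold."""
    values = sorted({x for x, _ in window})
    # Try every observed value, plus one past the maximum, as the threshold
    # and keep the one with the fewest errors on the window.
    best_t = min(
        values + [values[-1] + 1],
        key=lambda t: sum((x >= t) != y for x, y in window),
    )
    return lambda x: x >= best_t


def windowing(instances, learn, initial_size, seed=0):
    """Grow a training window with counter examples until accuracy stops improving."""
    rng = random.Random(seed)
    window = rng.sample(instances, initial_size)
    model = learn(window)
    best = accuracy(model, instances)
    while best < 1.0:
        seen = set(window)
        # Counter examples: instances outside the window that are misclassified.
        counter_examples = [
            inst for inst in instances
            if inst not in seen and model(inst[0]) != inst[1]
        ]
        if not counter_examples:
            break
        candidate = learn(window + counter_examples)
        score = accuracy(candidate, instances)
        if score <= best:
            break  # no accuracy gained, keep the previous model
        window, model, best = window + counter_examples, candidate, score
    return model, window
```

With a separable toy data set such as `[(x, x >= 10) for x in range(30)]`, the loop ends with a window smaller than the full data set, which is the saving in training instances the abstract refers to.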

Creation of Data Mining Cloud Service on the Actor Model

15th International Conference on Next-Generation Wired/Wireless Advanced Networks and Systems (NEW2AN) and 8th Conference on Internet of Things and Smart Spaces (ruSMART)

Authors: Kholod, Ivan; Petuhov, Ilya; Kapustin, Nikita. Affiliation: St Petersburg Electrotech Univ LETI, St Petersburg, Russia

ISBN: (print) 9783319231266; 9783319231259

This article describes an approach to building a data mining cloud service based on the actor model. It describes the mapping of an algorithm, decomposed into functional blocks, onto a set of actors. It also describes the architecture and implementation of a cloud service that executes data mining algorithms on actors. As an example, it describes the implementation of, and experiments with, a neural network learning algorithm on a cluster of actors.

Keywords: data mining; distributed data mining; Cloud computing; Actor model

MapReduce-based H-mine algorithm

Fifth International Conference on Instrumentation & Measurement, Computer, Communication, and Control (IMCCC)

Authors: Feng, Xingjie; Zhao, Jie; Zhang, Zhiyuan. Affiliation: CAUC, Comp Sci & Technol, Tianjin, Peoples R China

ISBN: (print) 9781467377232

Frequent Itemset Mining (FIM) is a very effective method for knowledge acquisition from data, but with the advent of the era of big data, traditional memory-based algorithms face severe challenges in computation speed and storage capacity. Fortunately, the MapReduce model provides an efficient framework for distributed programming and operation. This paper proposes a novel MapReduce-based H-mine algorithm (MRH-mine), a version of the H-mine algorithm for the distributed operation environment. Experimental results show that the MRH-mine algorithm has better performance and scalability than traditional H-mine when facing massive data growth.

Keywords: distributed data mining; MapReduce; H-mine; parallelization
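To make the MapReduce framing above concrete, the following sketch shows only the generic map, shuffle and reduce steps for counting frequent single items. It is not MRH-mine or H-mine; the function names and the in-memory `shuffle` that stands in for the framework's grouping step are assumptions for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(transaction):
    """Mapper: emit (item, 1) for each distinct item in one transaction."""
    return [(item, 1) for item in set(transaction)]


def shuffle(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: sum the partial counts for one item."""
    return key, sum(values)


def frequent_items(transactions, min_count):
    """Run map, shuffle and reduce, then keep items that meet the minimum count."""
    pairs = chain.from_iterable(map_phase(t) for t in transactions)
    counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
    return {item: n for item, n in counts.items() if n >= min_count}
```

Each mapper call depends only on its own transaction, so the map step can be spread over many machines, and each reducer only needs the values for its own key.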

An Efficient Approach for Privacy Preserving Distributed Mining of Association Rules in Unsecured Environment

International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Authors: Modi, Chirag N.; Patil, Ashwini R.; Doshi, Nishant. Affiliations: Natl Inst Technol Goa, Ponda, Goa, India; Natl Inst Technol Surat, Surat, Gujarat, India; MEFGI, Dept Comp Engn, Rajkot 360003, Gujarat, India

ISBN: (print) 9781479987924

Distributed data mining techniques are widely used for many applications, viz. marketing, decision making, statistical analysis, etc. In a distributed data environment, each of the participating sites holds local information that is combined to extract a global mining result. However, these techniques have been scrutinized for privacy and security concerns regarding individual sites' information. To solve this problem, many cryptographic techniques have been investigated, yet there is still room for improvement. In this paper, we propose an efficient approach for privacy-preserving distributed association rule mining. We use an onion routing protocol to exchange information among the participating sites. We use elliptic curve (EC) based cryptography to achieve security and privacy of individual sites' information in an unsecured distributed environment. Finally, we analyze the proposed solution in terms of security, privacy, computational cost and communication cost.

Keywords: distributed data mining; Association Rules; Onion Routing; Elliptic Curve Cryptography

Energy-Aware Migration of Virtual Machines Driven by Predictive Data Mining Models

23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

Authors: Altomare, Albino; Cesario, Eugenio; Talia, Domenico. Affiliations: ICAR CNR, Arcavacata Di Rende, Italy; Univ Calabria, Arcavacata Di Rende, Italy

ISBN: (print) 9781479984909

Consolidation of virtual machines (VMs) is one of the key strategies used to reduce the power consumption of Cloud servers, and for this reason it is extensively studied. Nevertheless, the effectiveness of a consolidation strategy strongly depends on the forecast of the VM resource needs. This paper describes the design and development of a system for energy-aware allocation of virtual machines, driven by predictive data mining models. In particular, migrations are driven by the forecast of the future computational needs (CPU, RAM) of each virtual machine, in order to allocate them efficiently on the available servers. Experimental results, obtained on data from a real Cloud data center, show encouraging benefits in terms of energy saving.

Keywords: distributed data mining; Energy-aware; Cloud Computing; virtual machines; data mining; Consolidation; Power consumption; Automobile driving; Migrations; Voltmeter; Servers; optical coherence tomography angiography device system parameter

Data Mining Algorithms Parallelizing in Functional Programming Language for Execution in Cluster

15th International Conference on Next-Generation Wired/Wireless Advanced Networks and Systems (NEW2AN) and 8th Conference on Internet of Things and Smart Spaces (ruSMART)

Authors: Kholod, Ivan; Malov, Aleksey; Rodionov, Sergey. Affiliations: St Petersburg Electrotech Univ LETI, St Petersburg, Russia; Motorola Solut, Business Ctr T4, St Petersburg 192019, Russia

ISBN: (print) 9783319231266; 9783319231259

This article describes an approach to parallelizing data mining algorithms implemented in a functional programming language for distributed data processing on a cluster. It provides the requirements that the functions forming these algorithms must meet for conversion into parallel form. As an example, we describe a Naive Bayes algorithm implementation in Common Lisp, its conversion into parallel form, and its execution on a cluster with an MPI system.

Keywords: data mining; distributed data mining; distributed information processing; Functional language
