版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Data Science Group Computing & Communications Lancaster University UK Chair of Excellence Carlos III University of Madrid Spain Carlos III University of Madrid Computer Science Department Spain
出 版 物:《Procedia Computer Science》
年 卷 期:2015年第53卷
页 面:9-18页
主 题:Big Data TEDA AnYa Evolving Systems for Big Data Analytics
摘 要:In the era of big data, huge amounts of data are generated and updated every day, and their processing and analysis is an important challenge today. In order to tackle this challenge, it is necessary to develop specific techniques which can process large volume of data within limited run times. TEDA is a new systematic framework for data analytics, which is based on the typicality and eccentricity of the data. This framework is spatially-aware, non-frequentist and non-parametric. TEDA can be used for development of alternative machine learning methods, in this work, we will use it for classification ( TEDAClass ). Specifically, we present a TEDAClass based approach which can process huge amounts of data items using a novel parallelization technique. Using this parallelization, we make possible the scalability of TEDAClass . In that way, the proposed approach is particularly useful for various applications, as it opens the doors for high-performance big data processing, which could be particularly useful for healthcare, banking, scientific and many other purposes.