How to handle missing information is essential for system efficiency and robustness in the field of the database. Missing information in big data environment tends to have richer semantics, leading to more complex com...
详细信息
This paper presents analytical study of intruder detection system in big data environment. In recent years, the size of data increases at high speed, through daily increasing in number of people and online application...
详细信息
In this modern society, migraine belongs to a kind of common disease. The hospitals begin to use clinical decision support system a lot to improve the accuracy of diagnosis, this kind of system can help the physicians...
详细信息
Low-power processors have attracted attention due to their energy-efficiency. A large market, such as the mobile one, relies on these processors for this very reason. Even High Performance computing (HPC) systems are ...
详细信息
ISBN:
(纸本)9781538678794
Low-power processors have attracted attention due to their energy-efficiency. A large market, such as the mobile one, relies on these processors for this very reason. Even High Performance computing (HPC) systems are starting to consider low-power processors as a way to achieve exascale performance within 20MW, however, they must meet the right performance/Watt balance. Current low-power processors contain in-order cores, which cannot re-order instructions to avoid data dependency-induced stalls. Whilst this is useful to reduce the chip's total power consumption, it brings several challenges. Due to the evolving performance gap between memory and processor, memory is a significant bottleneck. In-order cores cannot reorder instructions and are memory latency bound, something data prefetching can help alleviate by ensuring data is readily available. In this work, we do an exhaustive analysis of available data prefetching techniques in state-of-the-art in-order cores. We analyze 5 static prefetchers and 2 dynamic aggressiveness and destination mechanisms applied to 3 data prefetchers on a set of HPC mini- and proxy-applications, whilst running on in-order processors. We show that next-line prefetching can achieve nearly top performance with a reasonable bandwidth consumption when throttled, whilst neighbor prefetchers have been found to be best, overall.
The proceedings contain 60 papers. The special focus in this conference is on advanceddatamining and Applications. The topics include: Effective monotone knowledge integration in kernel support vector machines;textu...
ISBN:
(纸本)9783319495859
The proceedings contain 60 papers. The special focus in this conference is on advanceddatamining and Applications. The topics include: Effective monotone knowledge integration in kernel support vector machines;textual cues for online depression in community and personal settings;confidence-weighted bipartite ranking;mining distinguishing customer focus sets for online shopping decision support;community detection in networks with less significant community structure;prediction-based, prioritized market-share insight extraction;interrelationships of service orchestrations;outlier detection on mixed-type data;an energy-based approach;low-rank feature reduction and sample selection for multi-output regression;biologically inspired pattern recognition for E-nose sensors;improving cytogenetic search with GPUs using different string matching schemes;a framework for interlinking smart things in the internet of things;efficient mining of pan-correlation patterns from time course data;dynamic reverse furthest neighbor querying algorithm of moving objects;an ensemble approach for better truth discovery;single classifier selection for ensemble learning;community detection in dynamic attributed graphs;secure computation of skyline query in mapreduce;recommending features of mobile applications for developer;causality-guided feature selection;temporal interaction biased community detection in social networks;extracting key challenges in achieving sobriety through shared subspace learning;unified weighted label propagation algorithm using connection factor;metric learning for cold-start recommendations;time series forecasting on engineering systems using recurrent neural networks;effective traffic flow forecasting using taxi and weather data and discovering trip hot routes using large scale taxi trajectory data.
In today's data-driven economy, operators that integrate vast stores of fundamental reservoir and production data with the highperformance predictive analytics solutions can emerge as winners in the contest of max...
详细信息
ISBN:
(纸本)9781510862531
In today's data-driven economy, operators that integrate vast stores of fundamental reservoir and production data with the highperformance predictive analytics solutions can emerge as winners in the contest of maximizing estimated ultimate recovery (EUR). The scope of this study is to demonstrate a new workflow coupling earth sciences with data analytics to operationalize well completion optimization. The workflow aims to build a robust predictive model that allows users to perform sensitivity analysis on completion designs within a few hours. Current workflows for well completion and production optimization in unconventional reservoirs require extensive earth modeling, fracture simulation, and production simulations. With considerable effort and wide scale of sensitivity, studies could enable optimized well completion design parameters such as optimal cluster spacing, optimal proppant loading, optimal well spacing, etc. Yet, today, less than 5% of the wells fractured in North America are designed using advanced simulation due to the required level of data, skillset, and long computing times. Breaking these limitations through parallel fracture and reservoir simulations in the cloud and combining such simulation with data analytics and artificial intelligence algorithms helped in the development of a powerful solution that creates models for fast, yet effective, completion design. The approach was executed on Eagle Ford wells as a case study in 2016. Over 2000 data points were collected with completion sensitivity performed on a multithreaded cluster environment on these wells. advanced machine learning and datamining algorithms of data analytics such as random forest, gradient boost, linear regression, etc. were applied on the data points to create a proxy model for the fracturing and numerical production simulator. With the gradient boost technique, over 90% accuracy was achieved between the proxy model and the actual results. Hence, the proxy model could predict
The concept of big data comes into concern when challenges have been identified in digital capacity, velocity, and type of the data gathered. The domain of data source is diverse: social networking, mobile phones, sen...
详细信息
Traffic accidents constitutes the first cause of death and injury in many developed countries. However, traffic accidents information and data provided by public organisms can be exploited to classify these accidents ...
详细信息
The proceedings contain 86 papers. The topics discussed include: active semi-supervised classification based on multiple clustering hierarchies;combining static and dynamic features for multivariate sequence classific...
ISBN:
(纸本)9781509052066
The proceedings contain 86 papers. The topics discussed include: active semi-supervised classification based on multiple clustering hierarchies;combining static and dynamic features for multivariate sequence classification;harvester: influence optimization in symmetric interaction networks;a framework for description and analysis of sampling-based approximate triangle counting algorithms;limiting the diffusion of information by a selective PageRank-preserving approach;anomaly detection in automobile control network data with long short-term memory networks;infinite Langevin mixture modeling and feature selection;efficient identification of Tanimoto nearest neighbors;a parallel framework for grid-based bottom-up subspace clustering;impact of query sample selection bias on information retrieval system ranking;learning multifaceted latent activities from heterogeneous mobile data;the semantic knowledge graph: a compact, auto-generated model for real-time traversal and ranking of any relationship within a domain;what would a data scientist ask? automatically formulating and solving predictive problems;behavior-oriented time segmentation for mining individualized rules of mobile phone users;and online experimentation diagnosis and troubleshooting beyond AA validation.
暂无评论