One fundamental challenge in data stream processing is to cope with the ubiquity of disorder of tuples within a stream caused by network latency, operator parallelization, merging of asynchronous streams, etc. High re...
详细信息
ISBN:
(纸本)9781450332866
One fundamental challenge in data stream processing is to cope with the ubiquity of disorder of tuples within a stream caused by network latency, operator parallelization, merging of asynchronous streams, etc. High result accuracy and low result latency are two conflicting goals in out-of-order stream processing. Different applications may prefer different extent of trade-offs between the two goals. However, existing disorder handling solutions either try to meet one goal to the extreme by sacrificing the other, or try to meet both goals but have shortcomings including unguaranteed result accuracy or increased complexity in operator implementation and application logic. To meet different application requirements on the latency versus result accuracy trade-off in out-of-order stream processing, in this paper, we propose to make this trade-off user-configurable. Particularly, focusing on sliding window aggregates, we introduce AQ-K-slack, a buffer-based quality-driven disorder handling approach. AQ-K-slack leverages techniques from the fields of sampling-based approximate query processing and control theory. It can adjust the input buffer size dynamically to minimize the result latency, while respecting user-specified threshold on relative errors in produced query results. AQ-K-slack requires no a priori knowledge of disorder characteristics of data streams, and imposes no changes to the query operator implementation or the application logic. Experiments over real-world out-of-order data streams show that, compared to the state-of-art, AQ-K-slack can reduce the average buffer size, thus the average result latency, by at least 51% while respecting user-specified requirement on the accuracy of query results.
The proceedings contain 11 papers. The special focus in this conference is on Scalable Data Analytics and Big Data applications. The topics include: Scalable similarity search for big data;content-based analytics of d...
ISBN:
(纸本)9783319168678
The proceedings contain 11 papers. The special focus in this conference is on Scalable Data Analytics and Big Data applications. The topics include: Scalable similarity search for big data;content-based analytics of diffusion on social big data;multi-modal similarity retrieval with a shared distributed data store;an efficient approach for complex data summarization using multiview clustering;a novel approach for network traffic summarization;heart disease diagnosis using co-clustering;an investigation of scalable anomaly detection techniques for a large network of Wi-Fi hotspots;link scheduling for data collection in multichannel wireless sensor networks;a design of sensor data ontology for a large scale crop growth environment system and real-time data flow language processing system for handling streams of data.
This work studies utilization of shared caches by applications running concurrently on different cores of multicore systems. Knowledge about program contention due to shared resources is important for various design p...
详细信息
This work studies utilization of shared caches by applications running concurrently on different cores of multicore systems. Knowledge about program contention due to shared resources is important for various design problems concerning multicore architectures. It is needed for power estimation, scheduling of parallelapplications and design of shared memories. Moreover, deep understanding of programs behavior is especially needed for the development of accurate models that are able to predict misses caused by shared resources in multicore systems. We present a methodology that is able to examine the interaction of applications in shared caches. Our experiments show a positive impact of data sharing by minimizing misses in shared L2 caches over a wide range of L2 cache sizes for applications from the Media bench suite. Up to 25% lower misses in the last level cache can be observed for embedded applications, when data are allowed to be shared among programs running on different cores.
Cloud computing is a new computing model which uses virtualization technology, distributed computing, parallel computing and other existing technologies to achieve cloud service virtualization and economies of scale, ...
详细信息
ISBN:
(纸本)9783319198484;9783319198477
Cloud computing is a new computing model which uses virtualization technology, distributed computing, parallel computing and other existing technologies to achieve cloud service virtualization and economies of scale, whilst increasingly overwhelming cloud security issues has brought great challenges and concerns to the cloud services providers and cloud users, especially trust and privacy issues with regard to cloud computing and cloud shared storage associated security issues. In the paper, we expound the basic concepts of cloud computing, deployment models, service models and key features, analyze and outline the currently highlighted cloud security issues, report the status quo of cloud computing security, investigate the prevalent and typical cloud computing security problem key solving techniques, and thus render a comprehensive cloud computing security technical reference model, which is composed of associated cloud security solving techniques that result from inevitably multi-faceted cloud security issues. The model is expected to alleviate prominent cloud security issues. This paper generalizes cloud security technology research directions and further development space of cloud security technology and standardization.
The problem of deepening memory hierarchy towards exascale is becoming serious for applications such as those based on stencil kernels, as it is difficult to satisfy both high memory bandwidth ad capacity requirements...
详细信息
The problem of deepening memory hierarchy towards exascale is becoming serious for applications such as those based on stencil kernels, as it is difficult to satisfy both high memory bandwidth ad capacity requirements simultaneously. This is evident even today, where problem sizes of stencil-based applications on GPU supercomputers are limited by aggregated capacity of GPU device memory. Locality improvement techniques such as temporal blocking is known to preserve performance, but integrating the technique into existing stencil applications results in substantially higher programming cost, especially for complex applications and as a result are not typically utilized. We alleviate this problem with a run-time GPU-MPI process virtualization library we call HHRT that automates data movement across the memory hierarchy, and a systematic methodology to convert and optimize the code to accommodate temporal blocking. The proposed methodology has shown to significantly eases the adaptation of real applications, such as the whole-city airflow simulator embodying more than 12,000 lines of code; with careful tuning, we successfully maintain up to 85% performance even with problems whose footprint is four time larger than GPU device memory capacity, and scale to hundreds of GPUs on the TSUBAME2.5 supercomputer.
Drones have become ubiquitous in performing risky and labor intensive areal tasks cheaply and safely. To allow them to be autonomous, their flight plan needs to be pre-built for them. Existing works do not precalcu-la...
详细信息
ISBN:
(纸本)9781467384124
Drones have become ubiquitous in performing risky and labor intensive areal tasks cheaply and safely. To allow them to be autonomous, their flight plan needs to be pre-built for them. Existing works do not precalcu-late flight paths but instead focus on navigation through camera based image processingtechniques, genetic or geometric algorithms to guide the drone during flight. That makes flight navigation complex and risky. In this paper we present automated flight plan builder DIFPL which pre-builds flight plans for drones to survey a large area. The flight plans are built for subregions and fed into drones which allow them to navigate autonomously. DIFPL employs distributed paradigm on Hadoop MapReduce framework. Distribution is achieved by processing sections or subregions in parallel. Experiments performed with network and elevation datasets validate the efficiency of DIFPL in building optimal flight plans.
GPU cluster is important for high performance computing with its high performance/cost ratio. However, it is still very hard for application developers to write parallel codes on GPU. MPI is mostly used for parallel p...
详细信息
Big data is often generated incrementally in the real word. Existing incremental query optimization is mainly used in the streaming data environment. Due to the constraints of real-time streaming data applications, ex...
详细信息
The increasing demand of computation capacity has made many-core parallelprocessing (MPP) a compelling choice for computation-intensive applications. The networks-on-chip (NoC) architecture is an effective way to int...
详细信息
ISBN:
(纸本)9781479984923
The increasing demand of computation capacity has made many-core parallelprocessing (MPP) a compelling choice for computation-intensive applications. The networks-on-chip (NoC) architecture is an effective way to interconnect dozens of processing cores, while the logic circuits and the actual performance need to be verified in specific platform. We proposed and implemented the MACRON platform to provide verification for complicated applications based on NoC architecture by coordinating the software tool and the hardware devices closely. In MACRON, the virtual output queue with look-ahead routing is proposed to reduce the transmission delay through the NoC router. The heterogeneous processing elements: vector processor core, scalar processor core and accelerator core are designed, thus a thorough 'soft' signal processing can be approached. A real-time 4G wireless communication system based on NoC is demonstrated on this MACRON platform.
The proceedings contain 79 papers. The special focus in this conference is on Web Application Modelling, Mobile Web applications, Social Web applications and Semantic Web applications. The topics include: Leveraging s...
ISBN:
(纸本)9783319198897
The proceedings contain 79 papers. The special focus in this conference is on Web Application Modelling, Mobile Web applications, Social Web applications and Semantic Web applications. The topics include: Leveraging social patterns in web application design;liquid stream processing across web browsers and web servers;identifying inter-component control flow in web applications;a JavaScript plugin framework for extensible client and server-side components;a design method for modeling the new interaction style of mobile applications;profiling user activities with minimal traffic traces;user interface adaptation using web augmentation techniques;characterizing user interaction in an online social network for TV fans;a social task routing framework for online communities;transforming collaboration structures into deployable informal processes;quantitative data extraction from twitter to describe events;keyword pattern graph relaxation for selective result space expansion on linked data;approximate continuous query answering over streams and dynamic linked data sets;using caching for local link discovery on large data sets;a quantitative comparison of semantic web page segmentation approaches;a semantic framework for sequential decision making;adaptive faceted search for product comparison on the web of data;REST web service description for graph-based service discovery;distributed service discovery in mobile IoT environments using hierarchical bloom filters;a methodology and tool support for widget-based web application development;framework to support motion-based web interaction techniques and challenges in android wear application development.
暂无评论