this paper is concerned with a new variant of Traditional Bin Packing (TBP) called Priority-Based Bin Packing with Subset Constraints (PBBP-SC). In a TBP instance, we are given a collection of items { a1, a2, … an}, ...
详细信息
the Gene regulatory network analysis is one of the gene expression data analysis tasks. Gene regulatory network goal is determining the topological order of genes interactions. Moreover, the regulatory network is a vi...
详细信息
Graph clustering is a technique for grouping vertices having similar characteristics into the same cluster. It is widely used to analyze graph data and identify its characteristics. Recently, a large-capacity large-sc...
详细信息
In this paper, we propose a distributed, unordered, label-correcting distance-1 Grundy (vertex) coloring algorithm, namely, Distributed Control (DC) coloring algorithm. Our algorithm eliminates the need for vertex-cen...
详细信息
ISBN:
(纸本)9781728136134
In this paper, we propose a distributed, unordered, label-correcting distance-1 Grundy (vertex) coloring algorithm, namely, Distributed Control (DC) coloring algorithm. Our algorithm eliminates the need for vertex-centric barriers and global synchronization for color refinement, relying only on atomic operations and local termination detection to update vertex color. DC proceeds optimistically, correcting the colors asynchronously as the algorithm progresses and depends on local ordering of tasks to minimize the execution of sub-optimal work. We implement our DC coloring algorithm and the well-known Jones-Plassmann algorithm and compare their performance with 4 different types of standard RMAT graphs and real-world graphs. We show that the elimination of waiting time of global and vertex-centric barriers and investing this time for local ordering leads to improved scaling for graphs with prominent power-law characteristics and densely interconnected local subgraphs.
Hoeffding tree algorithm is a popular online decision tree algorithm capable of learning from huge data streams. the algorithm involves complex time consuming computations in the leaves of the tree for each data insta...
详细信息
Hoeffding tree algorithm is a popular online decision tree algorithm capable of learning from huge data streams. the algorithm involves complex time consuming computations in the leaves of the tree for each data instance. these computations involve a lot of parallelisms which can be exploited and implemented in a field programmable gate array to achieve speedup. this paper presents a hardware accelerator for Hoeffding tree algorithm with adaptive naive bayes predictor in the leaves. the proposed system is capable of accelerating data streams with both nominal and numeric attributes using minimum hardware resources for huge datasets. It is implemented on a Xilinx VC707 board based on Virtex-7 XC7VX485T field programmable gate array. the implemented system is about 9x faster than StreamDm(C++), a well known reference software implementation for the standard forest cover type dataset.
IoT, being a field of great interest and importance for the coming generations, involves certain challenging and improving aspects for the IoT application developers and researchers to work upon. A wireless sensor mes...
详细信息
Data generation, collection, and processing is an important workload of modern computer architectures. Stream or high-intensity data flow applications are commonly employed in extracting and interpreting the informati...
详细信息
the proceedings contain 66papers. the special focus in this conference is on algorithms and architectures for parallelprocessing. the topics include: Pipelining computation and optimization strategies for scaling GRO...
ISBN:
(纸本)9783319654812
the proceedings contain 66papers. the special focus in this conference is on algorithms and architectures for parallelprocessing. the topics include: Pipelining computation and optimization strategies for scaling GROMACS on the sunway many-core processor;exploring FPGA-GPU heterogeneous architecture for ADAS;new huge page allocator with main memory compression;an fpga-based real-time moving object tracking approach;automatic acceleration of stencil codes in android devices;optimizing concurrent evacuation transfers for geo-distributed datacenters in SDN;energy-balanced and depth-controlled routing protocol for underwater wireless sensor networks;on the energy efficiency of sleeping and rate adaptation for network devices;private and efficient set intersection protocol for big data analytics;a topology-aware framework for graph traversals;adaptive traffic signal control with network-wide coordination;a novel parallel dual-character string matching algorithm on graphical processing units;distributed nonnegative matrix factorization with HALS algorithm on mapreduce;GPU-accelerated block-max query processing;KD-tree and healpix-based distributed cone search indexing system for multi-band astronomical catalogs;an out-of-core branch and bound method for solving the 0-1 knapsack problem on a GPU;the curve boundary design and performance analysis for DGM based on openFOAM;leakage-resilient password-based authenticated key exchange;secure encrypted data deduplication with ownership proof and user revocation;optimally selecting the timing of zero-day attack via spatial evolutionary game and performance analysis of a ternary optical computer based on M/M/1 queueing system.
Due to the ability of expressively representing narrative structures, proposition-aware learning models in text have been drawing more and more attentions in information extraction. Following this trend, recent studie...
详细信息
Triangle counting is a fundamental graph analytic operation that is used extensively in network science and graph mining. As the size of the graphs that needs to be analyzed continues to grow, there is a requirement i...
详细信息
ISBN:
(纸本)9781450362955
Triangle counting is a fundamental graph analytic operation that is used extensively in network science and graph mining. As the size of the graphs that needs to be analyzed continues to grow, there is a requirement in developing scalable algorithms for distributed-memory parallel systems. To this end, we present a distributedmemory triangle counting algorithm, which uses a 2D cyclic decomposition to balance the computations and reduce the communication overheads. the algorithm structures its communication and computational steps such that it reduces its memory overhead and includes key optimizations that leverage the sparsity of the graph and the way the computations are structured. Experiments on synthetic and real-world graphs show that our algorithm obtains an average relative speedup range between 3.24 to 7.22 out of 10.56 across the datasets using 169 MPI ranks over the performance achieved by 16 MPI ranks. Moreover, we obtain an average speedup of 10.2 times on comparison with previously developed distributed-memory parallelalgorithms.
暂无评论