Data-glyphs are a common, useful metaphor for visualizing multidimensional data. However, to represent a larger number of glyphs requires an increasing amount of computational power to derive the data-dependent geomet...
详细信息
Belief propagation (BP) is a message passing algorithm that infers over probabilistic graphical models. Its main computational workload, messages update, is suitable for GPU9;s massively parallel architecture. Howe...
详细信息
ISBN:
(纸本)9783030602451;9783030602444
Belief propagation (BP) is a message passing algorithm that infers over probabilistic graphical models. Its main computational workload, messages update, is suitable for GPU's massively parallel architecture. However, the efficiency of fully parallel BP is low, and traditional algorithms implemented on GPUs occupy amount of computing and memory resources. In this paper, we propose several GPU-friendly BP algorithms optimized by coloring. Color Wave (CW) algorithm performs multi-step coloring on residuals of non-convergent vertices to quickly obtain multiple disjoint partitions and vertices withthe largest residuals in each partition, and then updates batches of messages in a fixed order. these operations are all suitable for parallelization and require little additional memory. To save time in each iteration, the Color Extract (CE) algorithm only update messages on edges withthe largest residuals among all adjacent edges. the Random Drop (RD) algorithm steadily increases the convergence degree by progressively reducing the messages update ratio of non-convergent edges. the experiments on different GPUs show that our algorithms perform well throughout the calculation process. Compared with state-of-the-art algorithms, CW algorithm converged most of the messages in previous iterations. the convergence degree of CE is higher than all other algorithms in most calculation processes. RD converges fast and always has a high degree of convergence.
this paper proposes a binary code sequence based tracking algorithm in wireless sensor network. the proposed algorithm can release the influence of sensed data on localization results via building the map between targ...
详细信息
ISBN:
(纸本)9783030389611;9783030389604
this paper proposes a binary code sequence based tracking algorithm in wireless sensor network. the proposed algorithm can release the influence of sensed data on localization results via building the map between target's occurrence region and a binary code sequence. To solve the ambiguity problem existing in occurrence region determination, the paper further gives a Voronoi diagram based location refinement algorithm. the simulation results show the tracking results under difference trajectories.
Finding the centrality measures of nodes in a graph is a problem of fundamental importance due to various applications from social networks, biological networks, and transportation networks. Given the large size of su...
详细信息
We present a way to implement term rewriting on a GPU. We do this by letting the GPU repeatedly perform a massively parallel evaluation of all subterms. We find that if the term rewrite systems exhibit sufficient inte...
详细信息
Recommender systems have nowadays been widely used in a variety of applications such as Amazon and Ebay. Traditional recommendation techniques mainly focus on recommendation accuracy only. In reality, other metrics su...
详细信息
ISBN:
(纸本)9783030602390;9783030602383
Recommender systems have nowadays been widely used in a variety of applications such as Amazon and Ebay. Traditional recommendation techniques mainly focus on recommendation accuracy only. In reality, other metrics such as diversity and novelty also play a key role for modern recommendation systems. Although some works based on multi-objective evolutionary algorithm have been proposed for multi-objective recommendation, they are usually very time-consuming because of the large data size of the RSs and the long-term evolution iterations and hence it greatly limits their application in practice. To address these shortcomings, this paper first designs a multi-objective recommendation system, taking into account diversity and novelty as well as accuracy. then, a novel parallel multi-objective evolutionary algorithm called CC-MOEA is proposed to optimize these conflicting metrics. CC-MOEA is devised grounded on NSGA-II and a cooperative coevolutionary island model, and a parallel global non-dominated selection method is introduced to reduce the runtime of finding the global optimal individuals. Furthermore, a new initialization method and a crossover operator are specifically designed. the experimental results reveal that CC-MOEA outperforms some state-of-the-art algorithms in terms of hypervolume and runtime.
the performance of parallelalgorithms is often inconsistent withtheir preliminary theoretical analyses. Indeed, the difference is increasing between the ability to theoretically predict the performance of a parallel...
详细信息
GPU heterogeneous cluster is extensively utilized in the field of data analysis and processing. Nevertheless, research and studies on collaborative activity model in computing elements of GPU heterogeneous clusters ar...
详细信息
the B+-tree is an important index in the fields of data warehousing and database management systems. Withthe development of new hardware technologies, the B+-tree needs to be revisited to fully take advantage of hard...
详细信息
ISBN:
(纸本)9783030602451;9783030602444
the B+-tree is an important index in the fields of data warehousing and database management systems. Withthe development of new hardware technologies, the B+-tree needs to be revisited to fully take advantage of hardware resources. In this paper, we focus on optimization techniques to increase the searching performance of B+-trees on the coupled CPU-GPU architecture. First, we propose a hierarchical searching approach on the single coupled GPU to efficiently deal with leaf nodes of B+-trees. It adopts a flexible strategy to determine the number of work items in a work group to search one key in order to reduce irregular memory accesses and divergent branches in the work group. Second, we present a co-processing pipeline method on the coupled architecture. the CPU and the integrated GPU process the sorting and searching tasks simultaneously to hide sorting and partial searching latencies. A distribution model is designed to support the workload balance strategy based on real-time performance. Our performance study shows that the hierarchical searching scheme provides an improvement up to 36% on the GPU compared to the baseline algorithm with fixed number of work items and the co-processing pipeline method further increases the throughput by a factor of 1.8. To the best of our knowledge, this paper is the first study to consider boththe CPU and the coupled GPU to optimize B+-trees searches.
the recent prevalence of positioning sensors and mobile devices generates a massive amount of spatial-temporal data from moving objects in real-time. As one of the fundamental processes in data analysis, the clusterin...
详细信息
ISBN:
(纸本)9783030602451;9783030602444
the recent prevalence of positioning sensors and mobile devices generates a massive amount of spatial-temporal data from moving objects in real-time. As one of the fundamental processes in data analysis, the clustering on spatial-temporal data creates various applications, like event detection and travel pattern extraction. However, most of the existing works only focus on the offline scenario, which is not applicable to online time-sensitive applications due to their low efficiency and ignorance of temporal features. In this paper, we propose a distributed streaming framework for spatial-temporal data clustering, which accepts various clustering algorithms while ensuring low resource consumption and result correctness. the framework includes a dynamic partitioning strategy for continuous load-balancing and a cluster-merging algorithm based on convex hulls [10], which guarantees the result correctness. Extensive experiments on real dataset prove the effectiveness of our proposed framework and its advantage over existing solutions.
暂无评论