We define a new class of predicates called equilevel predicates on a distributive lattice which eases the analysis of parallel algorithms. Many combinatorial problems such as the vertex cover problem, the bipartite ma...
详细信息
Data parallelism is a powerful design paradigm for clustering tasks involving large datasets. However, existing solutions suffer from three problems: (i) using non-identical distribution based partitioning methods may...
详细信息
ISBN:
(纸本)9789819754946;9789819754953
Data parallelism is a powerful design paradigm for clustering tasks involving large datasets. However, existing solutions suffer from three problems: (i) using non-identical distribution based partitioning methods may pose the risk of data skew;(ii) frequent communication among the divided data partitions may lead to potential performance degradation;and (iii) unnecessary full computation results in significant computational overhead. In order to address these issues, we propose a density peak-based statistical parallel clustering algorithm for big data (DPSPC). Our sampling-based approach creates equal-sized data blocks with the same statistical measures of clusters, reducing data skew and eliminating inter-block communication. By sampling a subset of blocks for computation, we avoid full computation. The experimental results suggest that the NMI index of the DPSPC algorithm is generally not 10% lower than that of other distributed density peak clustering algorithms, with runtime about one-tenth and the lowest communication volume.
Fair clustering problems have been paid lots of attention recently. In this paper, we study the k-Center problem under the group fairness and data summarization fairness constraints, denoted as Group Fair k-Center (GF...
详细信息
The growing need to perform Neural network inference with low latency is giving place to a broad spectrum of heterogeneous devices with deep learning capabilities. Therefore, obtaining the best performance from each d...
详细信息
Abnormality detection is a very popular research direction in the medical field;however, most of the existing methods have high requirements on the data type of the dataset and require tedious feature engineering. To ...
详细信息
Concurrent hash tables are one of the fundamental building blocks for cloud computing. In this paper, we introduce lock-free modifications to in-memory bucketized cuckoo hashing. We present a novel concurrent strategy...
详细信息
ISBN:
(纸本)9783031396977;9783031396984
Concurrent hash tables are one of the fundamental building blocks for cloud computing. In this paper, we introduce lock-free modifications to in-memory bucketized cuckoo hashing. We present a novel concurrent strategy in designing a lock-free hash table, called LFBCH, that paves the way towards scalability and high space efficiency. To the best of our knowledge, this is the first attempt to incorporate lock-free technology into in-memory bucketized cuckoo hashing, while still providing worst-case constant-scale lookup time and extremely high load factor. All of the operations over LFBCH, such as get, put, "kick out" and rehash, are guaranteed to be lock-free, without introducing notorious problems like false miss and duplicated key. The experimental results indicate that under mixed workloads with up to 64 threads, the throughput of LFBCH is 14%-360% higher than other popular concurrent hash tables.
The fast distributed garden planning system in colleges and universities is one of the mainstream applications of contemporary computers. The latest traditional methods usually use the GPU full virtualization technolo...
详细信息
Blockchain technology is a very high powered technology that encounters a vast variety of features. For cloud-edge systems to do data integrity audits blockchain is an authentic tool. Using blockchain hospitals can be...
详细信息
With the ever-growing network traffic and the vast amount of abnormal traffic being created, anomaly detection methods have attracted close attention in the cybersecurity domain. Generative adversarial networks (GANs)...
详细信息
Modern Python programs in high-performance computing call into compiled libraries and kernels for performance critical tasks. However, effectively parallelizing these finer-grained, and often dynamic, kernels across m...
详细信息
ISBN:
(纸本)9798350364613;9798350364606
Modern Python programs in high-performance computing call into compiled libraries and kernels for performance critical tasks. However, effectively parallelizing these finer-grained, and often dynamic, kernels across modern heterogeneous platforms remains a challenge. First, we perform an experimental study to examine the impact of Python's Global Interpreter Lock (GIL), and potential speedups under a GIL-less PEP703 future, to guide runtime design. Using our optimized runtime, we explore scheduling tasks with constraints that require resources across multiple, potentially diverse, devices through the introduction of new programming abstractions and runtime mechanisms. We extend an existing Python tasking library, Parla, to augment its performance and add support for such multi -device tasks. Our experimental analysis, using tasks graphs from synthetic and real applications, shows at least a 3x (and up to 6x) performance improvement over its predecessor in scenarios with high GIL contention. When scheduling multi-GPU tasks, we observe an Ax reduction in per-task launching overhead compared to a multi-process system.
暂无评论