The application of GPU to accelerate large-scale smoke simulation is a hot research topic in computational fluid dynamics. However, the current smoke parallelcomputing methods for different scale smoke flow field, th...
详细信息
Desktop grid as a Service is a new concept for implementing high-throughput computing basing on cloud computing and distributedcomputing technologies of the Desktop grid type. This paper demonstrates new advances of ...
详细信息
The proceedings contain 68 papers. The topics discussed include: study on the relation between image recognition and art counsel using deep learning;changes in marketing brought by AI;the impact of capital structure o...
ISBN:
(纸本)9780738125510
The proceedings contain 68 papers. The topics discussed include: study on the relation between image recognition and art counsel using deep learning;changes in marketing brought by AI;the impact of capital structure on the profitability efficiency of ICT firms;a study on the change in service quality in the era of the 4th industrial revolution compared to the existing service quality evaluation;a deep learning approach for human face sentiment classification;development of craft copyright industry using blockchain technology;the design of industrial security tasks and capabilities required in industrial site;research and implementation of image recognition of tea based on deep learning;and the effects of local culture experience on revisit intention.
With the rapid development of the information age, computer technology and network technology are more and more mature, and the application of Internet technology is more and more extensive. On the basis of the gradua...
详细信息
Containers are widely deployed in clouds. There are two common container architectures: operating system-level (OS-level) container and virtual machine-level (VM-level) container. Typical examples are runc and Kata. I...
详细信息
ISBN:
(纸本)9798350386066;9798350386059
Containers are widely deployed in clouds. There are two common container architectures: operating system-level (OS-level) container and virtual machine-level (VM-level) container. Typical examples are runc and Kata. It is well known that VM-level containers provide better isolation than OS-level containers, but at a higher overhead. Although there are quantitative analyses of the performance gap between these two container architectures, they rarely discuss the performance gap under the constrained resources provisioned to containers. Since the high-density deployment of containers is demanding in the cloud, each container is provisioned with limited resources specified by the cgroup mechanism. In this paper, we provide an in-depth analysis of the storage and network (two key aspects) performance differences between runc and Kata under varying resource constraints. We identify configuration implications that are crucial to performance and find that some of them are not exposed by the Kata interfaces. Based on that, we propose a profiling tool to automatically offer configuration suggestions for optimizing container performance. Our evaluation shows that the auto-generated configuration can improve the performance of MySQL by up to 107% in the TPCC benchmark compared with the default Kata setup.
An innovative architecture called execute-order-validate (EOV) has been proposed by Hyperledger Fabric that enables concurrent processing of transactions. However, the architecture suffers from issues such as excessiv...
详细信息
ISBN:
(纸本)9798350386066;9798350386059
An innovative architecture called execute-order-validate (EOV) has been proposed by Hyperledger Fabric that enables concurrent processing of transactions. However, the architecture suffers from issues such as excessive invalid transactions and serialization limitations in scenarios with high transaction conflicts, which restrict its applicability in real-time and high-performance settings. To address the aforementioned limitations, we propose ParFabric to enhance the EOV architecture. Firstly, we analyze four essential characteristics required for the transaction reordering algorithm within this architecture. We propose a heuristic dynamic reordering algorithm to reduce the number of invalid transactions. This is achieved through real-time identification and early abortion of transactions based on weighted pre-ordering and the construction of a transaction conflict graph. Secondly, leveraging the transaction conflict graph, we introduce a novel optimal block packing strategy based on transaction dependencies. This strategy replaces the total transaction order with partial order, enabling parallel validation and commit at the block level, thereby leading to increased system throughput while reducing transaction latency. Experimental results indicate that, ParFabric demonstrates excellent performance in terms of vertical scaling of peers. Additionally, at the same infrastructure cost, ParFabric provides 2.2x and 1.6x higher throughput than FabricPlusPlus and FabricSharp in high-conflict scenarios.
We introduce a distributed memory parallel algorithm for force-directed node embedding that places vertices of a graph into a low-dimensional vector space based on the interplay of attraction among neighboring vertice...
详细信息
ISBN:
(纸本)9798350364613;9798350364606
We introduce a distributed memory parallel algorithm for force-directed node embedding that places vertices of a graph into a low-dimensional vector space based on the interplay of attraction among neighboring vertices and repulsion among distant vertices. We develop our algorithms using two sparse matrix operations, SDDMM and SpMM. We propose a configurable pull -push -based communication strategy that optimizes memory usage and data transfers based on computing resources and asynchronous MPI communication to overlap communication and computation. Our algorithm scales up to 256 nodes on distributed supercomputers by surpassing the performance of state-of-the-art algorithms
Subgraph isomorphism enumerates all embeddings in a data graph that are identical to a query graph. It is a well-known NP-hard problem widely used in various domains, such as bioinformatics, chem-informatics, and soci...
详细信息
ISBN:
(纸本)9798400717932
Subgraph isomorphism enumerates all embeddings in a data graph that are identical to a query graph. It is a well-known NP-hard problem widely used in various domains, such as bioinformatics, chem-informatics, and social network analysis. Recent works are focused on using GPUs for subgraph isomorphism. Due to the massive scale of intermediate results, current GPU implementations face challenges in scaling across multiple nodes due to high communication costs. The computational power of CPUs is not fully utilized in this process. We present a distributed framework for subgraph isomorphism that leverages CPU and GPU heterogeneous computing. It eliminates the intermediate results on GPU and significantly reduces communication overhead during the load-balancing process. The experiments indicate that our algorithm can be extended to multiple nodes with an almost linear efficiency improvement. Furthermore, our method also significantly outperforms other existing works on GPUs. It can reach an improvement of up to 21x compared to the state-of-the-art implementation CuTS in the distributed environment.
Due to the depletion of fossil fuels globally and the rapid advancement of solar power generating technologies, solar power generation has increasingly emerged as a viable method for energy development. The concept of...
详细信息
Synthetic Aperture Radar (SAR) tomography is an advanced technique for monitoring deformations of the Earth's surface. However, the computational complexity of SAR tomography algorithms often restricts their appli...
详细信息
暂无评论