Task-based execution frameworks, such as parallel programming libraries, computational workflow systems, and function-as-a-service platforms, enable the composition of distinct tasks into a single, unified application...
详细信息
ISBN:
(数字)9798350365610
ISBN:
(纸本)9798350365627
Task-based execution frameworks, such as parallel programming libraries, computational workflow systems, and function-as-a-service platforms, enable the composition of distinct tasks into a single, unified application designed to achieve a computational goal and abstract the parallel and distributed execution of those tasks on arbitrary hardware. Research into these task executors has accelerated as computational sciences increasingly need to take advantage of parallel compute and/or heterogeneous hardware. However, the lack of evaluation standards makes it challenging to compare and contrast novel systems against existing implementations. Here, we introduce TaPS, the Task Performance Suite, to support continued research in distributed task executor frameworks. TaPS provides (1) a unified, modular interface for writing and evaluating applications using arbitrary execution frameworks and data management systems and (2) an initial set of reference synthetic and real-world science applications. We discuss how the design of TaPS supports the reliable evaluation of frameworks and demonstrate TaPS through a survey of benchmarks using the provided reference applications.
This paper introduces an aspect-oriented library aimed to support efficient execution of Java applications on multi-core systems. The library is coded in AspectJ and provides a set of parallel programming abstractions...
详细信息
ISBN:
(纸本)9780769551173
This paper introduces an aspect-oriented library aimed to support efficient execution of Java applications on multi-core systems. The library is coded in AspectJ and provides a set of parallel programming abstractions that mimics the OpenMP standard. The library supports the migration of sequential Java codes to multi-core machines with minor changes to the base code, intrinsically supports the sequential semantics of OpenMP and provides improved integration with object-oriented mechanisms. The aspect-oriented nature of library enables the encapsulation of parallelism-related code into well-defined modules. The approach makes the parallelisation and the maintenance of large-scale Java applications more manageable. Furthermore, the library can be used with plain Java annotations and can be easily extended with application-specific mechanisms in order to tune application performance. The library has a competitive performance, in comparison with traditional parallel programming in Java, and enhances programmability, since it allows an independent development of parallelism-related code.
Nowadays, in the different areas of knowledge, there is an increase in the amount of information needed to process, reason why many solutions have been generated for the implementation of high-performance computing, t...
详细信息
ISBN:
(数字)9798350379945
ISBN:
(纸本)9798350379952
Nowadays, in the different areas of knowledge, there is an increase in the amount of information needed to process, reason why many solutions have been generated for the implementation of high-performance computing, these available solutions depend on many factors, from the use of available different architectures. This research work presents a method for the configuration of a low-cost solution for the implementation of asolution based on HPC, using the OpenMP and OpenMPI libraries. The processes necessary for the implementation of programs to exploit these two libraries that are used in the application of parallel programming are described. As a result, the study presents the application of the methodology using file compression, which was implemented Huffman's algorithm, the results demonstrate the optimization in parallel work working with OpenMP and OpenMPI libraries, which allows working with all processors available in the different computer architectures that are available. The study indicates the mode of use and application of the methodology described.
The ubiquity of networking infrastructure in modern life necessitates scrutiny into networking fundamentals to ensure the safety and security of that infrastructure. The formalization of concurrent algorithms, a corne...
详细信息
parallel programming is often regarded as one of the hardest programming disciplines. On the one hand, parallel programs are notoriously prone to concurrency errors;and, while trying to avoid such errors, achieving pr...
详细信息
ISBN:
(纸本)9783642400476
parallel programming is often regarded as one of the hardest programming disciplines. On the one hand, parallel programs are notoriously prone to concurrency errors;and, while trying to avoid such errors, achieving program performance becomes a significant challenge. As a result of the multicore revolution, parallel programming has however ceased to be a task for domain experts only. And for this reason, a large variety of languages and libraries have been proposed that promise to ease this task. This paper presents a study to investigate whether such approaches succeed in closing the gap between domain experts and mainstream developers. Four approaches are studied: Chapel, Cilk, Go, and Threading Building Blocks (TBB). Each approach is used to implement a suite of benchmark programs, which are then reviewed by notable experts in the language. By comparing original and revised versions with respect to source code size, coding time, execution time, and speedup, we gain insights into the importance of expert knowledge when using modern parallel programming approaches.
This paper addresses the problem of providing a novel approach to sourcing significant training data for LLMs focused on science and engineering. In particular, a crucial challenge is sourcing parallel scientific code...
详细信息
ISBN:
(数字)9798350383454
ISBN:
(纸本)9798350383461
This paper addresses the problem of providing a novel approach to sourcing significant training data for LLMs focused on science and engineering. In particular, a crucial challenge is sourcing parallel scientific codes in the ranges of millions to billions of codes. To tackle this problem, we propose an automated pipeline framework called LASSI, designed to translate between parallel programming languages by bootstrapping existing closed- or open-source LLMs. LASSI incorporates autonomous enhancement through self-correcting loops where errors encountered during the compilation and execution of generated code are fed back to the LLM through guided prompting for debugging and refactoring. We highlight the bidirectional translation of existing GPU benchmarks between OpenMP target offload and CUDA to validate LASSI. The results of evaluating LASSI with different application codes across four LLMs demonstrate the effectiveness of LASSI for generating executable parallel codes, with 80% of OpenMP to CUDA translations and 85% of CUDA to OpenMP translations producing the expected output. We also observe approximately 78% of OpenMP to CUDA translations and 62% of CUDA to OpenMP translations execute within 10% of or at a faster runtime than the original benchmark code in the same language.
This sequence alignment stands as a pivotal method in the realm of bioinformatics, meticulously employed to ascertain the degree of similarity between diverse sequences such as DNA, RNA, and amino acids. Among the myr...
详细信息
ISBN:
(数字)9798350383027
ISBN:
(纸本)9798350383034
This sequence alignment stands as a pivotal method in the realm of bioinformatics, meticulously employed to ascertain the degree of similarity between diverse sequences such as DNA, RNA, and amino acids. Among the myriad techniques utilized in tackling sequence alignment challenges, the Longest Common Subsequence (LCS) takes center stage. This paper delves into the realm of enhancing LCS efficiency through the implementation of thread parallelization. Drawing inspiration from the seminal work of Wagner and Fischer in 1974, both sequential and parallel techniques exhibit remarkable consistency in identifying the maximum length of LCS. However, this research goes a step further by introducing thread parallelization, which leverages multithreading, resource synchronization, and task decomposition within the domain of parallel programming. The meticulous integration of these advanced techniques results in a notable enhancement in terms of running time compared to the conventional iterative sequential approach. The experimentation and evaluation of both sequential and parallel approaches were conducted using Netbeans, a robust Integrated Development Environment (IDE) tailored for the Java programming Language. The findings underscore the superior performance of the thread parallelization strategy, establishing its prowess in optimizing the execution time of LCS problem resolution.
Nowadays, latency-critical, high-performance applications are parallelized even on power-constrained client systems to improve performance. However, an important scenario of fine-grained tasking on simultaneous multit...
详细信息
The Touch programming language for swarm intelligent building application (APP) development effectively reduces the development difficulty and user programming threshold, making the building more intelligent. However,...
详细信息
ISBN:
(数字)9798350387780
ISBN:
(纸本)9798350387797
The Touch programming language for swarm intelligent building application (APP) development effectively reduces the development difficulty and user programming threshold, making the building more intelligent. However, the features of Touch language such as intuitive modeling of building elements, parallel programming, and the implicit specification of internode communication lead to great challenges in the compilation process of Touch language to the low-level executable object code of swarm intelligent buildings, and the APP development efficiency is not high. This paper proposes a code conversion method from Touch to C language and its supporting tools, designs code conversion algorithms for Touch language elements used to describe distributed building physical objects and parallel computing mode, which supports the automatic conversion of high-level Touch language, which is user-oriented and shielded from the details of the underlying interactions, into the C language code for underlying execution, thus realizing an integrated process from high-level APP development to low-level hardware platform execution and improving the APP development efficiency.
This paper focuses on developing algorithms for parallel determinant processing, a crucial task in linear algebra and computational mathematics. The aim is to improve efficiency in high-performance computing environme...
详细信息
ISBN:
(数字)9798350387568
ISBN:
(纸本)9798350387575
This paper focuses on developing algorithms for parallel determinant processing, a crucial task in linear algebra and computational mathematics. The aim is to improve efficiency in high-performance computing environments by designing and analyzing algorithms that use parallel processing to expedite determinant computation for various matrices range. The research explores methods like Laplace expansion, LU decomposition, eigenvalue decomposition, Gaussian elimination, and cofactor expansion, assessing their efficiency, scalability, and applicability in different computational environments. The study employs advanced parallel programming techniques and architectures, utilizing multi-core processors with the focus aim into utilization of Chio’s method of rectangular determinants processing in parallel etc. The research also investigates the mathematical underpinnings of parallel determinant algorithms, addressing challenges like load balancing, data distribution, and synchronization. The results show significant improvements in determinant calculations efficiency, reducing computation times for large matrices.
暂无评论