检索结果-内蒙古大学图书馆

parallel programming in mobile devices with FancyJCL

JOURNAL OF SUPERCOMPUTING 2024年第9期80卷 12891-12909页

作者： Afonso, Sergio Gomez-Cardenes, Oscar Exposito, Paula Blanco, Vicente Almeida, Francisco Univ La Laguna ULL Comp Sci & Syst Dept San Francisco de Paula s n San Cristobal la Laguna 38270 Spain

Mobile devices and handheld systems, such as the smartphones and tablets universally extended, are becoming increasingly powerful. Their basic hardware configuration is usually state-of-the-art heterogeneous architectures consisting of multi-core processors and some kind of accelerator such as GPUs or DSPs. Specific code adapted to the architecture is mandatory if high-performance computation is required and low-level libraries and parallelism are needed, which constitutes an important barrier for the usual developer in such devices. In this context, we propose the FancyJCL framework. It provides a high-level abstraction layer that hides implementation details and allows to develop parallel programs for mobile devices. The target platform for FancyJCL is mainly Android and Java developers due to their high market penetration. A very simple, seemingly sequential encoding results in parallel efficient OpenCL code. FancyJCL is itself based on the Fancier framework, which enables optimal memory management across memory spaces on unified memory systems. Benchmarks of FancyJCL code developed for a wide range of image processing algorithms show good performance with low development effort.

关键词： Application programming interfaces Hardware acceleration Heterogeneous systems Image processing Mobile computing parallel programming Performance analysis

来源：评论

学校读者我要写书评

暂无评论

Evaluating ChatGPT's strengths and limitations for data race detection in parallel programming via prompt engineering

引用

JOURNAL OF SUPERCOMPUTING 2025年第6期81卷 1-27页

作者： Alsofyani, May Wang, Liqiang Univ Cent Florida Dept Comp Sci Orlando FL 32816 USA

Large Language Models have significantly advanced software engineering, enabling tasks like code comprehension and fault detection. However, their ability to detect complex bugs, such as data races in parallel programming, remains uncertain. Fault detection in parallel programming (Pthreads) requires a deep understanding of thread-based logic, as data races occur when threads access shared data concurrently without proper synchronization. This paper explores ChatGPT's potential in Pthreads fault detection by addressing three questions: (1) Can ChatGPT effectively debug parallel programming threads? (2) How can dialogue assist with the detection of faults? (3) How can prompt engineering help to improve ChatGPT's fault detection performance?. We examine advanced prompt engineering techniques, such as Zero-Shot, Few-Shot, Chain-of-Thought, and Retrieval-Augmented Generation prompts. Additionally, we introduce three hybrid prompting techniques to enhance performance, including Chain-of-Thought with Few-Shot Prompting, Retrieval-Augmented Generation with Few-Shot Prompting, and Prompt Chaining with Few-Shot Prompting, while evaluating ChatGPT's strengths and limitations for data race detection.

关键词： large language model LLM4SE Fault detection Data race parallel programming

来源：评论

学校读者我要写书评

暂无评论

Investigating the Progression of the Mental Models Formed by Programmers Learning parallel programming

引用

ACM TRANSACTIONS ON COMPUTING EDUCATION 2024年第1期25卷 1-31页

作者： Bidlake, Leah Aubanel, Eric Voyer, Daniel Univ New Brunswick Fac Comp Sci Fredericton NB Canada Univ New Brunswick Dept Psychol Fredericton NB Canada

Research on mental model representations developed by programmers during parallel program comprehension is important for informing and advancing teaching methods including model-based learning and visualizations. The goals of the research presented here were to determine: how the mental models of programmers change and develop as they learn parallel programming, the quality of their mental models after learning parallel programming, and what type of information is part of their mental models when examining code for the presence of data races. Participants were experienced C programmers and included both university students and professionals. The mental models of participants were analyzed by having them perform a code tracing task where they externalized their mental models by drawing diagrams while tracing the execution of parallel code. We also analyzed their mental models by having participants determine the presence of data races in parallel code and then answer multiple choice and open-ended questions related to the code. The results presented in this article indicate that programmers' mental models progress from a weaker execution model and a stronger situation model before learning parallel programming, to a stronger execution model and a weaker situation model after learning parallel programming. The thematic analysis of the openended responses that indicate what components of code programmers used to determine whether or not a data race was present provides insight into the topics that should be emphasized when teaching parallel programming.

关键词： computer science education mental representations progression of mental models psychology of programming parallel programming

来源：评论

学校读者我要写书评

暂无评论

parallel programming with Pictures: Choosing Your Own Adventure

Parallel Programming with Pictures: Choosing Your Own Advent...

引用

37th IEEE International parallel and Distributed Processing Symposium (IPDPS)

作者： Feng, W. Davis-Wallace, L. Virginia Tech Dept Comp Sci Blacksburg VA 24061 USA

ISBN: (纸本)9798350311990

Given the ubiquity of parallel computing hardware, we introduced parallelprogramming with pictures to the block-based Snap! environment and called it pSnap!, short for parallel Snap! We then created an accessible curriculum for students of all ages to learn how to program serially and then how to program with explicit parallelism. This paper presents a new and innovative extension to our curriculum on parallel programming with pSnap!, one that broadens its appeal to the masses by teaching the application of parallel programming as a "choose your own learning adventure" activity, inspired by the Choose Your Own Adventure book series of the 1980s and 1990s. Specifically, after students learn the basics of parallel programming with pictures, they are ready to choose their next learning adventure, which applies their newfound parallel programming skills to create a video game of their choice, i.e., Missile Command or Do You Want to Build a Snowman?

关键词： block-based programming curriculum computer science education parallel programming pictures Scratch Snap pSnap

来源：评论

学校读者我要写书评

暂无评论

parallel programming Approaches to Optimize Performance and Energy Consumption on Heterogeneous Computing Systems

Parallel Programming Approaches to Optimize Performance and ...

引用

作者： Bauer, Brian Harvard University

学位级别：A.L.M., Master of Liberal Arts

parallel programming offers the ability to simultaneously improve the performance and reduce the energy consumption of software running on heterogeneous computing systems. Software developers have long preferred to avoid parallel programming, if possible, for reasons such as perceived difficulty, lack of portability between systems, and the pace of improvement in computer hardware. However, generational changes in computer hardware are now focused on specialized components and increased computational cores, and the continued evolution of these systems places increased emphasis on achieving improvements via the use of these components. This thesis investigates parallel programming techniques that make use of components common to modern heterogeneous systems, and proposes that the difficulty and lack of portability need not be barriers to large improvements. Using a variety of heterogeneous systems, algorithms were implemented and then transformed using multiple cores, SIMD execution units, and GPUs. Reductions in execution time ranging from 71-94% and energy consumption of 76-98% were observed, demonstrating the effectiveness of using specialized components for improved performance and reduced energy consumption.

关键词： Energy consumption GPU programming Heterogeneous systems parallel programming SIMD programming Software engineering

来源：评论

学校读者我要写书评

暂无评论

A Real-Time parallel programming Approach for Rust

Ada User Journal

引用

Ada User Journal 2023年第3期44卷 232-236页

作者： Carvalho, Tiago Silva, Hugo Pinho, Luís Miguel Instituto Superior de Engenharia do Porto Porto Portugal

The development of real-time systems is one of the areas with the highest relevance in computer science, and the number of critical systems has increased significantly. These systems considers several applications running concurrently, and inside each of those applications code might be parallelized to improve their performance and control the priority of each parallelizable task. Several efforts have been done in different programming languages to provide real-time systems with parallel programming models, whether by code extensions or annotations, or with specific features in the actual language core. Rust is a recent programming language that have quickly grown in potential and already with a large community, being continuously formed. The language is a good candidate in terms of both real-time systems and parallel programming. However, there is a lack of work that joins these two important concepts in an efficient and reliable way. In this work we aim to design and provide a framework for real-time parallel systems. We conduct a study over the existing work in other programming languages and aim to bring their advantages and useful programming models into the Rust programming language, in the format of a real-time parallel programming library. © 2023, Ada-Europe. All rights reserved.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

Efficient Power Flow Algorithm for Unbalanced Three-Phase Distribution Networks using Recursion and parallel programming 22

Efficient Power Flow Algorithm for Unbalanced Three-Phase Di...

引用

IEEE 22nd Mediterranean Electrotechnical Conference (MELECON)

作者： de Souza, Mariana Reiz, Cleberton Leite, Jonatas B. INESC Technol & Sci INESC TEC Ctr Power & Energy Syst CPES Porto Portugal Sao Paulo State Univ UNESP Dept Elect Engn DEE Ilha Solteira Brazil

ISBN: (纸本)9798350387032;9798350387025

In this work, the implementation of an efficient multi-threading algorithm for calculating the power flow in electricity distribution networks is carried out using recursion and parallel programming. With the integration of renewable energy, energy storage systems and distributed generation, the ability of power flow simulations becomes a crucial factor in finding the best solution in the shortest possible time. We propose the direct use of graph theory to represent distribution network topologies. In this data structure, the traversal algorithms are inherently recursive, thus enabling the development of algorithms with parallel programming to obtain the power flow calculation faster and more efficiently. Results under a 809 buses test system show that the implementation provides additional computation efficiency of 32% with recursion techniques and 27% with parallel programming, due the expense of threads' allocation the combined gain reaches 50%.

关键词： Distribution Network Power Flow Graph Theory Recursion parallel programming

来源：评论

学校读者我要写书评

暂无评论

Integrating interactive performance analysis in Jupyter Notebooks for parallel programming education

Integrating interactive performance analysis in Jupyter Note...

引用

1st International Conference on Smart Energy Systems and Artificial Intelligence (SESAI)

作者： Oden, Lena Noelp, Klaus Brauner, Philipp Univ Hagen Comp Engn Hagen Germany Rhein Westfal TH Aachen Human Comp Interact Ctr Aachen Germany

ISBN: (纸本)9798350364613;9798350364606

Understanding the performance behavior of parallel applications is important in many ways, but doing so is not easy. Most open source analysis tools are written for the command line. We are building on these proven tools to provide an interactive performance analysis experience within Jupyter Notebooks when developing parallel code with MPI, OpenMP, or both. Our solution makes it possible to measure the execution time, perform profiling and tracing, and visualize the results within the notebooks. For ease of use, it provides both a graphical JupyterLab extension and a C++ API. The JupyterLab extension shows a dialog where the user can select the type of analysis and its parameters. Internally, this tool uses Score -P, Scalasca, and Cube to generate profiling and tracing data. This tight integration gives students easy access to profiling tools and helps them better understand concepts such as benchmarking, scalability and performance bottlenecks. In addition to the technical development, the article presents hands-on exercises from our well-established parallel programming course. We conclude with a qualitative and quantitative evaluation with 19 students, which shows a positive effect of the tools on the students' perceived competence.

关键词： Jupyter parallel programming performance analysis interactive programming high perfonnance computing

来源：评论

学校读者我要写书评

暂无评论

Solving the Crossword Generation Problem Using parallel programming in Moderns Browsers: A Case Study 9th

Solving the Crossword Generation Problem Using Parallel Prog...

引用

9th International Congress on Information and Communication Technology (ICICT)

作者： Rodrigues, Daniel T. Bianchini, Calebe P. Univ Prebiteriana Mackenzie Comp & Informat Dept Sao Paulo Brazil

ISBN: (纸本)9789819735556;9789819735563

The use of modern browsers reveals itself more and more essential to the world. Features like Web Workers are becoming more adopted over the most used browsers of the Internet, enabling performance enhancements in web applications. As consequence, execution of tasks with higher computational demand inside the browser. Technique of task parallelization using Web Workers, presenting as study case an algorithm of crossword generation, being executed in a browser context. The results show even superlinear speedups for a parallel version of the algorithm.

关键词： parallel programming Rust Web Workers WebAssembly

来源：评论

学校读者我要写书评

暂无评论

Teaching parallel programming on the CPU Based on Matrix Multiplication Using MKL, OpenMP and SYCL Libraries 17

Teaching Parallel Programming on the CPU Based on Matrix Mul...

引用

17th International Conference on Computer Supported Education, CSEDU 2025

作者： Bober, Emilia Bylina, Beata Maria Curie-Sklodowska University Pl. M. Curie-Sklodowskiej 5 Lublin20-031 Poland

ISBN: (纸本)9789897587467

Matrix multiplication is a fundamental operation in engineering computations. With the widespread use of modern multi-core processors, this operation can be significantly accelerated through parallel programming. Consequently, it is essential to acquaint computer science students with parallel programming techniques. Matrix multiplication is well known to students, while additionally offering numerous possibilities for par-allelisation. This makes it an ideal example for introducing parallel programming while highlighting key considerations such as execution time, accuracy of the calculations, code complexity and the impact of the hardware architecture on the results obtained. Students can implement and test such software themselves. In this paper, the performance and accuracy of the MKL, SYCL and OpenMP libraries are investigated using matrix multiplication of different sizes as an example. OpenMP is discussed at some universities, so it may already be familiar to students, whereas SYCL is a newer and less commonly used standard but it offers great possibilities. Square matrices with double-precision elements and dimensions of 4096×4096, 8192×8192, and 16384×16384 were selected for testing. The experiments revealed significant computational speed-ups compared to the sequential algorithm, with no loss of accuracy. SYCL was found to be about 10 times faster than OpenMP, but the calculations performed with MKL are by far the fastest. Additionally, the results indicated that doubling the number of threads does not directly correlate to a twofold increase in execution speed, and doubling the matrix size in each dimension leads to an approximately tenfold increase in execution time. Copyright © 2025 by SCITEPRESS - Science and Technology Publications, Lda.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：