parallel and distributed programming is essential for exploiting the processing power of modern computing platforms. However, during the first years of a Computer Science course, students usually learn problem solving...
详细信息
ISBN:
(纸本)9781538655559
parallel and distributed programming is essential for exploiting the processing power of modern computing platforms. However, during the first years of a Computer Science course, students usually learn problem solving techniques, data structures and programming paradigms that are inherently sequential, hindering the transition to parallel architectures. parallel Programming Marathons organized in Brazil are similar to other Programming Competitions around the world and have been used for teaching and stimulating undergraduate and graduate students into learning to "think in parallel" and to develop applications for different parallel architectures, including multicores, clusters and accelerators. This paper presents the structure of this parallel Programming Marathon and an overview of how it supports regional and national contests. Also, this work presents use cases on parallel and distributed computing course from two different Brazilian universities that use a challenge based learning approach and employ marathon problems as course assignments. This approach contributed to increase students' interest towards High Performance computing.
Learning programming in early introductory classes is challenging for first year university students, and introducing parallel programming (PDC) in early classes along with traditional sequential programming is even m...
详细信息
ISBN:
(纸本)9781450363013
Learning programming in early introductory classes is challenging for first year university students, and introducing parallel programming (PDC) in early classes along with traditional sequential programming is even more challenging. Unplugged activities may help alleviate some of the difficulties for students. Unplugged activities have been shown to increase student interest, and to enhance student understanding of CS programming concepts [1]. We have used unplugged activities to teach PDC concepts before introducing parallel programming. Our experiences show that using unplugged activities to introduce the PDC concepts reduce the barrier to learn parallel programming.
This paper presents design, implementation, and performance evaluation results of a parallel particle filter (PF) and a particle flow filter (PFF) using a Graphics Processing Unit (GPU) as a parallelcomputing environ...
详细信息
ISBN:
(纸本)9786058631113
This paper presents design, implementation, and performance evaluation results of a parallel particle filter (PF) and a particle flow filter (PFF) using a Graphics Processing Unit (GPU) as a parallelcomputing environment to speedup the computation. Simulation results from a high dimensional nonlinear filtering problem show that, for the considered example, the parallel PFF implementation is significantly superior to the parallel PF implementation in both estimation accuracy and computational performance. It is demonstrated that using GPU can markedly accelerate both particle filters and particle flow filters through parallelization.
This paper covers the fast solution of large acoustic problems on low-resources parallel platforms. A domain decomposition method is coupled with a dynamic load balancing scheme to efficiently accelerate a geometrical...
详细信息
ISBN:
(纸本)9781479941698
This paper covers the fast solution of large acoustic problems on low-resources parallel platforms. A domain decomposition method is coupled with a dynamic load balancing scheme to efficiently accelerate a geometrical acoustic method. The geometrical method studied implements a beam-tracing method where intersections are handled as in a ray-tracing method. Beyond the distribution of the global processing upon multiple sub-domains, a second parallelization level is operated by means of multi-threading and shared memory mechanisms. Numerical experiments show that this method allows to handle large scale open domains for parallelcomputing purposes on few machines. Urban acoustic pollution arrising from car traffic was simulated on a large model of the Shinjuku district of Tokyo, Japan. The good speed-up results illustrate the performance of this new domain decomposition method.
Advances in data collection techniques and database technologies, such as remote sensing and satellite telemetry, have led to the collection of huge amounts of data distributed among large databases and heterogeneous ...
详细信息
Advances in data collection techniques and database technologies, such as remote sensing and satellite telemetry, have led to the collection of huge amounts of data distributed among large databases and heterogeneous remote sites. Intelligent and automatic processing of the distributed data and efficiently supporting scientific collaboration between both professional and casual users is a highly demanding task. It is also particularly challenging when the system must cope with active data that is processed on-demand. These requirements have generated an urgent need for more advanced software infrastructure to create, maintain, evolve, and federate these active digital libraries of scientific data. Traditional models of distributedcomputing are inadequate to support such complex applications. As part of the ongoing Synthetic Aperture Radar Atlas (SARA) Digital Library project, the research presented here proposes a collaborating mobile agent approach to on-demand processing of remote sensing data. The approach, which is based on autonomous data processing and enables different image analysis algorithms to be wrapped asmobile agents, is expected to be an improvement over the static CGI-based interface and inefficient information discovery that are currently used by SARA. We discuss the agent-based infrastructure we have developed. The SARA system allows users to dispatch their compute-intensive jobs as mobile agents. Since the agents can be programmed to satisfy their specific goals, even if they move and lose contact with their creators they can survive intermittent or unreliable network connections. During their lifetime, the agents can also move themselves autonomously from one server to another for load balancing, and to enhance data locality and fault tolerance. The SARA system relies on XML to support agent communications on clusters of servers. Although the examples presented are based mainly on the SARA system, the proposed techniques are applicable to other
This paper deals with an efficient and robust distributed framework for finite state machine coverage in the field model based testing theory. All final states coverage in large-scale automaton is inherently computing...
详细信息
ISBN:
(纸本)9783030659554;9783030659547
This paper deals with an efficient and robust distributed framework for finite state machine coverage in the field model based testing theory. All final states coverage in large-scale automaton is inherently computing-intensive and memory exhausting with impractical time complexity because of an explosion of the number of states. Thus, it is important to propose a faster solution that reduces the time complexity by exploiting big data concept based on Spark RDD computation. To cope with this situation, we propose a parallel and distributed approach based on Spark in-memory design which exploits A* algorithm for optimal coverage. The experiments performed on multi-node cluster prove that the proposed framework achieves significant gain of the computation time.
The realization of truly heterogeneous database systems is hampered by two principal obstacles. One is the unsuitability of traditional transaction models;this has led to the proposal of a number of new, advanced tran...
详细信息
In this contribution, recent architectural approaches and roofs for local-area and wide-area computing ruing clusters of sewers, workstations, and PCs as multicomputers (i.e. parallelcomputing irt workstation cluster...
详细信息
ISBN:
(纸本)0769509878;0769509886
In this contribution, recent architectural approaches and roofs for local-area and wide-area computing ruing clusters of sewers, workstations, and PCs as multicomputers (i.e. parallelcomputing irt workstation clusters) are classified and described. The goat of such systems is to concentrate available computing resources To solve computing problems. A special focus of this contribution is a description of recent research in the field of cost-efficient parallelcomputing with standard component multicomputer systems, concentrating on locally organized clusters for local-area computing and on wide-area multiclusters (hyperclusters or clusters of clusters) for wide-ar-ea computing. Selected examples are given demonstrating the improvement through high-speed interconnection networks and optimized protocol system architectures in local-area systems and optimized organizations in wide-area systems.
In this paper we describe the implementation of a service oriented environment that enables to couple a parallel application, which performs the 3D linear dynamic structural analysis of high-rise buildings, to a Grid ...
详细信息
ISBN:
(纸本)9783540713500
In this paper we describe the implementation of a service oriented environment that enables to couple a parallel application, which performs the 3D linear dynamic structural analysis of high-rise buildings, to a Grid computing infrastructure. The Grid service, developed under Globus Toolkit 4, exposes the dynamic simulation as a service to the structural scientific community. It employs the GMarte middleware, a metascheduler that enables to perform the computationally intensive simulations on the distributed resources of a Grid-based infrastructure.
Individual-based simulations are an important class of applications where a complex system is modeled as a collection of autonomous entities, each having its own identify and behavior in the underlying simulated space...
详细信息
ISBN:
(纸本)9783642038686
Individual-based simulations are an important class of applications where a complex system is modeled as a collection of autonomous entities, each having its own identify and behavior in the underlying simulated space. The main drawback of such simulations is that they are extremely compute-intensive. We consider the class of individual-based simulations where the simulated entities interact with one another indirectly through the underlying simulated space, significant performance improvement is attainable through parallelism on a network of machines. We present a data distribution and an approach to reduce the communication overhead;which leads to significant performance improvements while preserving the accuracy of the simulation.
暂无评论