Multicore hardware is ubiquitous, but billions of lines of code in performance-critical commodity software are still sequential. Although parallel libraries, design patterns, and best practice guidelines are available...
详细信息
ISBN:
(纸本)9781467376846
Multicore hardware is ubiquitous, but billions of lines of code in performance-critical commodity software are still sequential. Although parallel libraries, design patterns, and best practice guidelines are available, thinking parallel is still a big challenge for many software engineers. In this paper we present a case study on parallelizing commodity software using a commercial real-time audio application with over 700,000 lines of code. In contrast to best practice guidelines, our goal is to investigate what parallelization strategy can effectively be used in data stream-intensive applications. Performing an in-depth analysis of the software architecture and its run-time performance, we locate parallelization potential and propose three different parallelization strategies. We evaluate them with respect to their parallel performance impact. Regarding the application's intrinsic real-time requirement and a very short audio cycle turnaround time, a busy-waiting strategy offers the best audio performance of 327 mu s per cycle on an eight-core machine. With an efficiency of 99% this is close to the optimal schedule.
Performance analysis through visualization techniques usually suffers semantic limitations due to the size of parallel applications. Most performance visualization tools rely on data aggregation to work at scale, with...
详细信息
ISBN:
(纸本)9781479936069
Performance analysis through visualization techniques usually suffers semantic limitations due to the size of parallel applications. Most performance visualization tools rely on data aggregation to work at scale, without any attempt to evaluate the loss of information caused by such aggregations. This paper proposes a technique to evaluate the quality of aggregated representations - using measures from information theory - and to optimize such measures in order to build consistent multiresolution representations of large execution traces.
In this paper we try to describe how the Modular Interactive Tiles System (MITS) can be a valuable tool for introducing students to interactive parallel and distributed processing programming. This is done by providin...
详细信息
ISBN:
(纸本)9784990288051
In this paper we try to describe how the Modular Interactive Tiles System (MITS) can be a valuable tool for introducing students to interactive parallel and distributed processing programming. This is done by providing an educational hands-on tool that allows a change of representation of the abstract problems related to designing interactive parallel and distributedsystems. Indeed, MITS seems to bring a series of goals into the education, such as parallel programming, distributedness, communication protocols, master dependency, software behavioral models, adaptive interactivity, feedback, connectivity, topology, island modeling, user and multiuser interaction, which can hardly be found in other tools. Finally, we introduce the system of modular interactive tiles as a tool for easy, fast, and flexible hands-on exploration of these issues, and through examples show how to implement interactive parallel and distributed processing with different software behavioural models such as open loop, randomness based, rule based, user interaction based, AI and ALife based software.
This article deals with proposals of distributed and parallel architectures for extension of Behavioural Animation simulation systems. Each proposal describes architecture in accordance with a way of communication amo...
详细信息
ISBN:
(纸本)0780394747
This article deals with proposals of distributed and parallel architectures for extension of Behavioural Animation simulation systems. Each proposal describes architecture in accordance with a way of communication among simulation processes on the software solution level, together with advantages and disadvantages from the Behavioural Animation point of view.
The analysis of time and realizability of parallel solving complex problems on distributed computer systems (CS) is presented. The derivation of equation for calculating the efficiency indices is based on the assumpti...
详细信息
ISBN:
(纸本)0769525547
The analysis of time and realizability of parallel solving complex problems on distributed computer systems (CS) is presented. The derivation of equation for calculating the efficiency indices is based on the assumption that the time of problem solution on CS is a function of time of problem solution on one elementary machine, and the function has a finite number of discontinuities. The discontinuities have the probabilistic character and correspond to the CS failures that require reconfiguration of the CS (structure readjustability with regard to working machine only). A notion of complex CS reconfiguration is introduced and the reconfiguration is investigated. A set of integral equations for calculating the function of realizability of problem solution on distributed CSs is derived. A parallel algorithm for its computing is described.
With the rapid development of information technology, people have put forward higher requirements for the audio-visual experience and usage functions of conference spaces. The traditional, single-function conference r...
详细信息
ISBN:
(纸本)9798350391961;9798350391954
With the rapid development of information technology, people have put forward higher requirements for the audio-visual experience and usage functions of conference spaces. The traditional, single-function conference room configuration can no longer adapt to the diverse needs of modern work and interactive activities. How to achieve efficient management and control, seamless interconnection, and resource sharing of audio systems across spaces, while ensuring the acoustic characteristics and flexibility of each independent space, has become a core issue that needs to be urgently resolved in building a multi-hall, multi-functional conference room cluster. Based on the audio system project of the Academic Center of the Communication University of China, this paper proposes a solution for a conference room cluster audio system based on a distributed architecture. This solution not only overcomes the limitations of traditional systems in terms of scalability, collaborative work, and resource scheduling but also promotes lossless transmission, real-time processing, and adaptive configuration of audio signals. Thus, it ensures the consistency and high quality of the audio experience within the entire cluster, providing practical guidance for the design and optimization of future conference room audio systems.
Performance modeling is becoming an increasingly important part of the parallel application development process, particulary for expensive computations that will be run on very high-end systems where resources are sca...
详细信息
ISBN:
(纸本)9780769549712
Performance modeling is becoming an increasingly important part of the parallel application development process, particulary for expensive computations that will be run on very high-end systems where resources are scarce. We describe a performance modeling tool SIPMaP (Super Instruction Processor Modeling and Prediction) developed for the Super-Instruction Architecture (SIA). The SIA is designed for applications where the dominant data structures are large multi-dimensional arrays and it comprises a DSL, the Super-Instruction Assembly Language (SIAL) that supports expressing algorithms in terms of blocks (tiles), and its runtime system Super Instruction Processor (SIP) that manages distribution and disk storage of the arrays. SIPMaP generates performance models from the SIAL source code. In comparison with many applications where useful performance models have been developed and reported, these programs are irregular and have other difficult to model characteristics such as extensive overlapping of communication and computation.
Information systems are basically distributed while object oriented brings new issues in softwareengineering. Pairing distributedsystems with object paradigms results in a distributed object model. Choosing the appr...
详细信息
ISBN:
(纸本)0780382927
Information systems are basically distributed while object oriented brings new issues in softwareengineering. Pairing distributedsystems with object paradigms results in a distributed object model. Choosing the appropriate distributed object model is a multi-criteria decision problem. Performance is a key criterion that makes software developers be able to select the object-oriented system that best fit their requirements. Classical techniques and methodologies of performance analysis are either unsuitable or unnatural to capture performance behavior of Object-Oriented (OO) systems. In this paper, we present Performance-Based Model for distributed Object-Oriented software (DOOS) and a methodology to analyze and evaluate its performance. The new model evaluates the overall time cost of DOOS system considering the communication overheads. while preserving the OO features such as encapsulations, information hiding, inheritance, Etc.
In this paper, we consider the problem of scheduling a special kind of mixed data-parallel applications arising in the context of continuous integration. Continuous integration (CI) is a softwareengineering technique...
详细信息
ISBN:
(纸本)9780769546766
In this paper, we consider the problem of scheduling a special kind of mixed data-parallel applications arising in the context of continuous integration. Continuous integration (CI) is a softwareengineering technique, which consists in rebuilding and testing interdependent software components as soon as developers modify them. The CI tool is able to provide quick feedback to the developers, which allows them to fix the bug soon after it has been introduced. The CI process can be described as a DAG where nodes represent package build tasks, and edges represent dependencies among these packages;build tasks themselves can in turn be run in parallel. Thus, CI can be viewed as a mixed data-parallel application. A crucial point for a successful CI process is its ability to provide quick feedback. Thus, makespan minimization is the main goal. Our contribution is twofold. First we provide and analyze a large dataset corresponding to a build DAG. Second, we compare the performance of several scheduling heuristics on this dataset.
We argue for novel techniques to understand how cloud systems can fail, by enhancing fault injection with distributed tracing and anomaly detection techniques.
ISBN:
(纸本)9781538694435
We argue for novel techniques to understand how cloud systems can fail, by enhancing fault injection with distributed tracing and anomaly detection techniques.
暂无评论