The possibility of making massive computing resources available to users opens up ideas for future interoperability between multiple infrastructure systems. This wide system should be composed of multiple high perf...
Authors:
Jorba, Josep; Margalef, Tomás; Luque, Emilio
Estudis d'Informatica Multimedia i Telecomunicacio, Rambla del Poblenou 156, ES-08018 Barcelona, Spain
Computer Architecture and Operating Systems Department, ES-08193 Bellaterra, Spain
Performance is a crucial issue for parallel/distributed applications. One useful kind of tool in this context is the automatic performance analysis tool, which helps developers in some of the phases of the performan...
ISBN: 9780769530154; 076953015X (print)
Computational science is often referred to as the third science, complementing both theoretical and laboratory science. In this field, new challenges arise continuously. Defining the asymptotic time complexity of both deterministic and non-deterministic algorithms for all kinds of problems is one of the key issues in computer science: knowing the limit of an algorithm's execution time as the problem size goes to infinity is essential. Data-dependent applications are particularly challenging because, for a given problem, different input data sets may cause variability in execution times. This study focuses on developing a complete approach to define the asymptotic time complexity of a hard data-dependent parallel application that solves the traveling salesman problem (TSP). Two different parallel TSP algorithms are presented. One is used to show the usefulness and benefits of the proposed approach, and the other is used as a control. The experimental results are quite promising.
ISBN: 1595933026 (print)
Many authors have proposed power management techniques for general-purpose processors at the cost of degraded performance such as lower IPC or longer delay. Some proposals have focused on cache memories because they consume a significant fraction of total microprocessor power. We propose a reconfigurable and adaptive cache microarchitecture based on field-programmable technology that is intended to deliver high performance at low energy consumption. In this paper, we evaluate the performance and energy consumption of a run-time algorithm when used to manage a field-programmable L1 data cache. The adaptation strategy is based on two techniques: a learning process provides the best cache configuration for each program phase, and a recognition process detects program phase changes by using data working-set signatures to activate a low-overhead reconfiguration mechanism. Our proposals achieve performance improvement and cache energy saving at the same time. Considering a design scenario driven by performance constraints, we show that processor execution time and cache energy consumption can be reduced on average by 15.2% and 9.9% compared to a non-adaptive high-performance microarchitecture. Alternatively, when energy saving is prioritized and considering a non-adaptive energy-efficient microarchitecture as baseline, cache energy and processor execution time are reduced on average by 46.7% and 9.4% respectively. In addition to comparing to conventional microarchitectures, we show that the proposed microarchitecture achieves better performance and more cache energy reduction than other configurable caches. Copyright 2006 ACM.
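The recognition process mentioned in the abstract above, which detects phase changes from data working-set signatures, can be illustrated with a minimal sketch. The abstract does not give the signature size, hash function, or threshold, so `LINE_BITS`, `SIG_BITS`, `THRESHOLD`, and all function names here are assumptions chosen for illustration, not the authors' mechanism.

```python
from typing import Iterable, List

LINE_BITS = 6      # assumed 64-byte cache lines
SIG_BITS = 1024    # assumed signature length in bits
THRESHOLD = 0.5    # assumed relative-distance threshold for a phase change


def signature(addresses: Iterable[int]) -> int:
    """Fold the cache lines touched in one window into a bit vector."""
    sig = 0
    for addr in addresses:
        line = addr >> LINE_BITS
        sig |= 1 << (line % SIG_BITS)
    return sig


def distance(a: int, b: int) -> float:
    """Relative distance: bits that differ divided by bits set in either."""
    union = bin(a | b).count("1")
    if union == 0:
        return 0.0
    return bin(a ^ b).count("1") / union


def phase_changes(windows: List[List[int]]) -> List[int]:
    """Indices of windows whose signature differs sharply from the previous one."""
    changes = []
    prev = None
    for i, window in enumerate(windows):
        sig = signature(window)
        if prev is not None and distance(prev, sig) > THRESHOLD:
            changes.append(i)
        prev = sig
    return changes
```

In this sketch, a detected change would be the point at which a reconfiguration is triggered; two consecutive windows touching the same lines produce a distance of zero and no change.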
Distributed event simulation (DES) is a useful tool for high performance simulation. This type of simulation is recommended for complex models or when a high degree of model detail is needed. Fish School is an individ...
Accurate indirect jump prediction is critical for some applications. Proposed methods are not efficient in terms of chip area. Our proposal evaluates a mechanism called target encoding that provides a better ratio bet...
ISBN: 3832243623 (print)
Wildland fire-risk assessment is a very significant issue. This risk assessment is usually based on ignition probability due to meteorological or human factors, but it does not usually consider propagation danger once a wildland fire has started. To evaluate propagation danger, it is necessary to apply a propagation model and simulate the behaviour of the fireline. However, propagation danger must be evaluated across many different possible scenarios. The number of simulations required is therefore enormous, and high-performance computing techniques are needed to make the methodology feasible. In this paper, a method for creating propagation danger maps based on factorial experimentation is described. The methodology was applied at the scale of southern Europe during the summer of 2004.
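Factorial experimentation, as named in the abstract above, means evaluating every combination of factor levels. The sketch below enumerates such a full factorial design and averages a result over all scenarios. The factor names, their levels, and `toy_spread_rate` are hypothetical placeholders, since the abstract does not list the factors or the propagation model actually used.

```python
from itertools import product
from statistics import mean

# Hypothetical factor levels; the abstract does not list the real ones.
FACTORS = {
    "wind_speed_kmh": [5, 20, 40],
    "fuel_moisture_pct": [5, 15],
    "slope_deg": [0, 20],
}


def toy_spread_rate(wind_speed_kmh, fuel_moisture_pct, slope_deg):
    """Placeholder spread rate in arbitrary units, standing in for a real model."""
    return (1 + wind_speed_kmh / 10) * (1 + slope_deg / 20) / (1 + fuel_moisture_pct / 10)


def scenarios(factors):
    """Yield one dict per combination of factor levels (full factorial design)."""
    names = list(factors)
    for levels in product(*(factors[name] for name in names)):
        yield dict(zip(names, levels))


def danger_index(factors):
    """Average the toy spread rate over every scenario for one map cell."""
    return mean(toy_spread_rate(**s) for s in scenarios(factors))
```

The scenario count is the product of the level counts, which is why the number of simulations grows quickly. Because each scenario is independent, the evaluations can be distributed across processors.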
ISBN: 9781581133349 (print)
Multimedia applications are fast becoming one of the dominating workloads for modern computer systems. Since these applications normally have large data sets and little data reuse, many researchers believe that they have poor memory behavior compared to traditional programs, and that current cache architectures cannot handle them well. It is therefore important to quantitatively characterize the memory behavior of these applications in order to provide insights for the future design and research of memory systems. However, very few results on this topic have been published. This paper presents a comprehensive study of the memory requirements of a group of programs that are representative of multimedia applications. These programs include a subset of the popular MediaBench suite and several large multimedia programs running on the Linux, Windows NT and Tru64 UNIX operating systems. We performed extensive measurement and trace-driven simulation experiments. We then compared the memory utilization of these programs to that of SPECint95 applications. We found that multimedia applications actually have better memory behavior than SPECint95 programs. The high cache hit rates of multimedia applications can be attributed to three factors. First, most multimedia applications apply block partitioning algorithms to the input data and work on small blocks of data that easily fit into the cache. Second, within these blocks, there is significant data reuse as well as spatial locality. Third, a large number of the references generated by multimedia applications are to their internal data structures, which are relatively small and can also easily fit into reasonably sized caches.
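The first two factors above, block partitioning and reuse within a block, can be illustrated with a toy trace-driven simulation. This is a minimal sketch under assumed parameters (a 1 KB direct-mapped cache with 32-byte lines, a 64x64 image with one byte per pixel, 8x8 blocks); it is not the paper's experimental setup, and all names are hypothetical.

```python
LINE_SIZE = 32        # bytes per cache line (assumed)
NUM_LINES = 32        # 32 lines of 32 bytes: a 1 KB direct-mapped cache (assumed)
WIDTH = HEIGHT = 64   # 64x64 image, 1 byte per pixel (assumed)
BLOCK = 8             # 8x8 pixel blocks (assumed)


def count_misses(trace):
    """Replay a sequence of byte addresses through a direct-mapped cache."""
    cache = [None] * NUM_LINES
    misses = 0
    for addr in trace:
        line = addr // LINE_SIZE
        slot = line % NUM_LINES
        if cache[slot] != line:
            cache[slot] = line
            misses += 1
    return misses


def whole_image_two_passes():
    """Read the entire image twice, row by row."""
    for _ in range(2):
        for y in range(HEIGHT):
            for x in range(WIDTH):
                yield y * WIDTH + x


def blocked_two_passes():
    """Read each 8x8 block twice before moving on to the next block."""
    for by in range(0, HEIGHT, BLOCK):
        for bx in range(0, WIDTH, BLOCK):
            for _ in range(2):
                for y in range(by, by + BLOCK):
                    for x in range(bx, bx + BLOCK):
                        yield y * WIDTH + x
```

Both traces touch the same addresses the same number of times. With these assumed parameters, the whole-image order misses on every line again in the second pass because the image is larger than the cache, whereas the blocked order reuses each block while it is still cached and incurs half as many misses.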
ISBN: 9783540481584 (digital)
ISBN: 9783540665496 (print)
Parallel Virtual Machine (PVM) and Message Passing Interface (MPI) are the most frequently used tools for programming according to the message passing paradigm, which is considered one of the best ways to develop parallel applications. This volume comprises 67 revised contributions presented at the Sixth European PVM/MPI Users' Group Meeting, which was held in Barcelona, Spain, 26-29 September 1999. The conference was organized by the Computer Science Department of the Universitat Autònoma de Barcelona. The conference was previously held in Liverpool, UK (1998) and Cracow, Poland (1997). The first three conferences were devoted to PVM and were held at the TU Munich, Germany (1996), ENS Lyon, France (1995), and the University of Rome (1994). This conference has become a forum for users and developers of PVM, MPI, and other message passing environments. Interaction between those groups has proved to be very useful for developing new ideas in parallel computing and for applying existing ones to new practical fields.