Current performance analysis and tuning tools must be able to improve the performance of large-scale parallel applications. To be effective, such analysis and tuning tools must be scalable and be able to manage the dy...
详细信息
Exascale computing requires complex runtime systems that need to consider affinity, load balancing and low time and message complexity for scheduling massive scale parallel computations. Simultaneous consideration of ...
详细信息
We investigate models that efficiently map branch-And-bound algorithms on a distributed computer architecture using a case of multidimensional Lipschitz Global Optimization. A combination of MPI and Pthreads is studie...
详细信息
In our paper we present an abstract object oriented runtime system that helps to develop scientific applications for new her erogenous architectures based on multi-node of multi-core processors enhanced with accelerat...
详细信息
Standard system tools employed by users on a daily basis do not take full advantage of parallel file system I/O bandwidth and do not understand associated idiosyncrasies such as Lustre striping. This can lead ton on-o...
详细信息
In this paper we describe two ongoing initiatives for teaching concurrency and distribution in PUC-Rio and UFRJ. One of them is a new approach for teaching distributed systems. Conventional distributed system courses ...
详细信息
This paper focuses on Platform as a Service (PaaS). From the advent of Cloud computing, the latest trend has been to design ad hoc platforms specialized to address given operational scenarios. In this context, the nee...
详细信息
As the tapering off of Moore's Law produces the need for more parallelism in high-performance applications, the parallel programming model becomes central to achieving and maintaining performance per watt on curre...
详细信息
Dynamic load-balancing in parallel algorithms typically requires locks and/or atomic instructions for correctness. We have shown that sometimes an optimistic parallelization approach can be used to avoid the use of lo...
详细信息
We present a parallel implementation of three transmission switching algorithms. The first is based on a parallel search of all candidate lines, the second is based on a priority listing of lines and the third is base...
详细信息
暂无评论