We present methods that can dramatically improve numerical consistency for parallel calculations across varying numbers of processors. By calculating global sums with enhanced precision techniques based on Kahan or Kn...
详细信息
We present methods that can dramatically improve numerical consistency for parallel calculations across varying numbers of processors. By calculating global sums with enhanced precision techniques based on Kahan or Knuth summations, the consistency of the numerical results can be greatly improved with minimal memory and computational cost. This study assesses the value of the enhanced numerical consistency in the context of general finite difference or finite volume calculations. (C) 2011 Elsevier B.V. All rights reserved.
The year 2017 marks the 15th anniversary of the SCIT supercomputer project, which allows us to summarize the results and draw conclusions. In this paper, we discuss the evolution of SCIT architecture and statistics of...
详细信息
The year 2017 marks the 15th anniversary of the SCIT supercomputer project, which allows us to summarize the results and draw conclusions. In this paper, we discuss the evolution of SCIT architecture and statistics of the supercomputing center for years 2002-2017. These data will be useful for computer cluster developers and researchers who design resource management algorithms for computing clusters.
Summarising distributed data is a central routine for parallel programming, lying at the core of widely used frameworks such as the map/reduce paradigm. In the IoT context it is even more crucial, being a privileged m...
详细信息
Provides an abstract of the tutorial presentation and may include a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings.
Provides an abstract of the tutorial presentation and may include a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings.
Provides an abstract of the invited presentation and may include a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings.
ISBN:
(纸本)9781538655566;9781538655559
Provides an abstract of the invited presentation and may include a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings.
Task parallelism is designed to simplify the task of parallel programming. When executing a task parallel program on modern NUMA architectures, it can fail to scale due to the phenomenon called work inflation, where t...
详细信息
The computation of geodesic distances is an important research topic in Geometry Processing and 3D Shape Analysis as it is a basic component of many methods used in these areas. In this work, we present a minimalistic...
详细信息
This paper is devoted to the research of bitmap image processing based on wavelet functions. The Daubechies wavelet function was used as a mathematical model for filtering, compression and smoothing of two-dimensional...
详细信息
This paper is devoted to the research of bitmap image processing based on wavelet functions. The Daubechies wavelet function was used as a mathematical model for filtering, compression and smoothing of two-dimensional signals, because the analysis of existing wavelet functions showed that the Daubechies wavelet family is most effective for image processing. OpenMP parallel programming in C/C++ was used for the parallelization of computing processes in image processing problems.
Modern switches have packet processing capacity of up to multi-tera bits per second, and they are also becoming more and more programmable. We seek to understand whether the programmability can translate packet proces...
详细信息
The use of object storage in the HPC world becomes a common case as it enables to overcome some POSIX limitations in scalability and performance. Indeed, object stores use a flat namespace, avoiding hierarchy in acces...
详细信息
ISBN:
(纸本)9781538678800
The use of object storage in the HPC world becomes a common case as it enables to overcome some POSIX limitations in scalability and performance. Indeed, object stores use a flat namespace, avoiding hierarchy in access requests and the cost of maintaining dependencies between multiple entries. Object stores also differentiate data flow from metadata flow, providing better concurrency and throughput. They can store trillions of objects and each object has its own customized metadata attributes so these metadata can be richer than POSIX attributes.
暂无评论