The proceedings contain 32 papers. Topics discussed include algorithms for parallelization, distributed computer systems and networking, software tools and environments, parallel finite and boundary elements, applications in fluid flow, and applications in applied science.
Several algorithms are known for solving linear systems of equations. Iterative algorithms are recommended for large linear systems with sparse matrices. But in the case of general non-symmetrical or n x m matrices th...
ISBN: 9781538655023 (print)
We propose two extensions for a state-of-the-art method of rollback-recovery in distributed CEP (complex event processing). In CEP, an operator network is used to search for patterns in event streams. Sometimes these operators fail and lose their state. Rollback-recovery is a method for dealing with such state losses. The type of rollback-recovery we consider is upstream backup, where the state of a failed operator is recovered by replaying to it the input events that led it to that state. These events are kept in upstream operators' memory buffers, which are trimmed continuously as the downstream operator progresses. The first extension we propose saves memory and speeds up recovery by avoiding the storage and retransmission of unnecessary events. The second extension makes the base method of upstream backup compatible with data-parallel CEP, allowing the windows into which operators partition their input to be processed in parallel. We evaluated the proposed extensions through experiments that showed a significant reduction in memory usage and recovery time at the expense of a negligible processing overhead during normal operation.
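The basic upstream-backup mechanism described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not cover either proposed extension; the class names `UpstreamBuffer` and `SumWindowOperator`, the tumbling-window sum, and the `safe_seq` acknowledgement are all hypothetical choices made for illustration.

```python
from collections import deque


class UpstreamBuffer:
    """Keeps every event sent downstream until the downstream operator no longer needs it."""

    def __init__(self):
        self._events = deque()  # (sequence_number, event) pairs in send order

    def send(self, seq, event, downstream):
        self._events.append((seq, event))
        downstream.process(seq, event)

    def trim(self, acked_seq):
        # Drop events up to and including the acknowledged sequence number.
        while self._events and self._events[0][0] <= acked_seq:
            self._events.popleft()

    def replay(self, downstream):
        # Re-send the retained events so the downstream operator can rebuild its state.
        for seq, event in self._events:
            downstream.process(seq, event)


class SumWindowOperator:
    """Hypothetical operator: emits the sum of each tumbling window of fixed size."""

    def __init__(self, window_size):
        self.window_size = window_size
        self.state = []      # events of the currently open window (volatile)
        self.results = []    # sums already emitted (assumed delivered, not lost)
        self.safe_seq = -1   # highest sequence number not needed for recovery

    def process(self, seq, event):
        self.state.append(event)
        if len(self.state) == self.window_size:
            self.results.append(sum(self.state))
            self.state = []
            self.safe_seq = seq  # everything up to here may be trimmed upstream

    def fail(self):
        self.state = []  # simulate losing the in-memory state


def demo():
    upstream = UpstreamBuffer()
    operator = SumWindowOperator(window_size=3)
    for seq, event in enumerate([1, 2, 3, 4, 5]):
        upstream.send(seq, event, operator)
        upstream.trim(operator.safe_seq)
    operator.fail()
    upstream.replay(operator)
    return operator.state, operator.results, len(upstream._events)
```

In this sketch the buffer holds only the events of the still-open window at the time of failure, so replaying them restores the operator's state without re-emitting the result of the window that had already closed.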
This special issue of the Journal of Systems Science & Complexity (JSSC) includes papers selected from the 1st International Conference on Forecasting Economic & Financial Systems (FEFS-2012) and the 5th International Workshop on Singular Spectrum Analysis & its Applications (SSA-2012), held in Beijing, China, May 17-20, 2012.
ISBN: 9783540681052 (print)
Parallel versions of bioinspired algorithms are presented in the paper. Parallel evolutionary algorithms and artificial immune systems are described. Applications of bioinspired algorithms to the optimization of mechanical structures are shown. The numerical tests presented in the paper were computed on a grid based on the Alchemi framework.
ISBN: 9781479982523 (print)
Recently, the OpenCL hardware-software co-design methodology has gained traction in realizing effective parallel architecture designs on heterogeneous FPGA platforms. The portability of OpenCL to hardware-ready platforms such as GPUs or multicore CPUs eases design verification. This is especially true for parallel algorithms, which can be verified before implementing them in cumbersome HDL-based RTL design. In this paper we employ the OpenCL programming platform based on the Altera SDK for OpenCL (AOCL) to implement a Sobel filter algorithm as an image processing test case on a Cyclone V FPGA board. Using the portability of this platform, the performance of the kernel code is benchmarked against that of the GPU and multicore CPU implementations for different image and kernel sizes. Different optimization strategies are also applied for each platform. We found that increasing the Sobel filter kernel size from 3 x 3 to 5 x 5 results in only an 11.3% increase in computation time for the FPGA, whereas the effect was much more significant on the other platforms, where execution time increased by as much as 23.6% and 85.7% for CPU and GPU, respectively.
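For readers unfamiliar with the test case, a 3 x 3 Sobel filter can be sketched in plain Python as below. This is a sequential reference illustration only, not the paper's OpenCL kernel; the function name `sobel_magnitude`, the list-of-lists image layout, and the choice to leave border pixels at zero are assumptions made here.

```python
import math

# Standard 3 x 3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1],
           [0, 0, 0],
           [1, 2, 1]]


def sobel_magnitude(image):
    """Return the gradient magnitude of a grayscale image given as a list of rows.

    Border pixels are left at 0.0 (an assumption of this sketch).
    """
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0
            # Each output pixel reads its 3 x 3 neighbourhood: 9 input pixels.
            for dy in range(3):
                for dx in range(3):
                    pixel = image[y + dy - 1][x + dx - 1]
                    gx += SOBEL_X[dy][dx] * pixel
                    gy += SOBEL_Y[dy][dx] * pixel
            out[y][x] = math.hypot(gx, gy)
    return out
```

Each output pixel depends only on its own neighbourhood, which is why the computation maps naturally onto one parallel work-item per pixel. A 5 x 5 kernel reads 25 input pixels per output instead of 9, which is the extra work behind the kernel-size comparison reported above.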
ISBN: 9781457711800 (print)
Spherical harmonics serve as basis functions on the unit sphere, and the spherical harmonic transform is required in the analysis and processing of signals in the spectral domain. We investigate the possibility of parallel computation of the spherical harmonic transform using Compute Unified Device Architecture (CUDA) with no communication between parallel kernels. We identify the parallel components in the widely used spherical harmonic transform method proposed by Driscoll and Healy. We provide the implementation details and compare the computational complexity with the sequential algorithm. For a given bandlimited signal with maximum spherical harmonic degree L, using O(L) parallel processing kernels, we show that the spherical harmonic coefficients can be calculated in $O(L \log^2 L)$ time, as compared to $O(L^2 \log^2 L)$ for the sequential algorithm. For corroboration, we provide simulation results using CUDA, which indicate the reduction in computational complexity.
ISBN: 3540437924 (print)
In this paper a new parallel algorithm is presented for the generation of t-ary trees. Computations run in an associative processor model. Tree sequences are generated in lexicographic order, with O(1) time per object, in a new representation, as combinations with repetitions with restricted growth. The corresponding full t-ary trees (x-sequences) appear in antilexicographic order.
ISBN: 9781665423144 (print)
With its strong floating-point operation capability and high memory bandwidth in data parallelism, the graphics processing unit (GPU) has been widely used in general-purpose computing. GPU-based computations have been extensively applied in the field of computational fluid dynamics (CFD). This paper aims to design an extremely efficient double-precision GPU-accelerated parallel algorithm for supersonic flow computations on hybrid grids. Compute Unified Device Architecture (CUDA) is used as a general-purpose parallel computing platform and programming model to run parallel codes on GPUs. The cell-centered finite volume method based on unstructured grids is used for the spatial discretization of the governing equations, whereas the three-stage explicit Runge-Kutta scheme with second-order accuracy is used for temporal discretization. Turbulence is modeled using the k-omega SST two-equation model. Three test cases are studied to validate the computational accuracy of the proposed algorithm. The numerical results agree well with the experimental data, thereby suggesting that the GPU-accelerated parallel algorithm has good accuracy.
ISBN: 0780326520 (print)
Maskless laser micro-processing has been applied to the acrylic resin used for a frame of x-ray telescope mirrors, using a cw-CO2 laser. The dependence of the processed surface morphology on the laser power, beam scan speed, and flux of assist gas is investigated. The best processing condition for a good processed surface is defined in this study. A double scanning method is applied to obtain a smooth and transparent surface.