To solve the large-scale scheduling problem in a shipyard more efficiently while meeting contract requirements, a three-layer parallel computing system was proposed. An optimization model for the shipbuilding project scheduling problem was constructed, taking time and resource constraints into account. Moreover, the key techniques of the proposed system were elaborated and its main steps were designed. In the first computing layer, the problem was decomposed into small parts in heterogeneous systems, reducing the problem scale; in the second layer, a coevolution strategy for multiple populations was put forward to improve the robustness of the algorithm; in the third layer, a massively parallel computing method was run on a Graphics Processing Unit (GPU) architecture. Finally, two simulation examples verified that the improved algorithm is robust and outperforms the other algorithms compared.
The article describes the implemented prototype of the Templet parallel computing system for the C++ language. The system uses a novel version of the actor execution model. The design of the actor model makes it possible to define the behavior of a parallel program developed in the Templet system in a mathematically strict way, with a temporal logic formula. We believe that this feature of the system is critical, because it gives application developers the freedom to implement actors on any desired platform. Our variant of the actor model can be easily reimplemented for different hardware and in different programming languages, at least for multithreaded execution in shared memory. The article defines the Templet actor model in terms of the Temporal Logic of Actions, discusses the system design, and shows some examples of its practical use. (C) 2017 The Authors. Published by Elsevier Ltd.
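To make the actor execution model concrete, here is a minimal sketch in Python. It is not the Templet API and not the authors' variant of the model: the names `Actor`, `System`, `send`, `run_until_idle` and `ping_pong` are hypothetical, and the sketch delivers messages on a single thread rather than in a multithreaded shared-memory runtime. It only illustrates the general idea that an actor owns private state and reacts to one message at a time.

```python
from collections import deque


class Actor:
    """A minimal actor: private state plus a handler run once per message."""

    def __init__(self, name, handler):
        self.name = name
        self.handler = handler
        self.state = {}


class System:
    """Hypothetical single-threaded runtime with one FIFO queue of messages."""

    def __init__(self):
        self.mailbox = deque()  # pending (actor, message) pairs

    def send(self, actor, message):
        self.mailbox.append((actor, message))

    def run_until_idle(self):
        # Deliver one message at a time; a handler touches only its own actor's state.
        while self.mailbox:
            actor, message = self.mailbox.popleft()
            actor.handler(self, actor, message)


def make_player(limit):
    """Build a handler that counts received messages and replies until `limit`."""

    def handler(system, actor, message):
        count, sender = message
        actor.state["seen"] = actor.state.get("seen", 0) + 1
        if count < limit:
            system.send(sender, (count + 1, actor))

    return handler


def ping_pong(limit):
    """Run two actors that pass a counter back and forth; return messages seen by each."""
    system = System()
    ping = Actor("ping", make_player(limit))
    pong = Actor("pong", make_player(limit))
    system.send(ping, (1, pong))
    system.run_until_idle()
    return ping.state.get("seen", 0), pong.state.get("seen", 0)
```

Because each handler only reads its own state and communicates through `send`, the same program could in principle be scheduled on several threads without changing the handlers, which is the kind of platform freedom the abstract refers to.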
The method employed in this study combines large-scale shaking table tests with fully coupled 3D non-linear dynamic analysis to investigate the site response and structural damage mechanisms of pile-group foundation systems at laterally spreading sites under weak and strong seismic excitation. Based on the test database, the study also aims to verify the applicability to geotechnical engineering of a high-efficiency, general-purpose parallel calculation method that involves SolidWorks, HyperMesh, OpenSeesMP, and GiD and is operated through a fully visualized interface. The results show that under weak seismic excitation, the acceleration and displacement in the free field and the near-pile region are linearly distributed along the burial depth, and the motion of the shallow pile foundation is mainly controlled by the inertial effect of the superstructure. Under strong seismic excitation, the displacement in the near-field region is much smaller than in the free field, showing a significant pile pinning effect; the displacement and acceleration in the near-pile region follow a cosine-shaped distribution along the burial depth, and the lateral spreading caused by site liquefaction intensifies. A weak interlayer is present at the interface between the clay and sand layers, which affects the structural response. The kinematic effect of the site dominates the response, and the location of vulnerability gradually shifts from the pile bottom to the pile head.
ISBN: (print) 9781467356091; 9781467356084
Various techniques have been developed in the effort to reduce the number of cancer deaths. The major problem is that cancer often cannot be detected at an early stage: the majority of patients are diagnosed at the final stage. If this problem can be solved, deaths from cancer can be reduced. Edge detection of tumors can therefore help radiologists diagnose the disease at an early stage. This paper focuses on the implementation of a parallel algorithm for human tumor edge detection. The objective of the study is to detect the edges of breast and brain tumors in mammogram and MRI medical images, respectively. The tumor edges are detected using Wavelet Transform Modulus Maxima on a parallel computing system, which serves as the platform for running the parallel edge detection algorithm. The performance of the parallel computer is reported in terms of processing time, speedup and efficiency.
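The abstract does not say how speedup and efficiency were computed; the sketch below uses the standard textbook definitions, speedup as serial time divided by parallel time and efficiency as speedup divided by the number of processors. The function names are illustrative, and any timings passed in are the caller's own measurements, not results from the paper.

```python
def speedup(t_serial, t_parallel):
    """Speedup S = T_1 / T_p for the same problem run serially and in parallel."""
    if t_parallel <= 0:
        raise ValueError("parallel time must be positive")
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, processors):
    """Efficiency E = S / p; values near 1.0 mean the processors are well used."""
    if processors < 1:
        raise ValueError("processor count must be at least 1")
    return speedup(t_serial, t_parallel) / processors
```

For instance, with made-up timings of 120 s on one processor and 40 s on four, `speedup(120.0, 40.0)` is 3.0 and `efficiency(120.0, 40.0, 4)` is 0.75.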
ISBN: (print) 9781467363273
Since the Finite-Difference Time-Domain (FDTD) method was proposed by Yee in 1966, its hardware acceleration problem arouses much *** this paper, how to use a GPU+COW parallel computing system to accelerate the FDTD method was introduced *** acceleration scheme and the simulation results were presented, and then the feasibility and efficiency of the proposed scheme were demonstrated.
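As background for why FDTD is a natural target for acceleration, here is a minimal one-dimensional Yee update loop in plain Python. It is a CPU-only illustration, not the GPU+COW scheme of the paper: the grid size, step count, Gaussian source parameters and the function name `fdtd_1d` are all assumptions, the fields are in normalized units, and the factor 0.5 is an assumed Courant number.

```python
import math


def fdtd_1d(cells=200, steps=100, t0=40.0, spread=12.0):
    """Propagate a Gaussian pulse on a 1D Yee grid (normalized units)."""
    ex = [0.0] * cells  # electric field samples
    hy = [0.0] * cells  # magnetic field samples, staggered half a cell
    source = cells // 2
    for n in range(steps):
        # E update: each cell reads only the H values of the previous half step.
        for k in range(1, cells):
            ex[k] += 0.5 * (hy[k - 1] - hy[k])
        # Hard source: overwrite E at the centre with a Gaussian in time.
        ex[source] = math.exp(-0.5 * ((n - t0) / spread) ** 2)
        # H update: each cell reads only the freshly updated E values.
        for k in range(cells - 1):
            hy[k] += 0.5 * (ex[k] - ex[k + 1])
    return ex, hy
```

Within each of the two inner loops, every cell depends only on the other field, never on a neighbour updated in the same loop, so the iterations are independent of one another. That independence is what allows the per-cell updates to be spread across many GPU threads or cluster nodes.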
We propose a new low-diameter interconnection network called FleX, which offers high flexibility when installing interconnections in an HPC system. FleX consists of multiple layers, with connections only between neighboring layers and none within a layer. These structural properties make it easy to achieve a low diameter regardless of scale. The cross-like connections between adjacent layers provide many alternative minimal paths, giving FleX high resiliency and a wide bisection width. We also discuss a minimal routing scheme and a stochastic load balancing scheme (LBR) for the proposed interconnection network. The performance of FleX is evaluated through cycle-based simulations, and its cost and power consumption are analyzed in comparison with other interconnection networks. We verify that FleX has high configuration flexibility with regard to cost and performance, and that it provides lower latency and higher saturation throughput than legacy HPC interconnection networks at the same cost. Moreover, we verify that, when combined with the proposed LBR, FleX can further increase its saturation throughput while sacrificing only a little latency.
Communication performance and symmetry are two vital properties of a multiprocessor interconnection network. The diameter is an important factor in determining the communication performance of a graph. In a vertex-transitive graph, every vertex is identical to every other vertex in relation to its location. In this paper, we study the communication performance and vertex-transitivity of the folded locally twisted cube FLTQ(n). We find that the diameter of FLTQ(n) is $\lceil n/2 \rceil$, which is less than the diameter proposed by Peng [S. Peng, C. Guo, B. Yang, Topological properties of folded locally twisted cubes, Journal of Computational Information Systems, 11 (2015) 7667-7676]. We also prove that FLTQ(n) is vertex-transitive for n = 2, and is not vertex-transitive for n = 3.
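The diameter of a small interconnection graph can be checked by breadth-first search. The sketch below does this for the ordinary hypercube Q(n) and the folded hypercube FQ(n); it does not implement FLTQ(n), whose adjacency rule is not given in this abstract. The function names are illustrative, and vertices are encoded as n-bit integers.

```python
from collections import deque


def hypercube_neighbors(v, n, folded=False):
    """Neighbours of vertex v in Q(n); FQ(n) adds an edge to the bitwise complement."""
    nbrs = [v ^ (1 << i) for i in range(n)]  # flip one bit per dimension
    if folded:
        nbrs.append(v ^ ((1 << n) - 1))  # complementary edge of the folded hypercube
    return nbrs


def eccentricity(start, n, folded=False):
    """Largest shortest-path distance from `start`, found by breadth-first search."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        v = queue.popleft()
        for w in hypercube_neighbors(v, n, folded):
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return max(dist.values())


def diameter(n, folded=False):
    """Diameter = maximum eccentricity over all 2**n vertices."""
    return max(eccentricity(v, n, folded) for v in range(1 << n))
```

In a vertex-transitive graph every vertex has the same eccentricity, so a single search from any vertex would suffice; the loop over all vertices is kept so that the same check also works on graphs that are not vertex-transitive.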
The topological properties of multiprocessor interconnection networks are important to the performance of high-performance computers. The hypercube network $Q(n)$ has proved to be one of the most popular interconnection networks. The n-dimensional locally twisted cube $LTQ(n)$ is an important variant of $Q(n)$. Fault diameter and wide diameter are two parameters for evaluating the communication performance of a network. Let $D(LTQ(n))$, $D^{f}_{n-1}(LTQ(n))$ and $D_{n}(LTQ(n))$ denote the diameter, the $(n-1)$-fault diameter and the wide diameter of $LTQ(n)$, respectively. In this paper, we prove that $D^{f}_{n-1}(LTQ(n)) = D_{n}(LTQ(n)) = D(LTQ(n)) + 2$ if $n$ is an odd integer with $n \ge 7$, and that $D(LTQ(n)) + 1 \le D^{f}_{n-1}(LTQ(n)) \le D_{n}(LTQ(n)) \le D(LTQ(n)) + 2$ if $n$ is an even integer with $n \ge 8$.
ISBN: (print) 9783030365929; 9783030365912
The paper describes the Q-system, a software system for studying the parallelism resource of numerical algorithms. The theoretical basis of the Q-system is the concept of the Q-determinant, where Q is the set of operations used by the algorithm. The Q-determinant consists of Q-terms, whose number equals the number of output data items. Each Q-term describes all possible ways to calculate one of the output data items from the input data. Any numerical algorithm has a Q-determinant and can be represented in the form of a Q-determinant. Such a representation is a universal description of numerical algorithms; it makes the algorithm transparent in terms of structure and implementation. The Q-system makes it possible to calculate the parallelism resource of any numerical algorithm and to compare the parallelism resources of two algorithms that solve the same algorithmic problem. In the paper we demonstrate the Q-system on numerical algorithms with different Q-determinant structures, including the matrix multiplication algorithm and the Gauss-Jordan, Jacobi and Gauss-Seidel methods for solving systems of linear equations, among others. The paper continues the research begun in the authors' previous papers. The results can be used to increase the efficiency of implementing numerical algorithms on parallel computing systems.