K. Harada conjectured that, for any finite group G, the product of the sizes of all conjugacy classes is divisible by the product of the degrees of all irreducible characters. We study this conjecture when G is the general linear g...
Although β-hydroxybutyrate (BHB), one of the endogenous ketone bodies, is highly bioactive, it is rapidly consumed, metabolized, and eliminated from the body. In this study, we designed new self-assembling na...
It is known that the set of possible cofinalities pcf(A) has good properties if A is a progressive interval of regular cardinals. In this paper, we give an interval of regular cardinals A such that pcf(A) has no good ...
ISBN: (digital) 9781665415927
ISBN: (print) 9781665415934
In an earlier study, we optimized the Authentic Radiative Transfer (ART) method to solve the space radiative transfer problems in early universe astrophysical simulations using an Intel Arria 10 Field-Programmable Gate Array (FPGA). In this paper, we optimize this method for use on the latest FPGA, an Intel Stratix 10, and evaluate its performance against a GPU implementation on multiple nodes. For the multi-FPGA computing and communication framework, we apply our original system, called the Communication Integrated Reconfigurable CompUting System (CIRCUS), to realize OpenCL-based programming and to utilize multiple optical links on an FPGA for parallel FPGA processing. This study is the first implementation of a real application using CIRCUS. The FPGA implementation is 4.54, 8.41, and 10.64 times faster than the GPU implementation on one, two, and four nodes, respectively, where the multi-GPU cases use an InfiniBand HDR100 network. It also achieves 94.2% parallel efficiency running on four FPGAs. We believe this efficiency results from the low-latency, high-efficiency pipelined communication of CIRCUS, which enables easy programming on multiple FPGAs using OpenCL for high-performance computing applications.
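
The abstract does not state how parallel efficiency is defined. As a rough sketch, assuming the conventional strong-scaling definition (speedup over a single device divided by the number of devices), the reported 94.2% on four FPGAs would correspond to a speedup of about 3.77 over one FPGA. The function names and the timing arguments below are hypothetical and are not taken from the paper.

```python
def parallel_efficiency(t_single: float, t_parallel: float, n_devices: int) -> float:
    """Strong-scaling efficiency (assumed definition): speedup / device count."""
    return (t_single / t_parallel) / n_devices


def speedup_from_efficiency(efficiency: float, n_devices: int) -> float:
    """Invert the definition above to get the speedup implied by an efficiency."""
    return efficiency * n_devices


# Only the 94.2% efficiency and the device count of four come from the abstract.
implied_speedup = speedup_from_efficiency(0.942, 4)
print(f"Implied speedup on 4 FPGAs: {implied_speedup:.3f}x")
```

If the authors used a different definition (for example, weak scaling with a problem size that grows with the node count), the implied speedup would not follow from this formula.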