The dynamic star simulator is a commonly used ground-test calibration device for star sensors. For the problems of slow calculation speed, low integration, and high power consumption in the traditional star chart simu...
详细信息
The dynamic star simulator is a commonly used ground-test calibration device for star sensors. For the problems of slow calculation speed, low integration, and high power consumption in the traditional star chart simulation method, this paper designs a FPGA-based star chart display algorithm for a dynamic star simulator. The design adopts the USB 2.0 protocol to obtain the attitude data, uses the SDRAM to cache the attitude data and video stream, extracts the effective navigation star points by searching the starry sky equidistant right ascension and declination partitions, and realizes the pipelined displaying of the star map by using the parallel computing capability of the FPGA. Test results show that under the conditions of chart field of view of phi 20 degrees and simulated magnitude of 2.0 similar to 6.0 Mv, the longest time for calculating a chart is 72 mu s under the clock of 148.5 MHz, which effectively improves the chart display speed of the dynamic star simulator. The FPGA-based star map display algorithm gets rid of the dependence of the existing algorithm on the computer, reduces the volume and power consumption of the dynamic star simulator, and realizes the miniaturization and portable demand of the dynamic star simulator.
Railway wheel-rail contact simulations are the most important and time-consuming tasks when simulating the system dynamics of vehicles. parallel computing is a good approach for improving the numerical computing speed...
详细信息
Railway wheel-rail contact simulations are the most important and time-consuming tasks when simulating the system dynamics of vehicles. parallel computing is a good approach for improving the numerical computing speed. This paper reports the advances in parallel computing of the wheel-rail contact simulations. The proposed method uses OpenMP to parallelise the multiple contact points of all the wheel-rail interfaces of a locomotive model. The method has been implemented in the vehicle system dynamics simulation package GENSYS. Simulations were conducted using two numerical solvers (4th Runge-Kutta and HeunC) and a maximum of four computer cores. Simulation cases have shown exactly the same numerical results using serial computing and parallel computing, which prove the effectiveness of the parallel computing method. The HeunC solver achieved the same simulation results and is 3.5 times faster than the 4th Runge-Kutta method. Simulation results obtained from both numerical solvers show that parallel computing using 2, 3 and 4 computer cores can improve the simulation speeds by roughly 29, 39 and 41%, respectively. There is an apparent diminishing of the rate of improvement due to the increase of the communication resource overhead when more computer cores are used. Using up to four computer cores does not require revision of the GENSYS code, and simulations can be executed using personal computers.
In this article, an innovative 3-dimensional (3D) heat-transfer finite element parallel-computing model with adaptive mesh capability, named LPBFSim, for Laser Powder Bed Fusion (LPBF) additive manufacturing is introd...
详细信息
In this article, an innovative 3-dimensional (3D) heat-transfer finite element parallel-computing model with adaptive mesh capability, named LPBFSim, for Laser Powder Bed Fusion (LPBF) additive manufacturing is introduced for high precision prediction of melt pool dimensions and single-layer part-level process simulations. Numerical modeling can significantly reduce the expense of the sole deployment of trial-and-error experiments for achieving optimal process parameters. The previously developed single-track model [1] is highly accurate to predict melt pool dimensions for different combinations of process parameters. However, without using parallel computing and an adaptive mesh, it is very challenging to scale it up to a multi-track model or even a part-level model due to the high computational cost. The proposed model is parallelized to be able to run on a cluster and aimed to solve this diffi culty while keeping all the accuracy from the previous version. Single-track, multi-track, and single-layer part-level models have been implemented to demonstrate its efficiency and accuracy.
Modelling the flow properties of rubber blends makes it possible to predict their rheological behaviour during the processing and production of rubber-based products. As the nonlinear nature of such complex processes ...
详细信息
Modelling the flow properties of rubber blends makes it possible to predict their rheological behaviour during the processing and production of rubber-based products. As the nonlinear nature of such complex processes complicates the creation of exact analytical models, it is appropriate to use artificial intelligence tools in this modelling. The present study was implemented to develop a highly efficient artificial neural network model, optimised using a novel training algorithm with fast parallel computing to predict the results of rheological tests of rubber blends performed under different conditions. A series of 120 real dynamic viscosity-time curves, acquired by a rubber process analyser for styrene-butadiene rubber blends with varying carbon black contents vulcanised at different temperatures, were analysed using a Generalised Regression Neural Network. The model was optimised by limiting the fitting error of the training dataset to a pre-specified value of less than 1%. All repeated calculations were made via parallel computing with multiple computer cores, which significantly reduces the total computation time. An excellent agreement between the predicted and measured generalisation data was found, with an error of less than 4.7%, confirming the high generalisation performance of the newly developed model.
With the development of engineering technology, engineering has higher requirements for the accuracy and the scale of simulation calculation. The computational efficiency of traditional serial programs cannot meet the...
详细信息
With the development of engineering technology, engineering has higher requirements for the accuracy and the scale of simulation calculation. The computational efficiency of traditional serial programs cannot meet the requirements of engineering. Therefore, reducing the calculation time of the temperature control simulation program has important engineering significance for real-time simulation of temperature field and stress field, and then adopting more reasonable temperature control and crack prevention measures. GPU parallel computing is introduced into the temperature control simulation program of massive concrete to solve this problem and the optimization is carried out. Considering factors such as GPU clock rate, number of cores, parallel overhead and parallel Region, the improved GPU parallel algorithm analysis indicator formula is proposed. It makes up for the shortcomings of traditional formulas that focus only on time. According to this formula, when there are enough threads, the parallel effect is limited by the size of the parallel domain, and when the parallel domain is large enough, the efficiency is limited by the parallel overhead and the clock rate. This paper studies the optimal Kernel execution configuration. Shared memory is utilized to improve memory access efficiency by 155%. After solving the problem of bank conflicts, an accelerate rate of 437.5x was realized in the subroutine of the matrix transpose of the solver. The asynchronous parallel of data access and logical operation is realized on GPU by using CUDA Stream, which can overlap part of the data access time. On the basis of GPU parallelism, asynchronous parallelism can double the computing efficiency. Compared with the serial program, the accelerate rate of inner product matrix multiplication of the GPU asynchronous parallel program is 61.42x. This study further proposed a theoretical formula of data access overlap rate to guide the selection of the number of CUDA streams to achieve th
Accurate input of geological elements is essential for evaluating or predicting natural hazards such as subsidence, landslides, and earthquakes. This paper proposes an approach to carry out an open pit's overall a...
详细信息
Accurate input of geological elements is essential for evaluating or predicting natural hazards such as subsidence, landslides, and earthquakes. This paper proposes an approach to carry out an open pit's overall and whole-process mechanical analysis with complex geological conditions, using precise modeling and large-scale parallel calculation techniques. Taking the Fushun West Open-pit Mine (the largest open-pit coal mine in Asia) as an example, through the elaborate multi-method geological investigation, the interfaces of interbedded shales and mudstones, the unloading zones, and the small structures were identified, a detailed 3D geological model was built and finely meshed in full-size with 100 million degrees of freedom, large-scale parallel numerical simulation was then performed, the results agree well with the InSAR monitoring data and in situ observations. Besides, the simulation can replicate the landslides in recent years. Through the simulation, it is possible to locate the potential landslide area, and targeted backfilling schemes for stability treatment were put forward and further simulated. The results indicate that the proposed approach can more effectively and reliably evaluate the Fushun West Open-pit Mine's overall slope stability and closure plan.
A new version of the numerical model of artificial Earth satellites (AES) motion is presented, which consists of four program blocks intended for 1) predicting the AES motion, 2) studying the chaotic condition in moti...
详细信息
A new version of the numerical model of artificial Earth satellites (AES) motion is presented, which consists of four program blocks intended for 1) predicting the AES motion, 2) studying the chaotic condition in motion of near-Earth space objects, 3) determining the AES motion parameters from the measurement data, and 4) studying the resonance dynamics of near-Earth objects. The main feature of the new version is the use of a new more efficient integrator, which is a further development of the well-known Everhart integrator. It is shown that with the same accuracy, the new integrator has much higher performance. The version intended for use in the parallel computing environment and called the "Numerical model of motion of AES systems" has undergone additional changes related to the optimization of the computation parallelization process. The estimates show that with the new method of parallelization, the integration accuracy is more stable and the integration speed increases several times.
The emerging multi-mode cooling tower can cool down the circulating water by flexibly switching the operating modes according to varying weather conditions. Herein, a computational framework for addressing a large-sca...
详细信息
The emerging multi-mode cooling tower can cool down the circulating water by flexibly switching the operating modes according to varying weather conditions. Herein, a computational framework for addressing a large-scale stochastic simulation-optimization task is developed to obtain the optimal thermodynamic performance of the multi-mode cooling system. First, the numerical model is constructed using a well-validated evaporative cooler in the wet and wet-heating modes, as well as an air cooler in the dry mode. A well-suited experimental design is performed for generating an optimal set of samples by approximating the multivariate probability distributions of uncertain data. To reduce the computational burden, a customized parallel computing strategy is presented via parallelization of the task using the message-passing interface. Finally, an example illustrates that the time reduction is up to 93.5%, while the optimal exergy efficiency ratios are expected to be 37.0%, 17.3%, and 22.6% for the wet, dry, and wet-heating modes, respectively.
Software-intensive systems in most domains, from autonomous vehicles to health, are becoming predominantly parallel to efficiently manage large amount of data in short (even real-) time. There is an incredibly rich li...
详细信息
Software-intensive systems in most domains, from autonomous vehicles to health, are becoming predominantly parallel to efficiently manage large amount of data in short (even real-) time. There is an incredibly rich literature on languages for parallel computing, thus it is difficult for researchers and practitioners, even experienced in this very field, to get a grasp on them. With this work we provide a comprehensive, structured, and detailed snapshot of documented research on those languages to identify trends, technical characteristics, open challenges, and research directions. In this article, we report on planning, execution, and results of our systematic peer-reviewed as well as grey literature review, which aimed at providing such a snapshot by analysing 225 studies.
This article introduces the calculation method for continuous safety analysis of GPU speed. Combine the configuration of CPU-GPU software with hardware and software configuration, logically allocate computing resource...
详细信息
This article introduces the calculation method for continuous safety analysis of GPU speed. Combine the configuration of CPU-GPU software with hardware and software configuration, logically allocate computing resources, and use the DC flow system to diagnose whether there are errors in the entire network. Combining the characteristics of network connectivity after a failure, a true N-1 error analysis based on the same calculation strategy of data collection is proposed, and comparisons are made. Various levels of granularity are designed and calculations are performed. This method can create power lines for each group, and ensure fast, real-time safety analysis, and propose an improved method to analyze Web applications. In the multi-level parallel computing model, the key steps of multi-GPU integration are described. The virtual container technology detects applications that intensively interact between multiple GPU panels in managing security analysis and sensitivity analysis applications, as well as application design for security application analysis and sensitivity analysis. This paper selects several typical construction project cases through the research on the construction safety management of construction companies based on the Internet of Things, which mainly involve five issues of on-site safety inspection, staff safety education, safety conflict management and safety awareness, and the application of BIM technology. Highlights the company's problems in the safety management of engineering construction;it also consults and integrates relevant construction team types and actual management conditions, and combines viewpoints and actual conditions to provide reasonable and contradictory suggestions from five perspectives, namely: completion Safe construction project system, using advanced technology to improve the efficiency of construction safety management.
暂无评论