检索结果-内蒙古大学图书馆

A scalable parallel computing method for autonomous platoons

VEHICLE SYSTEM DYNAMICS 2024年第9期62卷 2283-2303页

作者： Wu, Qing Ge, Xiaohua Han, Qing-Long Cole, Colin Spiryagin, Maksym Cent Queensland Univ Sch Engn & Technol Rockhampton Qld Australia Swinburne Univ Technol Sch Sci Comp & Engn Technol Melbourne Australia

This paper developed a scalable parallel computing method that can be used for platoon simulations and controller validations. A scalable adaptive platooning control law was firstly designed, which accommodates a variety of vehicle-to-vehicle communication topologies. A road vehicle dynamics model that considered the Magic Formula tyre model and suspension dynamics was then derived and validated. The parallel computing method adopted the Message Passing Interface technique to allow fast and scalable simulations. Platoon length changes do not require controller and algorithm changes. An 11-vehicle platoon on a real-world 10 km long road section was simulated. Different localisation sensor errors, communication delays, heterogenous vehicle masses and driving modes were considered. Results show that localisation errors have negligible influences on space errors. Aggressive driving and heterogeneous vehicle masses slightly increase space errors (increases less than 0.23 m). Communication delays are the greatest influencer for space errors. Increases for 15, 45 and 75 ms delays were 0.43, 1.41 and 2.41 m, respectively. It is further shown that parallel computing can improve the computing speed by three times on personal computers and seven to 12 times on workstations.

关键词： Autonomous platoon parallel computing scalable platooning control vehicle dynamics model validation

来源：评论

学校读者我要写书评

暂无评论

A sparse domain decomposition method for parallel computing of a four-dimensional lattice spring model

引用

INTERNATIONAL JOURNAL FOR NUMERICAL AND ANALYTICAL METHODS IN GEOMECHANICS 2021年第17期45卷 2581-2601页

作者： Fu, Meng Zhao, Gao-Feng Tianjin Univ Sch Civil Engn Tianjin 300072 Peoples R China

In this work, an improved domain decomposition method is developed to address workload imbalance when implementing the parallel computing of a four-dimensional lattice spring model (4D-LSM) to solve problems in rock engineering on a large scale. A cubic domain decomposition scheme is adopted and optimized by a simulated annealing algorithm (SAA) to minimize the workload imbalance among subdomains. The improved domain decomposition method is implemented in the parallel computing of the 4D-LSM. Numerical results indicate that the proposed domain decomposition method can further improve the workload balance among processors, which is helpful to supersede the limit of computational scale when solving large-scale geotechnical problems and decrease the runtime of the parallel 4D-LSM by at most 40% compared to the original cubic decomposition method. This shows the practicability of the proposed method in parallel computing. Two types of target functions of SAA are tested, and their influence on the performance of the parallel 4D-LSM is investigated. Finally, a computational model with one billion particles for one actual engineering application of using 4D-LSM is realized, and the result shows the advantages of parallel computing.

关键词： domain decomposition lattice spring model parallel computing simulated annealing algorithm

来源：评论

学校读者我要写书评

暂无评论

Theoretical Basis of Mathematical Apparatus for parallel computing Implementation in Computer-Aided Design Systems

引用

PROGRAMMING AND COMPUTER SOFTWARE 2024年第5期50卷 335-342页

作者： Konopatskiy, E. Nizhny Novgorod State Univ Architecture & Civil En 65 Iljinskaya st Nizhnii Novgorod 603000 Russia

The purpose of this work is to develop a mathematical apparatus and computational algorithms for implementation of parallel computing in geometric modeling and computer-aided design (CAD) systems. The analysis of existing approaches to parallel computing implementation in CAD systems is carried out. As a result, it is found that most information modeling and CAD systems do not support parallel computing at the level of the geometric kernel. A concept for the development of a CAD geometric kernel based on the invariants of parallel projection of geometric objects onto the axes of the global coordinate system is proposed. It combines the potential of constructive methods for geometric modeling, capable of parallelizing geometric constructions by tasks (message passing), and the mathematical apparatus of point calculus, capable of parallelization by data through coordinate-by-coordinate calculation (data parallel). The use of the coordinate-by-coordinate calculation for point equations not only makes it possible to parallelize computations along coordinate axes, but also ensures the consistency of computational operations with respect to threads, which significantly reduces the idle time and optimizes the CPU operation to achieve the maximum effect from the use of parallel computing.

关键词： CAD mathematical apparatus parallel computing point calculus coordinate-by-coordinate calculation coordinate vector invariants of parallel projection and hidden parallelism

来源：评论

学校读者我要写书评

暂无评论

Topology optimization of 2D continua for minimum compliance using parallel computing

引用

STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION 2006年第2期32卷 121-132页

作者： Mahdavi, A. Balaji, R. Frecker, M. Mockensturm, E. M. Penn State Univ Dept Mech & Nucl Engn University Pk PA 16802 USA

Topology optimization is often used in the conceptual design stage as a preprocessing tool to obtain overall material distribution in the solution domain. The resulting topology is then used as an initial guess for shape optimization. It is always desirable to use fine computational grids to obtain high-resolution layouts that minimize the need for shape optimization and postprocessing (Bendsoe and Sigmund, Topology optimization theory, methods and applications. Springer, Berlin Heidelberg New York 2003), but this approach results in high computation cost and is prohibitive for large structures. In the present work, parallel computing in combination with domain decomposition is proposed to reduce the computation time of such problems. The power law approach is used as the material distribution method, and an optimality criteria-based optimizer is used for locating the optimum solution [Sigmund (2001)21:120-127;Rozvany and Olhoff, Topology optimization of structures and composites continua. Kluwer, Norwell 2000]. The equilibrium equations are solved using a preconditioned conjugate gradient algorithm. These calculations have been done using a master-slave programming paradigm on a coarse-grain, multiple instruction multiple data, shared-memory architecture. In this study, by avoiding the assembly of the global stiffness matrix, the memory requirement and computation time has been reduced. The results of the current study show that the parallel computing technique is a valuable tool for solving computationally intensive topology optimization problems.

关键词： topology optimization parallel computing finite element analysis MPI SIMP domain decomposition

来源：评论

学校读者我要写书评

暂无评论

Research and application of the parallel computing method for the grid-based Xin'anjiang model

引用

HYDROLOGY RESEARCH 2023年第4期54卷 591-605页

作者： Liu, Qian Wan, Dingsheng Yu, Yufeng Zhang, Yangming Hohai Univ Coll Comp & Informat Nanjing 211100 Jiangsu Peoples R China Bank Nanjing Nanjing 211100 Jiangsu Peoples R China

The grid-based Xin'anjiang model (GXM) has been widely applied to flood forecasting. However, when the model warm-up period is long and the amount of input data is large, the computational efficiency of the GXM is obviously low. Therefore, a GXM parallel algorithm based on grid flow direction division is proposed from the perspective of spatial parallelism, which realizes the parallel computing of the GXM by extracting the parallel routing sequence of the watershed grids. To solve data skew, a DAG scheduling algorithm based on dynamic priority is proposed for task scheduling. The proposed GXM parallel algorithm is verified in the Qianhe River watershed of Shaanxi Province and the Tunxi watershed of Anhui Province. The results show that the GXM parallel algorithm based on grid flow direction division has good flood forecasting accuracy and higher computational efficiency than the traditional serial computing method. In addition, the DAG scheduling algorithm can effectively improve the parallel efficiency of the GXM.

关键词： distributed hydrologic model flood forecasting grid-based Xin'anjiang model parallel computing

来源：评论

学校读者我要写书评

暂无评论

A dynamic texture based segmentation method for ultrasound images with Surfacelet, HMT and parallel computing

引用

MULTIMEDIA TOOLS AND APPLICATIONS 2019年第5期78卷 5381-5401页

作者： Cai, Bo Ye, Wei Zhao, Jianhui Wuhan Univ Sch Comp Sci Wuhan 430072 Hubei Peoples R China

To segment regions of interest (ROIs) from ultrasound images, one novel dynamic texture based algorithm is presented with surfacelet transform, hidden Markov tree (HMT) model and parallel computing. During surfacelet transform, the image sequence is decomposed by pyramid model, and the 3D signals with high frequency are decomposed by directional filter banks. During HMT modeling, distribution of coefficients is described with Gaussian mixture model (GMM), and relationship of scales is described with scale continuity model. From HMT parameters estimated through expectation maximization, the joint probability density is calculated and taken as feature value of image sequence. Then ROIs and non-ROIs in collected sample videos are used to train the support vector machine (SVM) classifier, which is employed to identify the divided 3D blocks from input video. To improve the computational efficiency, parallel computing is implemented with multi-processor CPU. Our algorithm has been compared with the existing texture based approaches, including gray level co-occurrence matrix (GLCM), local binary pattern (LBP), Wavelet, for ultrasound images, and the experimental results prove its advantages of processing noisy ultrasound images and segmenting higher accurate ROIs.

关键词： Dynamic texture Surfacelet transform HMT model parallel computing Ultrasound images

来源：评论

学校读者我要写书评

暂无评论

parallel computing methods for analyzing gene expression relationships

Parallel computing methods for analyzing gene expression rel...

引用

Conference on Microarrays - Optical Technologies and Informatics

作者： Suh, EB Dougherty, ER Kim, S Russ, DE Martino, RL NIH Ctr Informat Technol Bethesda MD 20892 USA

ISBN: (纸本)0819439444

This paper presents a parallel program for assessing the codetermination of gene transcriptional states from large-scale simultaneous gene expression measurements with cDNA microarrays. The parallel program is based on a nonlinear statistical framework recently proposed for the analysis of gene interaction via multivariate expression arrays. parallel computing is key in the application of the statistical framework to a large set of genes because a prohibitive amount of computer time is required on a classical single-CPU machine. Our parallel program, named the parallel Analysis of Gene Expression (PAGE) program, exploits inherent parallelism exhibited in the proposed codetermination prediction models. By running PAGE on 64 processors in Beowulf, a clustered parallel system, an analysis of melanoma cDNA microarray expression data has been completed within 12 days of computer time, an analysis that would have required about one and half years on a single-CPU computing system. A data visualization program, named the Visualization of Gene Expression (VOGE) program, has been developed to help interpret the massive amount of quantitative information produced by PAGE. VOGE provides graphical data visualization and analysis tools with filters, histograms, and accesses to other genetic databanks for further analyses of the quantitative information.

关键词： parallel computing data visualization coefficient of determination gene expression cDNA microarray

来源：评论

学校读者我要写书评

暂无评论

Teaching parallel computing concepts with a desktop computer

引用

INTERNATIONAL JOURNAL OF ELECTRICAL ENGINEERING EDUCATION 2004年第2期41卷 113-125页

作者： Fung, YF Ercan, MF Chong, YS Ho, TK Cheung, WL Singh, G Hong Kong Polytech Univ Dept Elect Engn Hong Kong Hong Kong Peoples R China Singapore Polytech Sch Elect & Elect Engn Singapore Singapore

parallel computing is currently used in many engineering problems. However, because of limitations in curriculum design, it is not always possible to offer students specific formal teaching in PF this topic. Furthermore, parallel machines are still too expensive for many institutions. The latest microprocessors, such as Intel's Pentium III and IV, embody single instruction multiple-data (SIMD) type parallel features, which makes them a viable solution for introducing parallel computing concepts to students. Final year projects have been initiated utilizing SSE (streaming SIMD extensions) features and it has been observed that students can easily learn parallel programming concepts after going through some programming exercises. They can now experiment with parallel algorithms on their own PCs at home.

关键词： electrical engineering parallel computing SIMD paradigm

来源：评论

学校读者我要写书评

暂无评论

Dynamic evaluation strategy for fine-grain data-parallel computing

引用

IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES 1996年第3期143卷 181-188页

作者： Muchnick, VB Shafarenko, AV Department of Electronic and Electrical Engineering University of Surrey Guildford United Kingdom

The placement of elemental operations (as opposed to data) of a data-driven data-parallel computation in a network of processors is examined. A fast suboptimal algorithm is proposed for such placement which tends to examined. A fast suboptimal algorithm is proposed for such placement which tends to minimise the overall network load when the computation is essentially nonlocal. The cases of grid, torus and hypercube topology are considered. It is shown that the proposed algorithm, while having moderate computational complexity, demonstrates up to a 50% reduction in required network throughput over some straightforward placement schemes in the practical range of network sizes.

关键词： parallel computing computational networks data-parallel computation

来源：评论

学校读者我要写书评

暂无评论

Efficient Ranking and Selection in parallel computing Environments

引用

OPERATIONS RESEARCH 2017年第3期65卷 821-836页

作者： Ni, Eric C. Ciocan, Dragos F. Henderson, Shane G. Hunter, Susan R. Cornell Univ Sch Operat Res & Informat Engn Ithaca NY 14853 USA INSEAD Technol & Operat Management F-77300 Fontainebleau France Purdue Univ Sch Ind Engn W Lafayette IN 47907 USA

The goal of ranking and selection (R&S) procedures is to identify the best stochastic system from among a finite set of competing alternatives. Such procedures require constructing estimates of each system's performance, which can be obtained simultaneously by running multiple independent replications on a parallel computing platform. Nontrivial statistical and implementation issues arise when designing R&S procedures for a parallel computing environment. We propose several design principles for parallel R&S procedures that preserve statistical validity and maximize core utilization, especially when large numbers of alternatives or cores are involved. These principles are followed closely by our parallel Good Selection Procedure (GSP), which, under the assumption of normally distributed output, (i) guarantees to select a system in the indifference zone with high probability, (ii) in tests on up to 1,024 parallel cores runs efficiently, and (iii) in an example uses smaller sample sizes compared to existing parallel procedures, particularly for large problems (over 106 alternatives). In our computational study we discuss three methods for implementing GSP on parallel computers, namely the Message-Passing Interface (MPI), Hadoop MapReduce, and Spark, and show that Spark provides a good compromise between the efficiency of MPI and robustness to core failures.

关键词： ranking and selection parallel computing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：