检索结果-内蒙古大学图书馆

Optimal parallel preprocessing algorithms for testing weak visibility of polygons from segments

parallel algorithms and Applications 1993年第2期1卷 83-98页

作者： Hsu, F.R. Chang, R.C. Lee, R.C.T. Institute of Computer Science and Information Engineering National Chiao Tung University Hsinchu 300 Taiwan Department of Computer Science National Tsing Hua University Hsinchu 300 Taiwan

For an n-gon P, we say P is weakly visible from segment s if any point on P is visible from at least one point of the segment. In this paper, we present an optimal preprocessing algorithm which runs in O(log n) time using O(n) processors under the concurrent read exclusive write parallel random access machine model such that after preprocessing, it takes O(log n) time to test if P is weakly visible from a given segment using a single processor. ©1993 Gordon and Breach Science Publishers S.A. All right reserved. © 1993, Taylor & Francis Group, LLC. All rights reserved.

关键词： Computational geometry CREW parallel algorithm Polygon Visibility

来源：评论

学校读者我要写书评

暂无评论

Optimization and Application of Tight-Binding Molecular Dynamics

Optimization and Application of Tight-Binding Molecular Dyna...

引用

2011 International Conference on Computers, Communications, Control and Automation

作者： ZHANG Yuqian LI Yu-chen Yao Ge National Lab.of Solid State Microstructures and Dept.of Physics Nanjing Univ.

ISBN: (纸本)9781612841021

In this paper, we briefly introduce the basic theory and method of tight-binding molecular dynamics(TBMD), and study the quantum oscillation of graphene at about the absolute zero Kelvin. By using the TBMD method and parallel program to simulate the graphene and analyzing the simulated results, we propose some improvements on computing the forces by perturbation and sparse matrix method.

关键词： tight-binding molecular dynamics graphene, TBMD program parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

Využití grafického procesoru jako akcelerátoru - technologie OpenCL

Využití grafického procesoru jako akcelerátoru - technol...

引用

作者： Hrubý, Michal Brno University of Technology

Tato práce se zabývá technologií OpenCL a jejím využitím pro detekci objektů. První část je zaměřená na popis principů technologie OpenCL a základní teorii ... 详细信息

Tato práce se zabývá technologií OpenCL a jejím využitím pro detekci objektů. První část je zaměřená na popis principů technologie OpenCL a základní teorii o detekci objektů. Následuje kapitola analýzy, kde je navržená metoda zpracování s přihlédnutím na možnosti OpenCL. Další část popisuje samotnou implementaci detekční aplikace a experimentálně vyhodnocuje výkon detektoru. Poslední kapitola shrnuje dosažené výsledky.

关键词： OpenCL grafická karta detekce objektů klasifikátor AdaBoost WaldBoost Local Binary Patterns paralelní algoritmus kernel OpenCL graphics card object detection classifier AdaBoost WaldBoost Local Binary Patterns parallel algorithm kernel T

来源：评论

学校读者我要写书评

暂无评论

Large-scale nonlinear parallel computations by perturbed functional iterations

引用

parallel algorithms and Applications 1994年第3-4期3卷 211-226页

作者： Dey, S.K. Institute for Numerical Analysis The Technical University of Denmark DK-2800 Lyngby Building 305 Denmark

In this work PFI (Perturbed Functional Iterations) has been extended to solve large-scale nonlinear models by applying parallel computations. PFI partially linearizes a given nonlinear system, and irrespective of the physical dimension of the model it solves in parallel a sequence of linear equations of significantly smaller order to compute perturbation parameters and adds them in parallel to nonlinear Jacobi iterations to compute new iterates. As convergence is approached all linearizations are damped out, restoring thereby nonlinear properties of the model near the root. This generates a high degree of accuracy. In comparison with other nonlinear algorithms, PFI has a simple algorithm which is easy to program. Some applications have produced encouraging results. © 1994, Taylor & Francis Group, LLC. All rights reserved.

关键词： Mathematics of computing parallel algorithm parallel programming

来源：评论

学校读者我要写书评

暂无评论

Fast parallel computation of the jordan normal form of matrices

引用

parallel Processing Letters 1996年第2期6卷 203-212页

作者： Roch, Jean-Louis Institut IMAG Laboratoire LMC 38031 Grenoble Cedex 46 Av. F. Viallet France

We establish that the problem of computing the Jordan normal form of a matrix over a field F is in NCF3 for F being a field of characteristic zero or a finite field. © World Scientific Publishing Company.

We establish that the problem of computing the Jordan normal form of a matrix over a field F is in NC_F³ for F being a field of characteristic zero or a finite field. © World Scientific Publishing Company.

关键词： Invariant factors Jordan normal form Ncf3 parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

CudaCHPre2D: A straightforward preprocessing approach for accelerating 2D convex hull computations on the GPU

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2020年第10期32卷 e5229-e5229页

作者： Qin, Jiayu Mei, Gang Cuomo, Salvatore Guo, Sixu Li, Yixuan China Univ Geosci Beijing Sch Engn & Technol Beijing Peoples R China Univ Naples Federico II Dept Math & Applicat R Caccioppoli Naples Italy

An effective strategy for accelerating the calculation of convex hulls is to filter the input points by discarding interior points. In this paper, we present such a straightforward preprocessing approach by discarding the points locating in a convex polygon formed by 16 extreme points. Extreme points of a planar point set do not alter when all points are rotated with the same angle in the plane. Four groups of four extreme points with min or max x or y coordinates can be found for the original point set and three rotated point sets. These 16 extreme points are used to form a planar convex polygon. We discard those points locating in the convex polygon and calculate the desired convex hull of the remaining points. The proposed preprocessing algorithm is evaluated on two computational platforms. Experiments show that, when employing the proposed preprocessing algorithm on the computational platform 1, it achieves speedups of approximately 4 x similar to 5x on average and 5 x similar to 6x in the best cases over the cases where the proposed approach is not used, while on the computational platform 2, the speedups are approximately 6 x similar to 9x on average and 9 x similar to 14x in the best cases. Moreover, more than 99% input points can be discarded in most cases.

关键词： computational geometry convex hull Graphics Processing Unit (GPU) parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

parallelization of a branch-and-bound algorithm for the maximum weight clique problem

引用

DISCRETE OPTIMIZATION 2021年 41卷 100646-100646页

作者： Shimizu, Satoshi Yamaguchi, Kazuaki Masuda, Sumio Kobe Univ Grad Sch Engn Kobe Hyogo Japan

In this paper, parallelization techniques are proposed for the branch-and-bound algorithm OTClique for the maximum weight clique problem. OTClique consists of the precomputation phase and the branch-and-bound phase. The proposed algorithm parallelizes both of them. In the precomputation phase, the construction of optimal tables is parallelized. In the branch-and-bound phase, the proposed algorithm generates small subproblems and assigns them to threads. A technique to share lower and upper bounds is also proposed. Experiments using some benchmarks show that the proposed parallelization techniques improve the performance of OTClique. With an 8-core CPU, the computation time of OTClique becomes 6.91 times shorter on random graphs and 5.38 times on DIMACS benchmarks on average. (C) 2021 Elsevier B.V. All rights reserved.

关键词： Maximum weight clique Exact algorithm Branch-and-bound parallel algorithm NP-hard

来源：评论

学校读者我要写书评

暂无评论

Efficient algorithms for discovering high-utility patterns with strong frequency affinities

引用

EXPERT SYSTEMS WITH APPLICATIONS 2021年 169卷 114464-114464页

作者： Nhan Vuong Bac Le Tin Truong Duy-Phuong Nguyen Univ Food Ind Fac Informat Technol Ho Chi Minh City Vietnam Univ Sci Fac Informat Technol Ho Chi Minh City Vietnam Vietnam Natl Univ Ho Chi Minh City Vietnam Ton Duc Thang Univ Inst Computat Sci Div Computat Math & Engn Ho Chi Minh City Vietnam Ton Duc Thang Univ Fac Math & Stat Ho Chi Minh City Vietnam Posts & Telecommun Inst Technol Dept Comp Sci Ho Chi Minh City Vietnam

In recent years, high-utility pattern mining has been studied extensively. However, most of these studies have addressed mining high-utility patterns (HUPs) without consideration for their frequencies, leading to the mining of meaningless HUPs. One of the approaches to solving this problem is to use HUP mining with strong affinity frequencies. In this paper, we propose two algorithms to discover HUPs with strong affinity frequencies: DHUPMiner (Discriminative High-Utility pattern Miner) and its parallel version, DHUP-Miner*. Several novel pruning strategies are applied to reduce the search space for potential DHUPs. Experimental results show that the proposed algorithms are faster than the state-of-the-art algorithm (FDHUP) for both sparse and dense benchmark datasets. Moreover, the parallel algorithm (DHUP-Miner*) was found to handle large datasets well.

关键词： High utility pattern mining Frequency affinity Discriminative DHUP-Miner DHUP-Miner parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

parallelIZATION OF A 3-DIMENSIONAL COMPRESSIBLE TRANSITION CODE

引用

AIAA JOURNAL 1990年第1期28卷 83-90页

作者： ERLEBACHER, G BOKHARI, SH HUSSAINI, MY UNIV ENGN & TECHNOL LAHOREPAKISTAN

The compressible, three-dimensional, time-dependent Navier-Stokes equations are solved on a 20 processor Flex/32 computer. The code is a parallel implementation of an existing code operational on the Cray-2 at NASA Ames, which performs direct simulations of the initial stages of the transition process of wall-bounded flow at supersonic Mach numbers. Spectral collocation in all three spatial directions (Fourier along the plate and Chebyshev normal to it) ensures high accuracy of the flow variables. By hiding most of the parallelism in low-level routines, the casual user is shielded from most of the nonstandard coding constructs. Speedups of 13 out of a maximum of 16 are achieved on the largest computational grids. © 1990 American Institute of Aeronautics and Astronautics, Inc., All rights reserved.

关键词： Freestream Mach Number Flow Variables Navier Stokes Equations NASA Langley Research Center FORTRAN parallel algorithm Incompressible Flow Continuity Equation Supercomputers Computing

来源：评论

学校读者我要写书评

暂无评论

Porting LASG/IAP Climate System Ocean Model to Gpus Using OpenAcc

引用

IEEE ACCESS 2019年 7卷 154490-154501页

作者： Jiang, Jinrong Lin, Pengfei Wang, Joey Liu, Hailong Chi, Xuebin Hao, Huiqun Wang, Yuzhu Wang, Wu Zhang, Linghan Univ Chinese Acad Sci Chinese Acad Sci Comp Network Informat Ctr Beijing 100190 Peoples R China NVIDIA Beijing 100004 Peoples R China Univ Chinese Acad Sci Chinese Acad Sci Inst Atmospher Phys LASG Beijing 100029 Peoples R China China Univ Geosci Beijing Sch Informat Engn Beijing 100083 Peoples R China Minist Nat Resources Key Lab Geol Informat Technol Beijing 100812 Peoples R China

GPUs have become important solutions for accelerating scientific applications. Most of the existing work on climate models now use code rewritten using CUDA to achieve a limited speedup. This restriction also greatly limits followup development and applications. In this paper, we designed and implemented a GPU-based acceleration of the LASG/IAP climate system ocean model (LICOM) version 2, called LICOM2-GPU. Considering the extremely large codebase of the model and the occasional need to modify the code, we implemented the model completely in OpenACC. Several accelerated methods, including OpenACC data locality optimization, loop optimization, and interprocess communication optimization are presented. Developing for GPUs using OpenACC is substantially simpler than using the CUDA port. Thus, the OpenACC is a suitable GPU programming model for complex systems, such as the earth system model and its components. Our experimental results using 4 NVIDIA K80 cards achieved up to a 6.6x speedup compared with 4 Intel(R) Xeon(R) CPU E5-2690 v2 GPUs.

关键词： High performance computing parallel algorithm GPU LICOM parallel acceleration

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：