检索结果-内蒙古大学图书馆

Cellular automata-based modelling and simulation of biofilm structure on multi-core computers

WATER SCIENCE AND TECHNOLOGY 2015年第11期72卷 2071-2081页

作者： Skoneczny, Szymon Cracow Univ Technol Dept Chem & Proc Engn Warszawska 24 PL-31155 Krakow Poland

The article presents a mathematical model of biofilm growth for aerobic biodegradation of a toxic carbonaceous substrate. Modelling of biofilm growth has fundamental significance in numerous processes of biotechnology and mathematical modelling of bioreactors. The process following double-substrate kinetics with substrate inhibition proceeding in a biofilm has not been modelled so far by means of cellular automata. Each process in the model proposed, i.e. diffusion of substrates, uptake of substrates, growth and decay of microorganisms and biofilm detachment, is simulated in a discrete manner. It was shown that for flat biofilm of constant thickness, the results of the presented model agree with those of a continuous model. The primary outcome of the study was to propose a mathematical model of biofilm growth;however a considerable amount of focus was also placed on the development of efficient algorithms for its solution. Two parallel algorithms were created, differing in the way computations are distributed. Computer programs were created using OpenMP Application Programming Interface for C++ programming language. Simulations of biofilm growth were performed on three high-performance computers. Speed-up coefficients of computer programs were compared. Both algorithms enabled a significant reduction of computation time. It is important, inter alia, in modelling and simulation of bioreactor dynamics.

关键词： biofilm structure cellular automata mathematical modelling parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Conditional Edge Fault-Tolerant Hamiltonian-Connected of Locally Twisted Cubes LTQn

Conditional Edge Fault-Tolerant Hamiltonian-Connected of Loc...

引用

International Conference on Network and Information Systems for Computers (ICNISC)

作者： Xirong Xu Hang Su Sijia Zhang Fan Wang School of Computer Science and Technology Dalian University of Technology Dalian P. R. China

ISBN: (纸本)9781467388399

The n-dimensional locally twisted cube LTQn is a variant of the hypercube, which possesses some properties superior to the hypercube. This paper investigates the conditional edge fault-tolerant Hamiltonian-connected of LTQn, and shows that for any n-dimensional locally twisted cube LTQn (n≥5) with faulty edges up to 2n-8 in which each vertex is incident to at least three fault-free edges, there exists a fault-free Hamiltonian path connecting any two vertices.

关键词： Hypercubes Fault tolerance Fault tolerant systems Joining processes parallel algorithms Information systems

来源：评论

学校读者我要写书评

暂无评论

A Study of Numerical Error Caused by parallelizing Complicated Continuous-Time Models with Dependence on Calculation Order

A Study of Numerical Error Caused by Parallelizing Complicat...

引用

第35届中国控制会议

作者： Kota Sata Shun-ichi Azuma Akira Ohata Toyota Motor Corporation Advanced Unit Management System Development Div.

The advancement of the engine control increases the amount of *** production ECU(Electronic Control Unit),which is made of single-core architecture,cannot have a higher clock *** multi- / many-core architecture is the only way to decrease execution ***,when implementing the engine control software,various problems occur in utilization of the multi- / many-core *** of the biggest problems is sequential structure of control software because the software can only execute with one core on the multi- / many-core *** purpose of this paper is to evaluate numerical error caused by parallelizing complicated models using the proposed parallelized control design method[1],which has decomposed sequential structure and decreases execution time in the embedded multi- / many-core production ECU,and evaluate the performance of each parallelized term.

关键词： automotive control control design execution times electronic control units internal combustion engine parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

PFMAP: Exploitation of Particle Filters for Network-on-Chip Mapping

引用

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS 2015年第10期23卷 2116-2127页

作者： Bayar, Salih Yurdakul, Arda Bogazici Univ Dept Comp Engn TR-34470 Istanbul Turkey

In this paper, we propose a mapping algorithm called particle filter mapping (PFMAP);PFMAP is able to map task nodes onto the cores of tile-based network-on-chip (NoC) architectures, such as regular, irregular, and custom 2-D or 3-D topologies. PFMAP is inspired from systematic resampling algorithm for particle filters, in which all particles can run parallel and independently from each other. Based upon the experimental results from applying PFMAP for various real life and synthetic applications onto the different topologies and architectures, the performance of the 2-D mesh architectures in terms of communication cost increased by up to 51% for irregular topologies, and by up to 31% for custom architectures. Similarly, total travel distance obtained by PFMAP is reduced by up to 45% for custom 2-D mesh architectures. In addition to these, average clock cycles per flit and total network power are reduced by up to 17% and 15% for regular 2-D mesh architectures, respectively. Finally, communication cost is diminished by up to 34% for 3-D regular NoC architectures.

关键词： Communication system traffic digital signal processing greedy algorithms multithreading network-on-chip parallel algorithms routing system-on-chip

来源：评论

学校读者我要写书评

暂无评论

Accelerating Molecular Structure Determination Based on Inter-Atomic Distances Using OpenCL

引用

IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2015年第12期26卷 3250-3263页

作者： Lorentz, Istvan Andonie, Razvan Fabry-Asztalos, Levente Splash Software Brasov City Romania Cent Washington Univ Dept Comp Sci Ellensburg WA USA Transilvania Univ Elect & Comp Dept Brasov Romania Cent Washington Univ Dept Chem Ellensburg WA USA

Fast and accurate determination of the 3D structure of molecules is essential for better understanding their physical, chemical, and biological properties. We focus on an existing method for molecular structure determination: restrained molecular dynamics with simulated annealing. In this method a hybrid function, composed by a physical model and experimental restraints, is minimized by simulated annealing. Our goal is to accelerate computation time using commodity multi-core CPUs and GPUs in a heterogeneous computing model. We present a parallel and portable OpenCL implementation of this method. Experimental results are discussed in terms of accuracy, execution time, and parallel scalability. With respect to the XPLOR-NIH professional software package, compared to the single CPU core implementation, we obtain speedups of three to five times (increasing with problem size) on commodity GPUs. We achieve these performances by writing specialized kernels for different problem sizes and hardware architectures.

关键词： Molecular structure determination NMR molecular dynamics GPU OpenCL simulated annealing parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

An optimal parallel algorithm for minimum spanning trees in planar graphs

An optimal parallel algorithm for minimum spanning trees in ...

引用

European Symposium on algorithms, ESA 2015

作者： Chong, Ka Wong Zaroliagis, Christos Department of Computer Science The University of Hong-Kong Porfulam Road Porfulam Hong Kong Department of Computer Engineering and Informatics University of Patras Patras26504 Greece Computer Technology Institute and Press ‘Diophantus’ N. Kazantzaki Str. Patras University Campus Patras26504 Greece

ISBN: (纸本)9783319240237

We present an optimal deterministic O(n)-work parallel algorithm for finding a minimum spanning tree on an n-vertex planar graph. The algorithm runs in O(log n) time on a CRCW PRAM and in O(log n log∗ n) time on an EREW PRAM. Our results hold for any sparse graph that is closed under taking of minors, as well as for a class of graphs with non-bounded genus. © Springer International Publishing Switzerland 2015.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Runtime verification with minimal intrusion through parallelism

引用

FORMAL METHODS IN SYSTEM DESIGN 2015年第3期46卷 317-348页

作者： Berkovich, Shay Bonakdarpour, Borzoo Fischmeister, Sebastian Blue Coat Syst Waterloo ON N2V 2G4 Canada McMaster Univ Dept Comp & Software Hamilton ON L8S 4L7 Canada Univ Waterloo Dept Elect & Comp Engn Waterloo ON N2L 3G1 Canada

Runtime verification is a monitoring technique to gain assurance about well-being of a program at run time. Most existing approaches use sequential monitors;i.e., when the state of the program with respect to an event of interest changes, the monitor interrupts the program execution, evaluates a set of logical properties, and finally resumes the program execution. In this paper, we propose a GPU-based method for design and implementation of monitors that enjoy two levels of parallelism: the monitor (1) works along with the program in parallel, and (2) evaluates a set of properties in a parallel fashion as well. Our parallel monitoring algorithms effectively exploit the many-core platform available in the GPU. In addition to parallel processing, our approach benefits from a true separation of monitoring and functional concerns, as it isolates the monitor in the GPU. Thus, our monitoring approach incurs minimal intrusion, as executing monitoring tasks take place in a different computing hardware from execution of the program under inspection. Our method is fully implemented for parametric and non-parametric 3-valued linear temporal logic. Our experimental results show significant reduction in monitoring overhead, monitoring interference, and power consumption due to leveraging the GPU technology. In particular, we observe that our parallel verification algorithms are indeed scalable.

关键词： Runtime monitoring parallel algorithms Temporal logic Formal methods

来源：评论

学校读者我要写书评

暂无评论

parallelization of Bin Packing on Multicore Systems

Parallelization of Bin Packing on Multicore Systems

引用

International Conference on High Performance Computing

作者： Sayan Ghosh Assefaw H. Gebremedhin School of Electrical Engineering and Computer Science Washington State University Pullman WA USA

ISBN: (纸本)9781509054121

We study effective parallelization of approximation algorithms for the one-dimensional bin packing problem on a multicore platform. Bin packing is a classic combinatorial optimization problem that aims to pack a given sequence of items into a minimum number of equal-sized bins. The problem potentially serves as a model for a wide variety of applications. Examples include: packing data into chunks in a memory hierarchy in a given system to increase application performance, loading vehicles subject to weight limitations, and packing TV commercials into station breaks. Bin packing has long served as a proving ground for the analysis of approximation algorithms and played a crucial role in the development of much of the theory of approximation algorithms. Its parallelization, however, has received comparatively much less attention. In this work, we develop multiple parallel versions of an effective approximation algorithm (First Fit Decreasing) for the problem and investigate the trade-off between solution quality and execution time. We use OpenMP and Cilk Plus as mechanisms for achieving the parallelization. The new parallel algorithms obtain a speedup of more than 10× (on 32 cores) for moderate to large input sequences without sacrificing much on the quality of solution produced by the sequential algorithm - in particular, we see only about 3 to 30% increase in the number of bins compared to the sequential version. In turn, the solution obtained by the sequential First Fit Decreasing algorithm is provably almost optimal (the approximation ratio is less than 1.3).

关键词： Approximation algorithms Algorithm design and analysis parallel algorithms Dynamic scheduling Heuristic algorithms Multicore processing Upper bound

来源：评论

学校读者我要写书评

暂无评论

Keynote 1

Keynote 1

引用

IEEE International Conference on Cluster Computing

作者： Torsten Hoefler

ISBN: (纸本)9781509036547

Summary form only given. We advocate the usage of mathematical models and abstractions in practical high-performance computing. For this, we show a series of examples and use-cases where the abstractions introduced by performance models can lead to clearer pictures of the core problems and often provide non-obvious insights. We start with models of parallel algorithms leading to close-to-optimal practical implementations. We continue our tour with distributed-memory programming models that provide various abstractions to application developers. A short digression on how to measure parallel systems shows common pitfalls of practical performance modeling. Application performance models based on such accurate measurements support insight into the resource consumption and scalability of parallel programs on particular architectures. We close with a demonstration of how mathematical models can be used to derive practical network topologies and routing algorithms. In each of these areas, we demonstrate newest developments but also point to open problems. All these examples testify to the value of modeling in practical high-performance computing. We assume that a broader use of these techniques and the development of a solid theory for parallel performance will lead to deep insights at many fronts.

关键词： resource allocation distributed memory systems parallel algorithms parallel programming

来源：评论

学校读者我要写书评

暂无评论

DNA Sequence Splicing Algorithm Based on Spark

DNA Sequence Splicing Algorithm Based on Spark

引用

International Conference on Industrial Informatics, Computing Technology, Intelligent Technology, Industrial Information Integration (ICIICII)

作者： Xu Pan Xue-Liang Fu Gai-Fang Dong Hong-Hui Li College of Computer and Information Engineering Inner Mongolia Agricultural University Hohhot China

ISBN: (纸本)9781509035762

Bioinformatics is a cross subject of biological information processing. DNA sequence splicing is one of its research content. At present, most parallel algorithms are based on the operating environment of MapReduce. There is a complex process for reading and writing to hard disk, which lead to inferiority that the speed of the algorithm will be slow. In this paper, Spark calculation model based on memory is proposed to solve the problem. At the same time, a new method of matching K-2 bit will be also used by us. Results of experiment show that the running environment based on Spark and the method can ensure accuracy of stitching results and make the algorithm more efficient.

关键词： Algorithm design and analysis Sparks DNA Splicing parallel algorithms Computers Hard disks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：