In this paper we present new results on sequential and parallel construction of optimal and almost-optimal length-restricted prefix-free codes. We show that length-restricted prefix-free codes with error 1/n^k for any k > 0 can be constructed in O(n log n) time, or in O(log n) time with n CREW processors. A length-restricted code with error 1/n^k for any k ≤ L/log_Φ n, where Φ = (1 + √5)/2, can be constructed in O(log n) time with n/log n CREW processors. We also describe an algorithm for the construction of optimal length-restricted codes with maximum codeword length L that works in O(L) time with n CREW processors.
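The abstract states only complexity bounds. For context, the sketch below shows one standard sequential way to compute optimal length-restricted codeword lengths, the package-merge algorithm of Larmore and Hirschberg, which runs in O(nL) time; it is not the paper's parallel algorithm, and the function name and interface are illustrative only.

```python
from collections import Counter

def length_limited_code_lengths(weights, L):
    """Optimal codeword lengths l_i <= L minimizing sum(w_i * l_i), computed
    with the package-merge algorithm (Larmore-Hirschberg).  Sequential
    O(n*L)-time sketch, not the paper's parallel algorithm."""
    n = len(weights)
    if n == 0:
        return []
    if n == 1:
        return [1]
    if 2 ** L < n:
        raise ValueError("need 2**L >= n for a prefix-free code on n symbols")

    def symbol_coins():
        # one coin per symbol at the current level: (weight, symbol multiset)
        return [(w, Counter({i: 1})) for i, w in enumerate(weights)]

    packages = []                      # packages carried up from the level below
    for _ in range(L - 1):             # process levels L, L-1, ..., 2
        level = sorted(symbol_coins() + packages, key=lambda c: c[0])
        packages = [(level[j][0] + level[j + 1][0], level[j][1] + level[j + 1][1])
                    for j in range(0, len(level) - 1, 2)]

    # At level 1, keep the 2n-2 cheapest items; every symbol coin contained in
    # a selected item adds one bit to that symbol's codeword length.
    top = sorted(symbol_coins() + packages, key=lambda c: c[0])[:2 * n - 2]
    lengths = Counter()
    for _, members in top:
        lengths += members
    return [lengths[i] for i in range(n)]
```

For example, weights [1, 1, 1, 8] with L = 2 yield lengths [2, 2, 2, 2], whereas L = 3 allows the unrestricted optimum [3, 3, 2, 1].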
Authors: THOMPSON, C. D., Division of Computer Science, University of California
This paper surveys nine designs for VLSI circuits that compute N-element Fourier transforms. The largest of the designs requires O(N^2 log N) units of silicon area; it can start a new Fourier transform every O(log N) time units. The smallest designs have about 1/Nth of this throughput, but they require only 1/Nth as much area.
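As a software reference point only: the circuits surveyed compute the N-element discrete Fourier transform, and a textbook radix-2 recursion such as the sketch below (not any of the nine VLSI designs) defines that computation for N a power of two.

```python
import cmath

def fft(x):
    """Radix-2 Cooley-Tukey FFT of a sequence whose length is a power of two.
    This defines the N-element transform the surveyed circuits compute; it
    says nothing about their silicon area or throughput."""
    n = len(x)
    if n == 1:
        return list(x)
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out
```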
This paper presents several deterministic algorithms for selecting the kth largest record from a set of n records on any n-node hypercubic network. All of the algorithms are based on the selection algorithm of Cole and Yap, as well as on various sorting algorithms for hypercubic networks. Our fastest algorithm runs in O(lg n lg* n) time, very nearly matching the trivial Ω(lg n) lower bound. Previously, the best upper bound known for selection was O(lg n lg lg n). A key subroutine in our O(lg n lg* n)-time selection algorithm is a sparse version of the Sharesort algorithm that sorts n records using p processors, p ≥ n, in O(lg n (lg lg p - lg lg(p/n))^2) time.
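The abstract does not describe the selection procedure itself. As a baseline, here is the classic sequential deterministic selection by medians of medians; the paper's contribution is running this style of pruning, together with sorting subroutines such as sparse Sharesort, on an n-node hypercubic network, which the sketch does not model. Selecting the kth largest is symmetric (select the (n-k+1)th smallest).

```python
def select_kth_smallest(records, k):
    """Deterministic sequential selection (median of medians), 1-indexed.
    Baseline sketch only; it does not model a hypercubic network."""
    assert 1 <= k <= len(records)
    if len(records) <= 5:
        return sorted(records)[k - 1]
    groups = [sorted(records[i:i + 5]) for i in range(0, len(records), 5)]
    medians = [g[len(g) // 2] for g in groups]
    pivot = select_kth_smallest(medians, (len(medians) + 1) // 2)
    lows = [r for r in records if r < pivot]
    highs = [r for r in records if r > pivot]
    num_pivots = len(records) - len(lows) - len(highs)
    if k <= len(lows):
        return select_kth_smallest(lows, k)
    if k <= len(lows) + num_pivots:
        return pivot
    return select_kth_smallest(highs, k - len(lows) - num_pivots)
```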
My research focuses on methods to analyze and mine large datasets as well as their practical realizations and applications. The key question of interest to me is: How can we effectively and efficiently distill useful information from large, complex, and potentially noisy datasets? To approach this question, we are developing systems for scalable data analysis and data mining, for working with incomplete and noisy data, for data-intensive optimization, as well as for extracting structured information from natural-language text. This article highlights some of my work in these areas.
In this work we present a parallel algorithm for the Concurrent Atomistic Continuum (CAC) formulation that can be integrated into existing molecular dynamics codes. The CAC methodology is briefly introduced and its parallel implementation in LAMMPS is detailed and then demonstrated through benchmarks that compare CAC simulation results with corresponding all-MD (molecular dynamics) results. The parallel efficiency of the algorithm is demonstrated when simulating systems represented by both atoms and finite elements. The verification benchmarks include dynamic crack propagation and branching in a Si single crystal, wave propagation and scattering in a Si phononic crystal, and phonon transport through the phase interface in a PbTe/PbSe heteroepitaxial system. In each of these benchmarks the CAC algorithm is shown to be in good agreement with MD-only models. This parallel CAC algorithm thus offers one of the first scalable multiscale material simulation methodologies that relies solely on atomic-interaction models.
This article points out that our parallel algorithm provides the maximum planar subgraph, and it is compared with the maximal planar subgraph provided by Jayakumar et al. in the above paper. The space-time product complexity is also compared.
We discuss parallel algorithms to compute the ghost layer in computational, distributed memory, recursively adapted meshes. Its creation is a fundamental, necessary task in executing most parallel, element-based computer simulations. Common methods differ in that the ghost layer may either be inherently part of the mesh data structure that is maintained and modified, or kept separate and constructed/deleted as needed. In this work, we present a design following the latter approach, which we chose for its modularity of algorithms and data structures. We target arbitrary adaptive, nonconforming forest-of-trees meshes of mixed element shapes, such as cubes, prisms, and tetrahedra, and restrict ourselves to ghost elements across mesh faces. Our algorithm has low code complexity and redundancy since we reduce it to generic codimension-1 subalgorithms that can be flexibly combined. We recover older algorithms for cubic elements as special cases and optimize further using recursive, amortized tree searches and traversals.
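To make the object concrete, the toy sketch below computes a face ghost layer for one partition under the simplifying assumption that element ownership and face adjacency are already available as plain dictionaries (a hypothetical interface). The paper's actual algorithms operate on adaptive, nonconforming forest-of-trees meshes via recursive tree searches, none of which is modeled here.

```python
def face_ghost_layer(owned, face_neighbors, owner_of):
    """Face ghost layer of one partition: every remotely owned element that
    shares a face with a locally owned element, grouped by owning rank.
    `owned` is the set of locally owned element ids, `face_neighbors` maps an
    element id to the ids of its face-adjacent elements, and `owner_of` maps
    an element id to its rank (hypothetical, simplified mesh interface)."""
    ghosts = {}
    for elem in owned:
        for nbr in face_neighbors[elem]:
            if nbr not in owned:
                ghosts.setdefault(owner_of[nbr], set()).add(nbr)
    return ghosts  # rank -> ghost element ids to request from that rank
```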
Background: Alignment-free methods are a popular approach for comparing biological sequences, including complete genomes. The methods range from probability distributions of sequence composition to first and higher-order Markov chains, where a k-th order Markov chain over DNA has 4^k formal parameters. To circumvent this exponential growth in parameters, variable-length Markov chains (VLMCs) have gained popularity for applications in molecular biology and other areas. VLMCs adapt the depth depending on sequence context and thus curtail excesses in the number of parameters. The scarcity of fast, or even parallel, software tools prompted the development of a parallel implementation using lazy suffix trees and a hash-based alternative. Results: An extensive evaluation was performed on genomes ranging from 12 Mbp to 22 Gbp. Relevant learning parameters were chosen guided by the Bayesian Information Criterion (BIC) to avoid over-fitting. Our implementation greatly improves upon the state-of-the-art even in serial execution. It exhibits very good parallel scaling, with speed-ups for long sequences close to the optimum indicated by Amdahl's law (about 3 for 4 threads and about 6 for 16 threads). Conclusions: Our parallel implementation, released as open source under the GPLv3 license, provides a practically useful alternative to the state-of-the-art and allows the construction of VLMCs even for very large genomes significantly faster than previously possible. Additionally, our parameter selection based on BIC gives guidance to end users comparing genomes.
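To illustrate why variable-length contexts keep the parameter count manageable, the toy sketch below counts DNA contexts up to a maximum depth and keeps only those seen often enough. Real VLMC training, including the paper's implementation over lazy suffix trees, prunes with statistical criteria such as Kullback-Leibler divergence or BIC rather than a raw count threshold; the function is illustrative only.

```python
from collections import Counter

def frequent_contexts(sequence, max_depth, min_count):
    """Count all substrings (contexts) of length 1..max_depth and keep those
    occurring at least min_count times.  A full k-th order Markov chain over
    DNA would need 4**k contexts; pruning rare ones is the idea behind VLMCs,
    although real trainers prune with statistical criteria, not raw counts."""
    counts = Counter()
    for depth in range(1, max_depth + 1):
        for i in range(len(sequence) - depth + 1):
            counts[sequence[i:i + depth]] += 1
    return {ctx: c for ctx, c in counts.items() if c >= min_count}
```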
The fastest known NC algorithm for maximal matching is a simple reduction to Luby's NC algorithm for maximal independent sets. It runs in O(log^2 n) time using m^3 Δ processors on an EREW PRAM. In this paper, we present an algorithm that finds a maximal matching of a given n-vertex m-edge graph with maximum degree Δ in O((Δ^2/p) · log^2 n) time using p · (m + n)/log n processors on an EREW PRAM, for any 1 ≤ p ≤ Δ^2. In particular, for p = Δ^2 our algorithm has the same running time as the algorithm of Luby but uses a much smaller number of processors (Δ^2 · (m + n)/log n instead of m^3 Δ).
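For reference, a maximal matching (not to be confused with a maximum matching) is trivial to compute sequentially; the sketch below fixes the object the PRAM algorithm computes but captures none of the processor/time trade-off that is the paper's contribution.

```python
def greedy_maximal_matching(edges):
    """Sequential greedy maximal matching: keep an edge whenever both of its
    endpoints are still unmatched.  The result is maximal (no further edge
    can be added) but not necessarily maximum."""
    matched = set()
    matching = []
    for u, v in edges:
        if u not in matched and v not in matched:
            matching.append((u, v))
            matched.update((u, v))
    return matching
```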
This paper considers the problem of scheduling dynamic parallel computations to achieve linear speedup without using significantly more space per processor than that required for a single-processor execution. Utilizing a new graph-theoretic model of multithreaded computation, execution efficiency is quantified by three important measures: T_1 is the time required to execute the computation on one processor, T_∞ is the time required by an infinite number of processors, and S_1 is the space required to execute the computation on one processor. A computation executed on P processors is time-efficient if the time is O(T_1/P + T_∞), that is, it achieves linear speedup when P = O(T_1/T_∞), and it is space-efficient if it uses O(S_1 P) total space, that is, the space per processor is within a constant factor of that required for a one-processor execution. The first result derived from this model shows that there exist multithreaded computations such that no execution schedule can simultaneously achieve efficient time and efficient space. But by restricting attention to "strict" computations (those in which all arguments to a procedure must be available before the procedure can be invoked), much more positive results are obtainable. Specifically, for any strict multithreaded computation, a simple online algorithm can compute a schedule that is both time-efficient and space-efficient. Unfortunately, because the algorithm uses a global queue, the overhead of computing the schedule can be substantial. This problem is overcome by a decentralized algorithm that can compute and execute a P-processor schedule online in expected time O(T_1/P + T_∞ lg P) and worst-case space O(S_1 P lg P), including overhead costs.
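The measures T_1 and T_∞ are properties of the computation DAG and are easy to state in code; the sketch below computes them for a small DAG given per-node costs (a hypothetical interface). It only fixes the quantities the bounds O(T_1/P + T_∞) and O(S_1 P) refer to; the scheduling algorithms themselves, and the space measure S_1, are not modeled.

```python
def work_and_span(children, cost):
    """T_1 (total work) and T_infinity (critical-path length) of a computation
    DAG.  `children` maps each node to the nodes that depend on it and must be
    acyclic; `cost` gives each node's execution time."""
    t1 = sum(cost[v] for v in children)
    memo = {}

    def longest_path_from(v):
        if v not in memo:
            memo[v] = cost[v] + max((longest_path_from(c) for c in children[v]),
                                    default=0)
        return memo[v]

    t_inf = max(longest_path_from(v) for v in children)
    return t1, t_inf
```

With a greedy schedule on P processors, such a computation finishes within T_1/P + T_∞ steps, which is what "time-efficient" means in the abstract.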