Search Results - Inner Mongolia University Library

1st Russian Conference on Supercomputing Days 2015, RuSCDays 2015

Author: Kharchenko, Sergey

The paper considers an algorithm for computing the sparse QR decomposition of a specially ordered rectangular matrix. The decomposition is based on block sparse Householder transformations. To order the computations, an ND-type ordering for the sparsity of the matrix A^T A can be used, where A is the original rectangular matrix. For mesh-based problems, the ordering can be constructed starting from an appropriate volume partitioning of the computational mesh. Parallel computations are based on sparse QR decomposition for sets of rows with an additional zero block at the beginning. The suggested algorithm is planned to be used as the main computational kernel in the parallel iterative algorithms, developed by the author, for solving systems of linear algebraic equations (SLAEs) and least squares problems. The corresponding algorithms will be based on composition of the subspaces represented by sparse bases.

Keywords: parallel algorithms

Fast parallel conversion of edge list to adjacency list for large-scale graphs

23rd High Performance Computing Symposium, HPC 2015, Part of the 2015 Spring Simulation Multi-Conference, SpringSim 2015

Authors: Arifuzzaman, Shaikh; Khan, Maleq

Affiliations: Department of Computer Science, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA 24061, United States; Network Dynamics and Simulation Science Lab, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA 24061, United States

ISBN: (print) 9781510810600

In the era of Big Data, we are deluged with massive graph data emerging from numerous social and scientific applications. In most cases, graph data are generated as lists of edges (edge lists), where an edge denotes a link between a pair of entities. However, most graph algorithms work efficiently when the adjacent nodes of each node (the adjacency list) are readily available. Although the conversion from edge list to adjacency list can be done trivially on the fly for small graphs, it becomes challenging for emerging large-scale graphs consisting of billions of nodes and edges. These graphs do not fit into the main memory of a single machine and thus require distributed-memory parallel or external-memory algorithms. In this paper, we present efficient MPI-based distributed-memory parallel algorithms for converting edge lists to adjacency lists. To the best of our knowledge, this is the first work on this problem. To address the critical load balancing issue, we present a parallel load balancing scheme that significantly improves both time and space efficiency. Our fast parallel algorithm works on massive graphs, achieves very good speedups, and scales to a large number of processors. The algorithm can convert an edge list of a graph with 20 billion edges to the adjacency list in less than 2 minutes using 1024 processors. Denoting the number of nodes, edges, and processors by n, m, and P, respectively, the time complexity of our algorithm is O(m/P + n + P), which provides a speedup factor of at least Ω(min{P, d_avg}), where d_avg is the average degree of the nodes. The algorithm has a space complexity of O(m/P), which is optimal. Copyright © 2015 Society for Modeling & Simulation International (SCS).
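The conversion described in this abstract can be illustrated with a minimal single-machine sketch in Python. It is not the authors' MPI algorithm and has no load balancing. The helpers `partition_edges` and `merge_partial` are hypothetical names that only mimic the idea of building partial adjacency lists from separate chunks of the edge list and then combining them; the graph is assumed to be undirected.

```python
from collections import defaultdict


def edge_list_to_adjacency(edges):
    """Build an undirected adjacency list from an iterable of (u, v) pairs."""
    adjacency = defaultdict(list)
    for u, v in edges:
        adjacency[u].append(v)
        adjacency[v].append(u)
    return {node: sorted(neighbors) for node, neighbors in adjacency.items()}


def partition_edges(edges, num_parts):
    """Hypothetical helper: split the edge list into num_parts chunks (round-robin)."""
    return [edges[i::num_parts] for i in range(num_parts)]


def merge_partial(partials):
    """Hypothetical helper: combine per-chunk adjacency lists into one."""
    merged = defaultdict(list)
    for partial in partials:
        for node, neighbors in partial.items():
            merged[node].extend(neighbors)
    return {node: sorted(neighbors) for node, neighbors in merged.items()}


edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
adj = edge_list_to_adjacency(edges)

# Each chunk could be handled by a separate worker; here they run one after another.
chunks = partition_edges(edges, 2)
adj_from_chunks = merge_partial([edge_list_to_adjacency(c) for c in chunks])
```

In this sketch the merge step is sequential; a distributed version would instead assign each node's neighbor list to one owning process.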

Keywords: parallel algorithms

Research on Linkage Disequilibrium Method Based on OpenMP

International Conference on Information Science and Control Engineering (ICISCE)

Authors: Jun Lu; Jun Li; Ruiqing Jing; Qiong Wang; Cheng Chang; Jiaxing Guo

Affiliations: College of Computer Science and Technology, Heilongjiang University, Harbin, China; Key Laboratory of Database and Parallel Computing of Heilongjiang Province, Harbin, China

ISBN: (print) 9781509025367

The linkage disequilibrium (LD) method is applied in research on population genetics inference, LD mapping, haplotype diversity analysis, and so on. Soybean genotypes are adopted as the data source, and a parallel linkage disequilibrium algorithm is implemented with OpenMP. In this algorithm, single nucleotide polymorphism (SNP) sites are divided into groups using sliding windows; the alleles at adjacent sites within a window of each chromosome are processed in parallel, and the LD results are stored. The serial and parallel algorithms are compared and analyzed on the experimental data. The results show that OpenMP parallelization can effectively improve the efficiency of the linkage disequilibrium analysis method, which is of practical significance for processing massive biological data.
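As a rough illustration of the windowed computation described above, the following Python sketch computes the common r² measure of linkage disequilibrium for site pairs inside a sliding window. It is a serial sketch, not the paper's OpenMP implementation. The 0/1 allele encoding, the choice of r² as the LD statistic, and the function names `r_squared` and `windowed_ld` are assumptions made for illustration.

```python
def r_squared(site_a, site_b):
    """r^2 between two biallelic sites given as equal-length lists of 0/1 alleles."""
    n = len(site_a)
    p_a = sum(site_a) / n
    p_b = sum(site_b) / n
    # Frequency of haplotypes carrying allele 1 at both sites.
    p_ab = sum(1 for a, b in zip(site_a, site_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    return d * d / denom if denom > 0 else 0.0


def windowed_ld(sites, window):
    """Return {(i, j): r^2} for all site pairs with 0 < j - i < window."""
    results = {}
    for i in range(len(sites)):
        for j in range(i + 1, min(i + window, len(sites))):
            results[(i, j)] = r_squared(sites[i], sites[j])
    return results


# Three sites observed on four haplotypes (toy data).
sites = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 1, 0, 1],
]
ld = windowed_ld(sites, window=2)
```

Each starting site `i` is independent of the others, which is what makes the outer loop a natural candidate for a parallel-for construct.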

Keywords: Biological cells; Couplings; Genetics; Instruction sets; Sociology; Statistics; parallel algorithms

Copper Sediment Toxicity and Partitioning during Oxidation in a Flow-Through Flume

ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2015, Vol. 49, No. 11, pp. 6926-6933

Authors: Costello, David M.; Hammerschmidt, Chad R.; Burton, G. Allen

Affiliations: Kent State Univ, Dept Biol Sci, Kent, OH 44242, USA; Wright State Univ, Dept Earth & Environm Sci, Dayton, OH 45435, USA; Univ Michigan, Sch Nat Resources & Environm, Ann Arbor, MI 48109, USA; Univ Michigan, Earth & Environm Sci, Ann Arbor, MI 48109, USA

The bioavailability of transition metals in sediments often depends on redox conditions in the sediment. We explored how the physicochemistry and toxicity of anoxic Cu-amended sediments changed as they aged (i.e., naturally oxidized) in a flow-through flume. We amended two sediments (Dow and Ocoee) with Cu, incubated the sediments in a flow-through flume, and measured sediment physicochemistry and toxicity over 213 days. As sediments aged, oxygen penetrated sediment to a greater depth, the relative abundance of Fe oxides increased in surface and deep sediments, and the concentration of acid volatile sulfide declined in Ocoee surface sediments. The total pool of Cu in sediments did not change during aging, but porewater Cu and Cu bound to amorphous Fe oxides decreased while Cu associated with crystalline Fe oxides increased. The dose-response of the epibenthic amphipod Hyalella azteca to sediment total Cu changed over time, with older sediments being less toxic than freshly spiked sediments. We observed a strong dose-response relationship between porewater Cu and H. azteca growth across all sampling periods, and measurable declines in relative growth rates were observed at concentrations below interstitial water criteria established by the U.S. EPA. Further, solid-phase bioavailability models based on AVS and organic carbon were overprotective and poorly predicted toxicity in aged sediments. We suggest that sediment quality criteria for Cu are best established from measurement of Cu in pore water rather than by estimating bioavailable Cu from the various solid-phase ligands, which vary temporally and spatially.

Keywords: COPPER poisoning; SEDIMENTS (Geology) -- Heavy metal content; FLUMES; OXIDATION-reduction reaction; parallel algorithms

Fast parallel community detection algorithm based on modularity

18th CSI International Symposium on Computer Architecture and Digital Systems, CADS 2015

Authors: Moradi, Ehsan; Fazlali, Mahmood; Malazi, Hadi Tabatabaee

Affiliations: Department of Information Technology, Kermanshah Branch, Islamic Azad University, Kermanshah, Iran; Department of Computer Science, Shahid Beheshti University GC, Tehran, Iran; Department of Software Engineering, Shahid Beheshti University GC, Tehran, Iran

ISBN: (print) 9781467380232

In recent years, detecting dense subgraphs, known as communities, in massive graphs has been a common problem in different fields of science. Communities make it easier to study complex graphs by simplifying them. Because the graphs used in social networks keep growing (to billions of nodes and edges), execution time is an important factor in community detection algorithms. To cope with this problem, this paper presents a new parallel community detection algorithm. The main idea behind the proposed method is to assign parallel threads to the calculations for adding qualified neighbor nodes to a community. The proposed algorithm is tested on a general-purpose PC (Intel Core i7, 4 GB). It reduces execution time by 25% to 78% compared to the fastest previous parallel algorithms. © 2015 IEEE.

Keywords: parallel algorithms

Parallelizing Count-Min Sketch Algorithm on Multi-core Processors

2016 6th International Conference on Machinery, Materials, Environment, Biotechnology and Computer (MMEBC 2016)

Authors: Bowen Yu; Yu Zhang; Lubing Li

Affiliation: College of Computer and Control Engineering, Nankai University

In this paper, we present a novel method that exploits the great parallel capability of multi-cores to speed up the famous Count-Min sketch algorithm. The proposed parallel Count-Min sketch algorithm equally distributes the input data stream into sub-threads, which use the original Count-Min sketch algorithm to process the sub-streams. The counters in each local Count-Min sketch with frequency increments exceeding a pre-defined threshold are sent to a merging thread, which is able to return the estimated frequencies satisfying the (ε, δ)-approximation requirement. Experiments with real traffic traces demonstrate the excellent performance as well as the effects of parameters. The parallel Count-Min sketch algorithm achieves near-linear speedup at the cost of greater memory use.
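For readers unfamiliar with the data structure, here is a minimal Count-Min sketch in Python, with a stream split into two sub-streams whose local sketches are then merged. It is a sketch under stated assumptions and not the paper's method: the merge below simply adds counter tables, whereas the abstract describes sending only counters whose increments exceed a threshold. The class name, the `width` and `depth` defaults, and the SHA-256-based hashing are illustrative choices.

```python
import hashlib


class CountMinSketch:
    """Fixed-size table of counters; estimates never fall below the true count."""

    def __init__(self, width=64, depth=4):
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # One deterministic hash per row, derived by salting with the row number.
        digest = hashlib.sha256(f"{row}:{item}".encode()).digest()
        return int.from_bytes(digest[:8], "big") % self.width

    def update(self, item, count=1):
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        return min(self.table[row][self._index(item, row)]
                   for row in range(self.depth))

    def merge(self, other):
        """Add another sketch of identical shape into this one."""
        for row in range(self.depth):
            for col in range(self.width):
                self.table[row][col] += other.table[row][col]


stream = ["a"] * 5 + ["b"] * 3 + ["c"]

# Reference: one sketch over the whole stream.
single = CountMinSketch()
for item in stream:
    single.update(item)

# Two sub-streams processed independently, then merged.
parts = [CountMinSketch(), CountMinSketch()]
for i, item in enumerate(stream):
    parts[i % 2].update(item)
merged = CountMinSketch()
for part in parts:
    merged.merge(part)
```

Because updates are additive, the merged table equals the table built from the whole stream, which is why sub-streams can be sketched independently.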

Keywords: Count-Min sketch; parallel algorithms; Frequent items

Estimation of local subgraph counts

IEEE International Conference on Big Data

Authors: Nesreen K. Ahmed; Theodore L. Willke; Ryan A. Rossi

Affiliations: Intel Labs; Palo Alto Research Center

ISBN: (print) 9781467390064

Graphlets represent small induced subgraphs and are becoming increasingly important for a variety of applications. Despite the importance of the local subgraph (graphlet) counting problem, existing work focuses mainly on counting graphlets globally over the entire graph. These global counts have been used for tasks such as graph classification as well as for understanding and summarizing the fundamental structural patterns in graphs. In contrast, this work proposes an accurate, efficient, and scalable parallel framework for the more challenging problem of counting graphlets locally for a given edge or set of edges. The local graphlet counts provide a topologically rigorous characterization of the local structure surrounding an edge. The aim of this work is to obtain the count of every graphlet of size k for each edge. The framework gives rise to efficient, parallel, and accurate unbiased estimation methods with provable error bounds, as well as exact algorithms for counting graphlets locally. Experiments demonstrate the effectiveness of the proposed exact and estimation methods on various datasets. In particular, the exact methods show strong scaling results (11-16x on 16 cores). Moreover, our estimation framework is accurate with error less than 5% on average.
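To make "counting graphlets locally for a given edge" concrete, the sketch below computes exact counts of the two connected size-3 graphlets (triangles and open 2-paths) that contain each edge of a small undirected graph. It is an illustrative serial example for k = 3 only, not the paper's framework or its estimators. The function name `local_counts_k3` is hypothetical.

```python
from collections import defaultdict


def local_counts_k3(edges):
    """For each edge (u, v), count triangles and induced 2-paths containing it."""
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    counts = {}
    for u, v in edges:
        # A third node adjacent to both endpoints closes a triangle.
        triangles = len(neighbors[u] & neighbors[v])
        # A third node adjacent to exactly one endpoint forms an open 2-path.
        paths = (len(neighbors[u]) - 1) + (len(neighbors[v]) - 1) - 2 * triangles
        counts[(u, v)] = {"triangle": triangles, "path": paths}
    return counts


edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
counts = local_counts_k3(edges)
```

Each edge is processed independently of the others once the neighbor sets are built, so the per-edge loop could be divided among workers.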

Keywords: Estimation; Big data; Biology; Image edge detection; Error analysis; Conferences; parallel algorithms

On the computational power of WECPAR

JOURNAL OF SUPERCOMPUTING, 2015, Vol. 71, No. 1, pp. 28-44

Author: El-Boghdadi, Hatem M.

Affiliation: Cairo Univ, Fac Engn, Dept Comp Engn, Giza 12211, Egypt

Reconfigurable models were shown to be very powerful in solving many problems faster than non-reconfigurable models. WECPAR is a reconfigurable model that has point-to-point reconfigurable interconnection with wires between neighboring processors. This paper studies several aspects of WECPAR. We first consider solving the list ranking problem on WECPAR. Some of the results obtained show that ranking one element in a list of elements can be solved on WECPAR in time. Also, on , ranking a list of elements can be done in time. Then, we assess the relative computational power of WECPAR and transfer a large body of algorithms to work directly on WECPAR. We introduce several simulation algorithms between WECPAR and well-known models such as PRAM and RMBM. Simulation algorithms show that a PRIORITY CRCW PRAM of processors and shared memory locations can be simulated on WECPAR in time. Also, we show that a PRIORITY CRCW basic RMBM(, of processors and buses can be simulated on WECPAR in time. This directly migrates a large number of algorithms to work on WECPAR with the simulation overhead.

Keywords: parallel algorithms; Simulation algorithms; List ranking

An Effective GPU-Based Approach to Probabilistic Query Confidence Computation

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, Vol. 27, No. 1, pp. 17-31

Authors: Serra, Edoardo; Spezzano, Francesca

Affiliation: UMIACS, College Pk, MD 20742, USA

In recent years, probabilistic data management has received a lot of attention due to several applications that deal with uncertain data: RFID systems, sensor networks, data cleaning, scientific and biomedical data management, and approximate schema mappings. Query evaluation is a challenging problem in probabilistic databases and has been proved to be #P-hard. A general method for query evaluation is based on the lineage of the query and reduces the query evaluation problem to computing the probability of a propositional formula. The main approaches proposed in the literature to approximate probabilistic query confidence computation are based on Monte Carlo simulation or on formula compilation into decision diagrams (e.g., d-trees). The former executes a polynomial, but too large, number of iterations, while the latter is polynomial for easy queries but may be exponential in the worst case. We designed a new optimized Monte Carlo algorithm that drastically reduces the number of iterations and proposed an efficient parallel version that we implemented on GPU. Thanks to the high degree of parallelism provided by the GPU, combined with the linear speedup of our algorithm, we managed to significantly reduce the long running time required by a sequential Monte Carlo algorithm. Experimental results show that our algorithm is efficient enough to be comparable with the formula compilation approach, with the significant advantage of avoiding exponential behavior.
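The reduction mentioned above, from query confidence to the probability of a propositional formula, can be illustrated with a naive Monte Carlo estimator in Python. This is the plain sequential baseline, not the authors' optimized or GPU algorithm. It assumes the lineage is a DNF formula over independent Boolean variables with positive literals only; the function name `estimate_dnf_probability` and the toy formula are illustrative.

```python
import random


def estimate_dnf_probability(clauses, probs, num_samples, seed=0):
    """Estimate P(formula is true) for a DNF over independent variables.

    clauses: list of sets of variable names (each clause is a conjunction).
    probs:   dict mapping each variable name to its probability of being true.
    """
    rng = random.Random(seed)
    variables = sorted(probs)
    hits = 0
    for _ in range(num_samples):
        # Draw one possible world by sampling every variable independently.
        world = {v for v in variables if rng.random() < probs[v]}
        # The DNF holds if at least one clause is fully contained in the world.
        if any(clause <= world for clause in clauses):
            hits += 1
    return hits / num_samples


# Lineage (x1 AND x2) OR x3 with every variable true with probability 0.5.
clauses = [{"x1", "x2"}, {"x3"}]
probs = {"x1": 0.5, "x2": 0.5, "x3": 0.5}
estimate = estimate_dnf_probability(clauses, probs, num_samples=20000)

# Exact value for comparison: 1 - (1 - 0.25) * (1 - 0.5) = 0.625.
exact = 1 - (1 - 0.5 * 0.5) * (1 - 0.5)
```

The samples are independent of one another, which is the property that makes Monte Carlo estimation straightforward to spread across many threads.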

Keywords: Probabilistic databases; query processing; probabilistic reasoning; monte carlo; parallel algorithms; GPU-computing

Parallel algorithm for solving large-scale dynamic general equilibrium models

1st Russian Conference on Supercomputing Days 2015, RuSCDays 2015

Authors: Melnikov, N.B.; Gruzdev, A.P.; Dalton, M.G.; O'Neill, B.C.

Affiliations: Lomonosov Moscow State University, Russia; National Oceanic and Atmospheric Administration, United States; National Center for Atmospheric Research, United States

We present a parallel algorithm for computing an equilibrium path in a large-scale economic growth model. We exploit the special block structure of the nonlinear systems of equations common in such models. Our algorithm is based on an iterative method of Gauss-Seidel type with prices of different time periods calculated simultaneously rather than recursively. We have implemented the parallel algorithm in OpenMP and MPI programming environments. The numerical results show that speedup improves almost linearly as the number of nodes increases. Different methods for solving an individual block (Newton-type methods, Krylov subspace methods, and trust-region methods) give similar results for the speedup.

Keywords: parallel algorithms
