Search Results - Inner Mongolia University Library

9th International Conference on Bioinformatics and Computational Biology, BICOB 2017

Authors: Goparaju, Aditya; Salem, Saeed. Affiliation: Department of Computer Science, North Dakota State University, Fargo, ND 58102, United States

ISBN: (print) 9781943436071

Robust and scalable techniques for mining patterns or subgraphs in protein-protein interaction (PPI) networks can help identify functionally relevant and coherent subnetworks. Recently, researchers have focused on integrating gene attributes with protein-protein interaction networks to mine connected subnetworks whose genes are similar in a subset of attributes. However, most of the proposed approaches assume that these subnetworks are dense. While detecting dense and cohesive subnetworks is desirable, the density requirement can prevent these algorithms from reporting highly cohesive subgraphs that are not particularly dense. In this paper, we propose a parallel algorithm for mining maximal cohesive subgraphs from node-attributed networks. Experiments on two real interaction networks and gene expression attributes demonstrate the effectiveness of the proposed algorithm. Moreover, biological enrichment analysis of the reported patterns shows that the patterns are biologically relevant and enriched with known biological processes and KEGG pathways.

Keywords: parallel algorithms

Multi-core parallelism for plane sweep algorithms as a foundation for GIS operations

GEOINFORMATICA, 2017, Vol. 21, No. 1, pp. 151-174

Authors: McKenney, Mark; Frye, Roger; Dellamano, Mathew; Anderson, Kevin; Harris, Jeremy. Affiliation: Southern Illinois Univ Edwardsville, Dept Comp Sci, Edwardsville, IL 62026, USA

The plane sweep algorithm is a foundational algorithm for many geometric and spatial computations; thus, improvements in the algorithm have far-reaching effects in many applications. In this paper, we examine the performance of the serial plane sweep algorithm and introduce a parallelization technique for the algorithm that is suitable for multi-core computers. The parallelization technique is described in detail and shown to be correct. Finally, experiments are performed using multiple data sets on computers with varying numbers of processing cores. We show that our algorithm achieves significant speedups over the serial plane sweep algorithm across a wide range of input parameters; thus, our algorithm achieves good performance without the need to tune the input parameters for specific input cases.

Keywords: Plane sweep; parallel algorithms; Multi-core; Spatial decomposition; Acceleration

Parallel solvers for fractional power diffusion problems

Authors: Čiegis, Raimondas; Starikovičius, Vadimas; Margenov, Svetozar; Kriauzienė, Rima. Affiliations: Vilnius Gediminas Technical University, Saulėtekio av. 11, Vilnius LT-10223, Lithuania; Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, Acad. G. Bonchev str., bl. 25A, Sofia 1113, Bulgaria; Vilnius University, Institute of Mathematics and Informatics, Akademijos str. 4, Vilnius LT-08663, Lithuania

Mathematical models with fractional-order differential operators are computationally expensive due to the non-local nature of these operators. In this work, we construct and investigate parallel solvers for problems described by fractional powers of elliptic operators, like fractional diffusion. Three state-of-the-art approaches are used to transform the non-local fractional-order differential problem into local partial differential equation problems formulated in a space of higher dimension. Numerical schemes and parallel algorithms are developed for all three approaches. The resulting parallel algorithms have very different properties. We investigate the weak and strong scalability of the developed parallel algorithms and compare their parallel performance. Copyright © 2017 John Wiley & Sons, Ltd.

Keywords: parallel algorithms

SIMULATION OF DYNAMIC PROCESSES IN THREE-DIMENSIONAL LAYERED FRACTURED MEDIA WITH THE USE OF THE GRID-CHARACTERISTIC NUMERICAL METHOD

JOURNAL OF APPLIED MECHANICS AND TECHNICAL PHYSICS, 2017, Vol. 58, No. 3, pp. 539-545

Authors: Golubev, V. I.; Gilyazutdinov, R. I.; Petrov, I. B.; Khokhlov, N. I.; Vasyukov, A. V. Affiliations: Moscow Inst Phys & Technol, Dolgoprudnyi, Russia; Russian Acad Sci, Keldysh Inst Appl Math, Moscow 125047, Russia

This paper addresses the computer simulation of elastic wave propagation in three-dimensional multilayer fractured media. The dynamic processes are described by the governing system of partial differential equations of deformable solid mechanics. The numerical solution of this system is carried out via the grid-characteristic method on curvilinear structured grids. The fractured nature of the medium is accounted for by explicitly selecting the boundaries of individual cracks and setting special boundary conditions on them. Various models of heterogeneous deformable media with a fractured structure are considered: a homogeneous medium, a medium with horizontal boundaries, a medium with inclined boundaries, and a medium with curvilinear boundaries. The wave fields detected on the surface are obtained, and their structures are analyzed. It is demonstrated that waves scattered from fractured media can be detected even in the case of nonparallel (inclined and curvilinear) boundaries of geological layers.

Keywords: fractured media; mathematical simulation; numerical methods; parallel algorithms; direct seismic prospecting tasks; composite materials

Data Flow Algorithms for Processors with Vector Extensions

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, Vol. 87, No. 1, pp. 21-31

Authors: Barford, Lee; Bhattacharyya, Shuvra S.; Liu, Yanzhou. Affiliations: Keysight Technol Inc, Keysight Labs, 561 Keystone Ave, Unit 434, Reno, NV 89503, USA; Univ Maryland, College Pk, MD 20742, USA; Tampere Univ Technol, Tampere, Finland

Full use of the parallel computation capabilities of present and expected CPUs and GPUs requires use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that imposes serial dependencies between actor invocations, making vectorization across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for doing so in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that result in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response (IIR) filters and (2) finite state machines (FSMs). The correctness and performance of the resulting IIR filters and one class of FSMs are studied.

Keywords: Digital signal processing; Data flow computing; Vector processors; parallel algorithms; Graphics processing units
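
The parallel scan idea in the abstract above can be illustrated on the simplest stateful actor, a first-order IIR filter y[n] = a*y[n-1] + b*x[n]. The sketch below is not the authors' code: the filter order, the function names `iir_serial` and `iir_scan`, and the choice of a Hillis-Steele style scan in NumPy are all assumptions made for illustration. Each sample is treated as an affine map y -> A*y + C; because composing affine maps is associative, the running composition can be computed in roughly log2(n) vectorized passes instead of n dependent steps.

```python
import numpy as np


def iir_serial(x, a, b, y0=0.0):
    """Reference loop: every output depends on the previous one."""
    y = np.empty(len(x))
    prev = y0
    for n, xn in enumerate(x):
        prev = a * prev + b * xn
        y[n] = prev
    return y


def iir_scan(x, a, b, y0=0.0):
    """Same filter via an inclusive scan over affine maps (illustrative)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Sample i starts as the map y -> A[i]*y + C[i] with A[i]=a, C[i]=b*x[i].
    A = np.full(n, float(a))
    C = b * x
    shift = 1
    while shift < n:
        # Compose each map with the one `shift` positions earlier.
        # The earlier map is applied first: (A2, C2) o (A1, C1)
        # = (A2*A1, A2*C1 + C2). Each pass is a whole-array operation.
        A_new = A.copy()
        C_new = C.copy()
        C_new[shift:] = A[shift:] * C[:-shift] + C[shift:]
        A_new[shift:] = A[shift:] * A[:-shift]
        A, C = A_new, C_new
        shift *= 2
    # After the scan, map i sends the initial state y0 directly to y[i].
    return A * y0 + C
```

Comparing `iir_scan(x, a, b)` against `iir_serial(x, a, b)` with `numpy.allclose` on the same input is a quick sanity check. The scan performs more total arithmetic than the loop, so it may only pay off where vector units or many threads are available.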

Guidefill: GPU Accelerated, Artist Guided Geometric Inpainting for 3D Conversion of Film

SIAM JOURNAL ON IMAGING SCIENCES, 2017, Vol. 10, No. 4, pp. 2049-2090

Authors: Hocking, L. Robert; MacKenzie, Russell; Schonlieb, Carola-Bibiane. Affiliations: Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB2 1TN, England; Gener8 Media Corp, Vancouver, BC V5T 1M6, Canada

The conversion of traditional film into stereo 3D has become an important problem in the past decade. One of the main bottlenecks is a disocclusion step, which in commercial 3D conversion is usually done by teams of artists armed with a toolbox of inpainting algorithms. A current difficulty in this is that most available algorithms either are too slow for interactive use or provide no intuitive means for users to tweak the output. In this paper we present a new fast inpainting algorithm based on transporting along automatically detected splines, which the user may edit. Our algorithm is implemented on the GPU and fills the inpainting domain in successive shells that adapt their shape on the fly. In order to allocate GPU resources as efficiently as possible, we propose a parallel algorithm to track the inpainting interface as it evolves, ensuring that no resources are wasted on pixels that are not currently being worked on. Theoretical analyses of the time and processor complexity of our algorithm without and with tracking (as well as numerous numerical experiments) demonstrate the merits of the latter. Our transport mechanism is similar to the one used in coherence transport [F. Bornemann and T. Marz, J. Math. Imaging Vision, 28 (2007), pp. 259-278; T. Marz, SIAM J. Imaging Sci., 4 (2011), pp. 981-1000] but improves upon it by correcting a "kinking" phenomenon whereby extrapolated isophotes may bend at the boundary of the inpainting domain. Theoretical results explaining this phenomenon and its resolution are presented. Although our method ignores texture, in many cases this is not a problem due to the thin inpainting domains in 3D conversion. Experimental results show that our method can achieve a visual quality that is competitive with the state of the art while maintaining interactive speeds and providing the user with an intuitive interface to tweak the results.

Keywords: image processing; image inpainting; 3D conversion; PDEs; parallel algorithms; GPU

A matrix-free approach to efficient affine-linear image registration on CPU and GPU

JOURNAL OF REAL-TIME IMAGE PROCESSING, 2017, Vol. 13, No. 1, pp. 205-225

Authors: Ruehaak, Jan; Koenig, Lars; Tramnitzke, Florian; Koestler, Harald; Modersitzki, Jan. Affiliations: Fraunhofer MEVIS, Maria Goeppert Str 3, D-23562 Lubeck, Germany; Univ Erlangen Nurnberg, Lehrstuhl Syst Simulat, Cauerstr 11, D-91058 Erlangen, Germany; Univ Lubeck, Inst Math & Image Comp, Maria Goeppert Str 3, D-23562 Lubeck, Germany

This paper presents a generic approach to highly efficient image registration in two and three dimensions. Both monomodal and multimodal registration problems are considered. We focus on the important class of affine-linear transformations in a derivative-based optimization framework. Our main contribution is an explicit formulation of the objective function gradient and Hessian approximation that allows for very efficient, parallel derivative calculation with virtually no memory requirements. The flexible parallelism of our concept allows for direct implementation on various hardware platforms. Derivative calculations are fully matrix free and operate directly on the input data, thereby reducing the auxiliary space requirements from to . The proposed approach is implemented on multicore CPU and GPU. Our GPU code outperforms a conventional matrix-based CPU implementation by more than two orders of magnitude, thus enabling usage in real-time scenarios. The computational properties of our approach are extensively evaluated, thereby demonstrating the performance gain for a variety of real-life medical applications.

Keywords: Image registration; Computational efficiency; parallel algorithms; GPU programming; Real-time processing

A New Method for Computational Private Information Retrieval

COMPUTER JOURNAL, 2017, Vol. 60, No. 8, pp. 1238-1250

Authors: Tillem, Gamze; Savas, Erkay; Kaya, Kamer. Affiliation: Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey

Lipmaa's Computational Private Information Retrieval (CPIR) protocol is probably the most bandwidth-efficient method in the literature, although its computational complexity is a limiting factor for practical applications as it is based on expensive public-key operations. Utilizing binary decision diagrams (BDDs) and the Damgård-Jurik cryptosystem, Lipmaa's CPIR performs three modular exponentiation operations per internal node of the BDD. In this paper, we present a new CPIR protocol, which reduces the number of exponentiation operations to one per first-level internal node and two per other internal node of the BDD. For 1024-bit exponents (i.e. 80-bit security level) and 32 768 items, when compared with the fastest parallel implementation in the literature on four cores, reducing the number of exponentiations yields a 1.22x speedup and the multi-exponentiation technique adds 2.23x more on top of that. Overall, when combined, reducing the number of exponentiations, multi-exponentiation, parallelization on four cores and the hybrid approach can provide more than 300x speedup compared to the sequential implementation of the original method.

Keywords: number theoretic; private information retrieval; security; privacy; parallel algorithms

Highly scalable implementation of an implicit matrix-free solver for gas dynamics on GPU-accelerated clusters

JOURNAL OF SUPERCOMPUTING, 2017, Vol. 73, No. 2, pp. 631-638

Authors: Menshov, Igor; Pavlukhin, Pavel. Affiliations: Keldysh Inst Appl Math, Moscow 125047, Russia; Res & Dev Inst Kvant, Moscow 125438, Russia

A numerical approach for solving gas dynamics on Cartesian grids is considered, which employs an implicit time marching scheme with the matrix-free Lower-Upper Symmetric Gauss-Seidel (LU-SGS) method for solving the discrete equations. Boundary conditions are treated with an embedded-boundary method. The method has two attractive features: (1) algorithmic uniformity of calculations and (2) structured memory accesses that fit massively parallel architectures with GPU accelerators well. We propose a novel CUDA+MPI computational algorithm scalable up to hundreds of GPUs and give an in-depth analysis of its implementation (interoperability issues, library tuning).

Keywords: CFD; LU-SGS; parallel algorithms; CUDA; MPI

Primal-Dual Parallel Algorithm for Optimal Content Delivery in Cloud CDNs

8th IEEE International Conference on Computational Intelligence and Computing Research, ICCIC 2017

Authors: Mahesh, Gadiraju; R Maheswara Rao, V.V.; Shankar, R Shiva; G Sirisha, Gn V. Affiliation: Dept. of C.S.E., S.R.K.R. Engineering College, Bhimavaram, A.P., India

ISBN: (print) 9781509066209

Content delivery networks have been providing content delivery services for the last two decades using their own infrastructure. Nowadays, content delivery networks have the better option of using storage cloud sites as edge servers. This work considers the problems of replicating the content required by users on optimal cloud sites and of assigning those sites to users. Given a set of current user requests and the candidate cloud sites for each user, the combined problem of finding the optimal sites for content placement and content dissemination is a set-cover problem. Previous works solved this problem using a greedy algorithm. This work proposes a primal-dual parallel algorithm for optimal content delivery in cloud content delivery networks. The proposed algorithm is an efficient parallel algorithm that requires only local information. The primal-dual algorithm takes less time than the greedy algorithm, as the experimental results demonstrate. © 2017 IEEE.

Keywords: parallel algorithms
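
For context, the greedy baseline mentioned in the abstract above is the classic set-cover heuristic: repeatedly pick the site that serves the most still-unserved users. The sketch below shows that baseline only, not the paper's primal-dual algorithm; the site names, the user IDs, and the function name `greedy_set_cover` are hypothetical.

```python
def greedy_set_cover(universe, subsets):
    """Greedy set cover: pick the subset covering the most uncovered items.

    universe: iterable of items to cover (here: users, hypothetical).
    subsets:  dict mapping a name (here: a cloud site) to the set of
              items it can cover.
    Returns the list of chosen names in the order they were picked.
    """
    uncovered = set(universe)
    chosen = []
    while uncovered:
        # Ties go to the first name in the dict's insertion order.
        best = max(subsets, key=lambda name: len(subsets[name] & uncovered))
        gained = subsets[best] & uncovered
        if not gained:
            raise ValueError("some items cannot be covered by any subset")
        chosen.append(best)
        uncovered -= gained
    return chosen


# Hypothetical example: which users each cloud site could serve.
sites = {
    "A": {1, 2, 3},
    "B": {2, 4},
    "C": {3, 4, 5},
    "D": {5},
}
selected = greedy_set_cover(range(1, 6), sites)
```

Each round of the greedy heuristic needs a global view of which users remain uncovered, which is one reason an algorithm that uses only local information, as the abstract describes, may parallelize more readily.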
