Fast multipole method (FMM) is a fast, robust and accurate algorithm which is widely used in molecular dynamics, electrostatics and electromagnetics simulations. In this paper, we implemented and optimized parallel FM...
详细信息
Garbage collection is a memory management mechanism for automatically reclaiming garbage objects in memory. It can effectively relieve the programmers' burden and optimize the design of programs. Currently many ga...
详细信息
Garbage collection is a memory management mechanism for automatically reclaiming garbage objects in memory. It can effectively relieve the programmers' burden and optimize the design of programs. Currently many garbage collection methods for distributed systems have been put forth as the distributed systems are more and more popular. Relaxed consistency model is a very important kind of memory consistency model in distributed system, but it is inefficiently to perform garbage collection in the Distributed shared memory system (DSM) on relaxed memory model with traditional methods. This article proposes a trace-based garbage collection algorithm for DSM system based on relaxed memory model. Our algorithm notes the references among active objects of every node in DSM system, which makes the application process and the collection process able to execute concurrently. And it makes use of the relaxed consistency model's features to reduce communication cost and response delay. Furthermore, the use of two tables insures the algorithm correct and effective. This algorithm displays its advantages of real-time and incremental, and it is proved to be correct and effective.
A single nucleotide polymorphism (SNP),as the most common form of genetic variation,has been widely studied to help analyze the possible association between diseases and *** gain more information,SNPs on a single chro...
详细信息
A single nucleotide polymorphism (SNP),as the most common form of genetic variation,has been widely studied to help analyze the possible association between diseases and *** gain more information,SNPs on a single chromosome are usually studied together,which constitute a *** haplotypes from biological experiments is usually very costly and time-consuming,which causes people to develop efficient methods to determine haplotypes from the computational *** problems and algorithms about haplotypes have been proposed to reduce the cost of studies of disease *** general,four categories of problems are widely researched:the haplotype assembly problem,the haplotype inference problem,the haplotype block partition problem,and the haplotype tagging SNP selection *** former two problems have been well reviewed by many researchers,whereas the latter two have not been comprehensively surveyed to our *** this paper,we try to make a detailed introduction to the four problems,especially the latter two.
Simulation of blood flow in arteries is an extensive topic of research to study various hemodynamic factors in CFD. This paper studies two different degrees of stenosis (less than 50 and 80% by flow area) that have be...
详细信息
Wireless sensor networks have posed a number of challenging problems such as localization, deployment and tracking, etc. One of the interesting problems is the calculation of the coverage and exposure paths for the se...
详细信息
Wireless sensor networks have posed a number of challenging problems such as localization, deployment and tracking, etc. One of the interesting problems is the calculation of the coverage and exposure paths for the sensor networks. This paper presents a fully localized algorithm to solve the worst coverage problem first introduced by Meguerdichian et al. The nodes of the sensor network cooperate to construct the worst coverage path only by the one-hop neighbor's information, thus avoiding the massive communication and conserving the energy. The correctness of the proposed algorithm is proved formally under the sensing diminishing model. Moreover, this algorithm can be easily extended to solve the minimal exposure problem with local information as well.
In this paper, we present a scalable parallel simulator - MPUS - for verifying the design of our next generation highperformance parallel computer - RedNeurons(RN) parallel computer. The RedNeurons parallel computer ...
详细信息
ISBN:
(纸本)9781595937575
In this paper, we present a scalable parallel simulator - MPUS - for verifying the design of our next generation highperformance parallel computer - RedNeurons(RN) parallel computer. The RedNeurons parallel computer is based on CMP technology, and it adopts an advanced but maybe some complicated architecture and topology. This paper mainly describes the design and implementation of the MPUS.
Recent works in deep learning have shown that large models can dramatically improve performance. In this paper, we accelerated the deep network training using many GPUs. We have developed a framework based on Caffe ca...
详细信息
ISBN:
(纸本)9781509053827
Recent works in deep learning have shown that large models can dramatically improve performance. In this paper, we accelerated the deep network training using many GPUs. We have developed a framework based on Caffe called Caffe-HPC that can utilize computing clusters with multiple GPUs to train large models. Caffe[6] provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models. And Caffe-HPC retains all the features of the original Caffe, the model trained on original Caffe can be continue to trained on Caffe-HPC. It provides a convenient solution for people who are using Caffe and want to speed up the training. Using an Asynchronous Stochastic Gradient Descent optimizer, We made a good acceleration on training a CNN model on ILSVRC[5] 2012 dataset. And we have compared the convergence of different SGD algorithms. We believe our work will makes it possible to train larger networks on larger training sets in a reasonable amount of time.
Wireless sensor networks have been widely used in many important fields. Medium Access Control (MAC) protocols have a significant effect on the function and performance of sensor networks. At present, most MAC protoco...
详细信息
The accuracy of localization is a significant criterion to evaluate the practical utility of localization algorithm in wireless sensor networks. In mostly localization algorithms, one of the main methods to improve lo...
详细信息
The interference imposes a potential negative impact on the performance of a wireless network. A device is interfered if it receives a transmission not intended for it. In this paper, we introduce an explicit computat...
详细信息
暂无评论