We present a parallel bisection mesh refinement algorithm based on ALBERT (Adaptive multi-Level finite element toolbox using Bisection refinement and Error control by Residual Techniques). The goal is to develop a parallel adaptive finite element code suitable for distributed-memory parallel computers and PC clusters. We give an overview of the basic strategy for parallelizing ALBERT and address issues that arise in parallel mesh refinement. We propose a modified mesh refinement algorithm that can be implemented efficiently on distributed-memory parallel computers and discuss its properties. Numerical experiments with the parallel bisection mesh refinement algorithm are presented.
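For readers unfamiliar with bisection refinement, the basic serial step can be sketched as follows. This is a minimal illustration, not the algorithm of the paper: it assumes a 2D triangle stored as `(v0, v1, v2)` whose refinement edge is `v0`-`v1`, it places the new midpoint last in each child, and it ignores both the conformity closure (hanging nodes) and any parallel distribution. The function names `bisect`, `area` and `refine` are hypothetical.

```python
def midpoint(p, q):
    """Midpoint of two 2D points given as (x, y) tuples."""
    return tuple((a + b) / 2 for a, b in zip(p, q))


def bisect(tri):
    """Split triangle (v0, v1, v2) across its refinement edge v0-v1.

    Assumed convention: the new vertex is stored last in each child, so
    each child's refinement edge (its first two vertices) is the edge
    opposite the new vertex.
    """
    v0, v1, v2 = tri
    m = midpoint(v0, v1)
    return (v2, v0, m), (v1, v2, m)


def area(tri):
    """Unsigned area of a 2D triangle."""
    (x0, y0), (x1, y1), (x2, y2) = tri
    return abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)) / 2


def refine(mesh, marked):
    """Bisect every triangle whose index is in `marked`; keep the rest.

    No conformity closure is performed, so neighbours of a bisected
    triangle may be left with a hanging node.
    """
    out = []
    for i, tri in enumerate(mesh):
        out.extend(bisect(tri) if i in marked else [tri])
    return out
```

Calling `refine([tri], {0})` on a single triangle returns its two children, each with half the parent's area; repeated calls halve the marked elements again.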
ISBN: 0780390350 (print)
In software and content development in intercultural environments, collaboration among participants and effective administration are crucial to overall project success. Various tools are used for collaboration and for project administration, because no single groupware product is sufficient for both collaboration support and management. Interoperability among the tools employed in one project is therefore essential for efficient collaboration. To establish semantic interoperability of collaboration tools, we have designed an RDF tag set that serves as a vocabulary for describing information interoperably among the tools. In this paper, we propose a tag set for describing information in intercultural projects and show how the data in collaboration tools and project management tools are integrated with those semantic tags. With the tag set and the supporting method, members and administrators are expected to need little effort to keep track of the collaboration status.
Because of its simplicity, reliability and maturity, NFS is widely used in clusters. However, owing to its high overheads and implementation limitations, standard NFS cannot fully exploit the multiple network channels and multiple SCSI channels available on the server. In this paper, we present a new, efficient approach to a high-performance NFS implementation for cluster applications. By adding mechanisms that make good use of the NFS server's multiple communication channels and multiple I/O channels, CluserNFS can potentially provide better I/O performance than standard NFS, as illustrated by our simulation results.
The Sockets application programming interface is the de facto standard in network programming. Sockets emulation over high-performance networks has been pursued by many researchers. Most projects in this area favor user-level communication, but this approach has resulted in some compatibility problems. In this paper, after reexamining the tradeoff between user-level and kernel-level communication, we discuss the design and implementation of Sockvia, a kernel-level Sockets emulation system based on the virtual interface architecture. Sockvia emulates Sockets streaming semantics and achieves full compatibility with Sockets over TCP/IP. Through performance optimizations such as lightweight flow control and private buffers, Sockvia compares favorably with Sockets over GM-IP or SGM. The half round-trip latency of Sockvia is below 12 µs and the peak bandwidth is over 240 MBytes/s. The results of real-world application tests are also presented.
The major challenge in designing cluster file systems is to provide high aggregate I/O bandwidth and high metadata processing throughput for applications running on large-scale cluster systems. With the rapid increase in required data storage, improving the access performance of a PB-scale cluster file system is also a challenging issue. In this paper, we introduce the storage space management policy for a PB-scale cluster file system. Our performance results show that, compared with Lustre and GFS, DCFS2 is able to provide comparable or even better aggregate I/O bandwidth.
Summary form only given. Clusters equipped with Linux, a commonly available operating system, are very popular in the supercomputer industry. As a general-purpose operating system, however, Linux probably contains some redundant properties that are unsuitable or unnecessary for HPC applications, and its design tradeoffs cannot give much weight to the characteristics of particular applications. This leaves room to optimize Linux for specific applications such as Linpack. In this paper, we present our experience of optimizing Linux for Linpack, a standard benchmark for supercomputers, on Dawning4000A - an HTFlops Linux/PC cluster system. Based on observation, the optimization involves three aspects: the memory management system (superpage support), a light, low-noise kernel, and the process management system. Measurements show that superpage support yields a 2-3 percent performance improvement, while the light, low-noise kernel and the process management optimizations give only a slight or even negative performance change. Although the study targets Linux, the issues identified should also be relevant to other general-purpose operating systems.
Multiple sequence alignment is a fundamental and challenging problem in computational molecular biology. ClustalW, the most widely used multiple sequence alignment software, performs very slowly on hundreds of sequences. Here, we analyze the algorithmic complexity of ClustalW as well as its time profile in practice, and then propose a strategy that uses reconfigurable FPGA hardware to accelerate ClustalW. Comparison with other coarse-grained parallel strategies demonstrates a good speedup for this strategy and savings in computing resources.
How to distribute the items in the file system hierarchy across a group of metadata servers is an important issue that determines the holistic metadata processing performance (HMPP) of a cluster file system that manages its metadata with a group of metadata servers. The HMPP is affected by two factors: the balance degree of the metadata distribution and the number of branch points. Two widely used types of metadata distribution policy are the dynamic subtree policy and the random policy. Each emphasizes one factor and neglects the other; as a result, their HMPP is low. To make good use of the processing capacity of all metadata servers, we present a novel metadata distribution policy, called the dynamic dir-grain (DDG) policy, which takes both factors into account. Our performance results show that this policy is potentially more efficient than the other two types of policy in real environments, as well as when a large hierarchy is created or removed.
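The tension between the two factors can be made concrete with a small sketch. Everything here is an assumption made for illustration and is not taken from the paper: a "branch point" is counted as a parent/child pair stored on different servers, imbalance is measured as the maximum server load divided by the mean load, a CRC32 hash stands in for the random policy, and a static assignment of top-level directories stands in for the (dynamic) subtree policy. The DDG policy itself is not implemented.

```python
import zlib
from collections import Counter


def parent(path):
    """Parent directory of an absolute path such as '/a/b' (root is '/')."""
    head = path.rsplit("/", 1)[0]
    return head or "/"


def hash_policy(paths, n):
    """Stand-in for a random policy: place each item by a hash of its path."""
    return {p: zlib.crc32(p.encode()) % n for p in paths}


def subtree_policy(paths, n):
    """Static stand-in for a subtree policy: each top-level directory and
    everything below it is stored on a single server."""
    tops = sorted({p.split("/")[1] for p in paths})
    owner = {t: i % n for i, t in enumerate(tops)}
    return {p: owner[p.split("/")[1]] for p in paths}


def imbalance(assign, n):
    """Maximum server load divided by mean load (1.0 is perfectly balanced)."""
    load = Counter(assign.values())
    return max(load.values()) * n / len(assign)


def branch_points(assign):
    """Number of items stored on a different server than their parent."""
    return sum(
        1
        for p, s in assign.items()
        if parent(p) in assign and assign[parent(p)] != s
    )
```

On a skewed hierarchy, the subtree stand-in produces no branch points but can leave one server with most of the items, whereas the hash stand-in tends to spread items evenly at the cost of splitting directories from their children.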
We propose a simulation-based technique for analysis and optimization of extended burst-mode (XBM) asynchronous controllers. In asynchronous controllers of this sort, timing information on control signals is significa...
This paper proposes a four-fingered robot hand with a dual turning mechanism, in which two fingers rotate on an inner circle and the other two on an outer circle, independently and about a common center. Owing to this mechanical configuration, the hand has a particular rotation axis about which manipulation can be completely decomposed into velocity control around the axis and internal force control in the contact plane. We achieved a manipulation task around this axis at 0.8 s per rotation, whereas manipulation about another axis is relatively slow.