As computational clusters rapidly grow in both size and complexity, system reliability and, in particular, application resilience have become increasingly important factors to consider in maintaining efficiency and pr...
详细信息
ISBN:
(纸本)9781605589428
As computational clusters rapidly grow in both size and complexity, system reliability and, in particular, application resilience have become increasingly important factors to consider in maintaining efficiency and providing improved computational performance over predecessor systems. One commonly used mechanism for providing application fault tolerance in parallel systems is the use of checkpointing. By making use of a multi-cluster simulator, we study the impact of sub-optimal checkpoint intervals on overall application efficiency. By using a model of a 1926 node cluster and workload statistics from Los Alamos National laboratory to parameterize the simulator, we find that dramatically over-estimating the AMTTI has a fairly minor impact on application efficiency while potentially having a much more severe impact on user-centric performance metrics such a queueing delay. We compare and contrast these results with the trends predicted by an analytical model. Copyright 2010 ACM.
Large scale cluster based on cloud technologies has been widely used in many areas, including the data center and cloud computing environment. The purpose of presenting the research paper in this field was to solve th...
详细信息
Large scale cluster based on cloud technologies has been widely used in many areas, including the data center and cloud computing environment. The purpose of presenting the research paper in this field was to solve the challenge in Medical Image exchanging, storing and sharing issues of EMR (Electronic Medical Record). In recent years, many countries invested significant resources on the projects of EMR topics. The benefit of the EMR included: Patient-centered Care, Collaborative Teams, Evidence-based Care, Redesigned Business Processes, Relevant Data Capture and Analysis and Timely Feedback and Education. For instance, the ARRAHIT project in Untied States (2011-2015), Health Infoway project in Canada (2001-2015) and NHIP project in Taiwan, etc. Aim to the topic of EMR, we presented a system called MIFAS (Medical Image File Accessing System) to solve the exchanging, storing and sharing on Medical Images of crossing the different hospitals issues. Through this system we can enhance efficiency of sharing information between patients and their caregivers. Furthermore, the system can make the best-possible patient-care decisions.
Currently, Java has been gradually applied in embedded real-time area with the improvement by the Real-Time Specification for Java (RTSJ). Accordingly, a hardware Java execution engine for embedded real-time applicati...
详细信息
performance perturbations are a natural phenomenon in volunteer computing systems. Scheduling parallel applications with precedence-constraints is emerging as a new challenge in these systems. In this paper, we propos...
详细信息
We recently converted a CS1 (Introduction to computing) class to use the Python language in place of C++. Among other reasons, we hoped that the new language would help students who typically struggled with the course...
详细信息
Efficient coverage algorithms are essential for information search or dispersal in all kinds of networks. We define an extended coverage problem which accounts for constrained resources of consumed bandwidth B and tim...
详细信息
Efficient coverage algorithms are essential for information search or dispersal in all kinds of networks. We define an extended coverage problem which accounts for constrained resources of consumed bandwidth B and time T. Our solution to the network challenge is here studied for regular grids only. Using methods from statistical mechanics, we develop a coverage algorithm with proliferating message packets and temporally modulated proliferation rate. The algorithm performs as efficiently as a single random walker but O(B(d−2)/d) times faster, resulting in significant service speed-up on a regular grid of dimension d. The algorithm is numerically compared to a class of generalized proliferating random walk strategies and on regular grids shown to perform best in terms of the product metric of speed and efficiency.
In this work we describe the initial implementation of a web-based tool for the automatic segmentation of cardiac Magnetic Resonance Images. The application uses an active contours algorithm called Snakes, which are a...
详细信息
The Copenhagen climate conference 2009 has concluded that we have to change the way how we live. Currently, several carbon management systems are being developed to meet enterprises' requirement. In addition, a nu...
详细信息
The Copenhagen climate conference 2009 has concluded that we have to change the way how we live. Currently, several carbon management systems are being developed to meet enterprises' requirement. In addition, a number of carbon footprint calculation tools have been developed to fulfill carbon calculation needs from industries, households and individuals. However, under the current circumstance, it is still a challenging issue on how to take advantage of the latest IT technologies to improve carbon calculations with comprehensive functionality. In this paper, we extend carbon 2.0, the concept proposed by previous studies, and introduce a new carbon framework based on cloud computing. This new framework is massively scalable and can configure and deliver carbon-related services dynamically and on demand. Moreover, it is a real low-carbon framework itself!
Modern highperformancecomputing systems are being increasingly deployed in a hierarchical fashion with multi-core computing platforms forming the base of the hierarchy. These systems are usually comprised of multipl...
详细信息
ISBN:
(纸本)9781424465330
Modern highperformancecomputing systems are being increasingly deployed in a hierarchical fashion with multi-core computing platforms forming the base of the hierarchy. These systems are usually comprised of multiple racks, with each rack consisting of a finite number of chassis, and each chassis having multiple compute nodes or blades, based on multi-core architectures. The networks are also hierarchical with multiple levels of switches. Message exchange operations between processes that belong to different racks involve multiple hops across different switches and this directly affects the performance of collective operations. In this paper, we take on the challenges involved in detecting the topology of large scale InfiniBand clusters and leveraging this knowledge to design efficient topology-aware algorithms for collective operations. We also propose a communication model to analyze the communication costs involved in collective operations on large scale supercomputing systems. We have analyzed the performance characteristics of two collectives, MPI_Gather and MPI_Scatter, on such systems and we have proposed topology-aware algorithms for these operations. Our experimental results have shown that the proposed algorithms can improve the performance of these collective operations by almost 54% at the micro-benchmark level.
暂无评论