Multi-core processors can easily provide benefits for multithreaded workloads, but many applications written for uniprocessors cannot automatically benefit from chip multiprocessors (CMP) designs. this paper presents ...
详细信息
ISBN:
(纸本)9780769527369
Multi-core processors can easily provide benefits for multithreaded workloads, but many applications written for uniprocessors cannot automatically benefit from chip multiprocessors (CMP) designs. this paper presents a reverse compilation framework, which translates existing binary code without source code to the Static Single Assignment (SSA) form, and then the internal SSA form is applied by the compilation phase to generate the Speculative parallelthreading (SPT) code. A profiler is applied to optimize the code dynamically during execution. the evaluation results show that these existing binary codes without source codes execute on CMP with performance improved, due to taking advantage of the speculative parallelthreading support provided by the processor.
this paper describes the use of standard Linux® [1] and Open Source software to produce an environment to support parallel scientific applications on High Performance Computers (HPC). the goals of this approach a...
详细信息
ISBN:
(纸本)980656068X
this paper describes the use of standard Linux® [1] and Open Source software to produce an environment to support parallel scientific applications on High Performance Computers (HPC). the goals of this approach are to maximize the HPC resources delivered to the application, to improve system stability and predictability, and to reduce software management burdens. the simplicity of this approach provides an additional benefit. the paper presents the reader with background, motivations, and a discussion of advantages and drawbacks of the light-os.
the secure interaction between two or more administrative domains is a major concern. IRBAC2000 is a model that quickly establishes a flexible policy for dynamic role translation from foreign domains to local. A-IRBAC...
详细信息
ISBN:
(纸本)9780769527369
the secure interaction between two or more administrative domains is a major concern. IRBAC2000 is a model that quickly establishes a flexible policy for dynamic role translation from foreign domains to local. A-IRBAC2000 mode utilizes RBAC to manage dynamic role translation between foreign and local domains. We will see that these mechanisms have significant shortcomings. We propose an improved Administrative Usage Control model named A UCON to overcome the weakness of previous models. A UCONprovides administrates user-role assignment for local and foreign domain with unified method. It provides flexible enough mechanism to distinguish users of foreign and local domain and can enforce more strict control for foreign user. While retaining the advantage of traditional RBAC model, AUCON model is being implemented in experiment system.
this paper presents an overview of our ongoing project KAP, which aims to build a message-passing parallelizing compiler for distributed-memory machines. In this paper an improved automatic code generation algorithm i...
详细信息
ISBN:
(纸本)9780769527369
this paper presents an overview of our ongoing project KAP, which aims to build a message-passing parallelizing compiler for distributed-memory machines. In this paper an improved automatic code generation algorithm is discussed Our algorithm uses the data and computation decomposition, and the reading and writing access functions to create the communication code. We can not only solve the problems which the conventional algorithm can do but also can solve another kind of problem. In such cases: there is no data dependence and the read access is not aligned in the loop nest, or the exact data-flow analysis is not given because of the limitation of the algorithm of LWT the conventional algorithm can not create communication code correctly while our improved algorithm can resolve this problem. Experiments prove that the novel algorithm can achieve satisfactory effect.
Tracking and tracing individual items is a new and emerging trend in many industries. Driven by maturing technologies such as Radio-Frequency Identification (RFID) and upcoming standards such as the Electronic Product...
详细信息
ISBN:
(纸本)9780769525778
Tracking and tracing individual items is a new and emerging trend in many industries. Driven by maturing technologies such as Radio-Frequency Identification (RFID) and upcoming standards such as the Electronic Product Code (EPC) a rapidly increasing number of enterprises are collecting vast amounts of tracking data. To enable traceability over the entire life-cycle of items data has to be shared across independent and possibly competing enterprises. the need to simultaneously compete and cooperate requires a traceability system design that allows companies to share their traceability data while maintaining complete sovereignty over what is shared and with whom. Based on an extensive study of traceability applications, we introduce the formal concept of traceability networks and highlight the technical challenges involved in sharing data in such a network. To address these challenges, we present an innovative combination of query processing techniques from P2P networks and distributed as well as parallel databases with confidentiality enforcement techniques.
It is time-consuming work by using computer-aided engineering software for finding optimum design during product development process. Using the cluster personal computers for parallel or distributed computation enviro...
详细信息
ISBN:
(纸本)9789806560680
It is time-consuming work by using computer-aided engineering software for finding optimum design during product development process. Using the cluster personal computers for parallel or distributed computation environment may be the best way to reduce the time needed to complete design. In this study a distributed artificial immune algorithm will be developed for Windows system using Builder C/C++, Winsock API and TCP/IP to construct a PC cluster environment for optimization. Different CAE softwares. cfd-rc and ansys, will be integrated withdistributed artificial immune algorithm for engineering optimization. Some test functions are used first to verify the performance of developed system. the problems of engineering optimization in structure and heat sink will be used to prove the performance of distributed artificial algorithm in real-world applications.
this paper addresses issues in interconnecting military command and control (C2) and simulation systems, using Web technologies and mediation via a common representation, the Command and Control Information Exchange D...
详细信息
ISBN:
(纸本)0769526977
this paper addresses issues in interconnecting military command and control (C2) and simulation systems, using Web technologies and mediation via a common representation, the Command and Control Information Exchange Data Model (C2IEDM). the authors supported a NATO Exploratory Team that designed an experimental program to assess the approach and prepare for its use by coalitions. As part of. that activity, the United States and France implemented an international demonstration of C2 and simulation systems interoperating using Web Services and C2IEDM this paper reports on how C2 and simulation systems were employed with Web technologies to achieve a successful demonstration in a very short time period, including experience gained in the process and plans for future development of C-BML capabilities.
Task scheduling is one of the core steps to effectively exploit the capabilities of resources in the Grid. the task scheduling problem is an NP-complete problem. this paper studied on the task scheduling problem in gr...
详细信息
ISBN:
(纸本)0769526381
Task scheduling is one of the core steps to effectively exploit the capabilities of resources in the Grid. the task scheduling problem is an NP-complete problem. this paper studied on the task scheduling problem in grid environment and proposed a task scheduling mechanism, which expressed each possible task scheduling scheme as a task-resource assignment graph (T-RA G) and thus mapped the. task scheduling problem into a graph optimal selection problem. Moreover, in order to find the optimal solution quickly and accurately, a task scheduling algorithm based on particle swarm optimization (PSO) was proposed. this algorithm regards the longest path of the task-resource assignment graph as fitness value and encodes every task-resource assignment as a particle. Finally, the experimentation shows that the approach proposed in this paper is effective to solve task scheduling problem.
Reservoir Engineering industry is very interested in using ever growing dataset with more and more complex physics and detailed models. the key issue still remains running simulations in an acceptable CPU time. Regard...
详细信息
ISBN:
(纸本)9780889866386
Reservoir Engineering industry is very interested in using ever growing dataset with more and more complex physics and detailed models. the key issue still remains running simulations in an acceptable CPU time. Regarding the trend in hardware technologies, the challenge is to improve the efficiency of reservoir simulation software on a large number of processors. New numerical difficulties and performance problems appear when the number of cells and the number of processors are growing. In this paper, we present different approaches to overcome these kinds of problems. We discuss technical choices such like advanced scalable linear solver algorithm, load balancing issue with different domain decomposition strategies, dynamic space criteria, mesh partitioner strategy and parallel solver performance management, flexible 10 strategy from simple file system to more complex parallel file system or database. We have developed and benchmarked these different solutions on published reference large scale problems and actual case studies with several tens millions of cells. We analyse the results and discuss the efficiency of each solution to overcome the scalability difficulties and performance limitations due to load unbalance.
Large clustered computers provide low-cost compute cycles, and therefore have promoted the development of sophisticated parallel-programming algorithms based on the Message Passing Interface. Storage platforms, howeve...
详细信息
ISBN:
(纸本)088986568X
Large clustered computers provide low-cost compute cycles, and therefore have promoted the development of sophisticated parallel-programming algorithms based on the Message Passing Interface. Storage platforms, however, fall to keep pace with similar advances. this paper compares standard 4X InfiniBand (IB) to 10-Gigabit Ethernet (GbE) for Use as a common storage infrastructure in addition to message passing. Considering IB's native ability to accelerate protocol processing in hardware, the Ethernet hardware in this study provided similar acceleration using TCP Offload Engines. We evaluated their I/O perfon-nance using the IOZONE benchmark on the iSCSI-based TerraGRID parallel filesystem. Our evaluations show that 10GbE, with or without protocol-offload, offered better throughput and latency than IB to socket-based applications. Although protocol-offload in both10GbE and IB demonstrated significant improvement in I/O performance, large amount of CPU are still being consumed to handle the associated data-copies and interrupts. the emerging RDMA technologies hold promises to remove the remaining CPU overhead. We plan to continue our study to research the applications of RDMA in parallel I/O.
暂无评论