检索结果-内蒙古大学图书馆

Proceedings of the international conference on parallel and distributed processing techniques and applications

作者： Liu, Thomas J. New Jersey City University New Jersey City NJ United States

ISBN: (纸本)1892512416

This paper describes a performance evaluation study of data replication employing SQL Server with Windows 2000 server operating system in a networked distributed system. The simulated environment involves an isolated LAN with 100/10 Mb bandwidth and the deployment of 16 Windows 2000 Servers as database publishers with another Windows 2000 Server as a central subscriber. The set up of transactional replication of SQL Server databases provides the predefined channels for data flow from each publisher to central subscriber. A JAVA multi-thread program is written to conduct the concurrent data publishing to the central subscriber database. Up to two millions data transactions benchmarks the performance of the SQL Server database in data replication. It also reports the maximum data flow rate and the issues regarding the data replication in the distributed network system.

关键词： distributed database systems

来源：评论

学校读者我要写书评

暂无评论

parallel execution of I/O system and application functionality

Parallel execution of I/O system and application functionali...

引用

international conference on parallel and distributed processing techniques and applications

作者： Enblom, L Malardalen Univ Dept Comp Sci & Engn Vasteras Sweden

ISBN: (纸本)1892512459

Many real-time control systems in industry are designed today for single processor architectures. At the same time, more functionality needs to be integrated into the software system. In order to enable correct timely execution of the control and protection applications, designers may need to optimize application code aggressively. Unwanted simplifications of algorithms or low sampling frequencies of the environment may be the result. Functionality In a system, which already has a degree of concurrency, may enable the system to scale onto a multiprocessor environment. This paper discusses and presents results from a study, which separates a substation automation real-time I/O communication system from application level threads in order to exploit existing concurrency. Within the system model described here, as well as in many other system models, it is possible to execute communication mechanisms and applications in parallel. The motivation for this work Is let parallel execution of the I/O System and the application enable higher performance for application functionality. The result Is more flexibility for the application designers. By describing a model of the real-time substation automation I/O System and extending that model with a mechanism to enable execution in a multiprocessor architecture, we contribute to the understanding of both the composition and the performance issues concerning parallel execution In such industrial systems. Measurements and results originate from execution in an existing system and from the multiprocessor system created.

关键词： real-time system I/O system multiprocessor

来源：评论

学校读者我要写书评

暂无评论

Computing the Euclidean distance transform on a linear array of processors

引用

JOURNAL OF SUPERCOMPUTING 2003年第2期25卷 177-185页

作者： Gavrilova, ML Alsuwaiyel, MH Univ Calgary Dept Comp Sci Calgary AB T2N 1N4 Canada KFUPM Dept Informat & Comp Sci Dhahran Saudi Arabia

Given an n x n binary image of white and black pixels, we present an optimal parallel algorithm for computing the distance transform and the nearest feature transform using the Euclidean metric. The algorithm employs ... 详细信息

关键词： feature transform distance transform Euclidean distance parallel algorithm linear array of processors image processing

来源：评论

学校读者我要写书评

暂无评论

Performance Meets Programmabilty: Enabling Native Python MPI Tasks In PyCOMPSs 28

Performance Meets Programmabilty: Enabling Native Python MPI...

引用

28th Euromicro international conference on parallel, distributed and Network-Based processing (PDP)

作者： Elshazly, Hatem Lordan, Fratacesc Ejarque, Jorge Badia, Rosa M. Barcelona Supercomp Ctr BSC Dept Comp Sci Barcelona Spain

ISBN: (纸本)9781728165820

The increasing complexity of modern and future computing systems makes it challenging to develop applications that aim for maximum performance. Hybrid parallel programming models offer new ways to exploit the capabilities of the underlying infrastructure. However, the performance gain is sometimes accompanied by increased programming complexity. We introduce an extension to PyCOMPSs, a high-level task-based parallel programming model for Python applications, to support tasks that use MPI natively as part of the task model. Without compromising application's programmability, using Native MPI tasks in PyCOMPSs offers up to 3x improvement in total performance for compute intensive applications and up to 1.9x improvement in total performance for 110 intensive applications over sequential implementation of the tasks.

关键词： Hybrid Programming Models distributed Computing MPI High Performance Computing Task-based parallel Programming Models Performance Productivity

来源：评论

学校读者我要写书评

暂无评论

Performance study of HPC applications on an Arm-based cluster using a generic efficiency model 28

Performance study of HPC applications on an Arm-based cluste...

引用

28th Euromicro international conference on parallel, distributed and Network-Based processing (PDP)

作者： Banchelli, Fabio Peiro, Kilian Querol, Andrea Ramirez-Gargallo, Guillem Ramirez-Miranda, Guillem Vinyals, Joan Vizcaino, Pablo Garcia-Gasulla, Marta Mantovani, Filippo Barcelona Supercomp Ctr Barcelona Spain

ISBN: (纸本)9781728165820

HPC systems and parallel applications are increasing their complexity. Therefore the possibility of easily study and project at large scale the performance of scientific applications is of paramount importance. In this paper we describe a performance analysis method and we apply it to four complex HPC applications. We perform our study on a pre-production HPC system powered by the latest Arm-based CPUs for HPC, the Marvell ThunderX2. For each application we spot inefficiencies and factors that limit their scalability. The results show that in several cases the bottlenecks do not come from the hardware but from the way applications are programmed or the way the system software is configured.

关键词： Performance analysis High Performance Computing parallel applications Arm ThunderX2

来源：评论

学校读者我要写书评

暂无评论

A heterogeneous checkpoint and recovery protocol in cluster-based distributed systems

A heterogeneous checkpoint and recovery protocol in cluster-...

引用

Proceedings of the international conference on parallel and distributed processing techniques and applications

作者： Paul, Himadri Sekhar Gupta, Arobinda Badrinath, R. Dept. of Computer Science and Eng. Indian Institute of Technology Kharagpur. 721302 India

ISBN: (纸本)1892512416

distributed systems consisting of clusters of computing nodes are becoming increasingly popular for solving long running applications. Checkpoint and recovery is a common technique for providing fault tolerance to such applications. In non-cluster based distributed systems, the entire system employs a single checkpoint and recovery protocol. However, in a cluster based system, the constituent clusters may employ different checkpoint and recovery protocols for fault tolerance inside the cluster boundary, for reasons of administrative policy or resource constraints. In this paper we investigate the problem of co-existence of different checkpoint and recovery protocols in different clusters. The problem of employing coordinated and message logging protocols, two of the most popular checkpoint and recovery protocols, in different clusters is discussed. A protocol is presented to provide a consistent checkpoint and recovery based fault tolerance for the entire system, when the individual clusters are running either coordinated or message logging protocols.

关键词： distributed computer systems

来源：评论

学校读者我要写书评

暂无评论

Optimization techniques for Concurrent STM-Based Implementations: A Concurrent Binary Heap as a Case Study

Optimization Techniques for Concurrent STM-Based Implementat...

引用

23rd IEEE international parallel and distributed processing Symposium

作者： Dragicevic, Kristijan Bauer, Daniel IBM Corp Zurich Res Lab Zurich Switzerland

ISBN: (纸本)9781424437511

Much research has been done in the area of software transactional memory. (STM) as a new programming paradigm to help ease the implementation of parallel applications. While most research has been invested for answering the question of how STM should be implemented, there is less work about how to use STM efficiently. This paper is focused on the challenge of how to use STM for efficient and scalable implementations of non-trivial applications. We present a fine-grained STM-based concurrent binary heap, an application of STAT for a data structure that is notoriously difficult to parallelize. We describe extensions to the basic STM approach and also the benefits of our proposal. Our results show that the fine-grained STM-based binary heap provides very good scalability compared to the naive approach. Nevertheless, rye reach a point where the complexity of some fine-grained techniques do not justify its use for the increase in performance that can be obtained.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Priority-based RR scheduling for soft real-time distributed object system

Priority-based RR scheduling for soft real-time distributed ...

引用

international conference on parallel and distributed processing techniques and applications

作者： Baek, S Rim, H Kim, S Sogang Univ Dept Comp Sci Seoul South Korea

ISBN: (纸本)1892512459

The real-time scheduling schemes proposed for RT CORBA are mostly priority-based, soft real-time scheduling schemes. The problem of the previous scheme is that the priority giving and the request allocating procedure are considered as two different things. In the worst case, the tasks of imminent deadlines can be allocated on the same sever and the continuous deadline violations can occur. In general real-time system, the punctuality of deadline is more emphasized than the task throughput. Therefore, a modified scheduling algorithm is required, which takes the priority distribution into account when allocating a request. Our scheduling scheme, Priority-based RR tries to evenly distribute the task priorities on local severs by controlling the Round-Robin scheduling order according to the task urgency. Simulation says that Priority-based RR distribution can show the cost effective performance when the system load isn't too high.

关键词： real-time scheduling priority distribution

来源：评论

学校读者我要写书评

暂无评论

StreamGen: A workload generation tool for distributed information flow applications

StreamGen: A workload generation tool for distributed inform...

引用

33rd international conference on parallel processing

作者： Mansour, M Wolf, M Schwan, K Georgia Inst Technol Coll Comp Atlanta GA 30332 USA

ISBN: (纸本)0769521975

This paper presents the StreamGen load generator, which is targeted at distributed information flow applications. These include the event streaming services used in wide-area publish/subscribe systems or in operational information systems, the data streaming services used in remote visualization or collaboration, and the continuous data streams occurring in download services. Running across heterogeneous distributed platforms, these services are implemented by computational component that capture, manipulate, and produce information streams and are linked via overlay topologies. StreamGen can be used to produce the distributed computational and communication loads imposed by these applications. Dynamic application behaviors can be created with mathematical specifications or with behavior traces collected from application-level traces. An interesting set of traces presented in this paper is derived from long-term observations of the FTP download patterns observed at the Linux mirror site being run by the CERCS research center at the Georgia Institute of Technology. Two different flow-based applications are created and evaluated with StreamGen. The first emulates the data streaming behavior in a distributed scientific collaboration, where a scientific simulation (i.e., a molecular dynamics code) produces simulation data sent to and displayed for multiple, interactive remote users. The second emulates portions of the event-streaming behavior of an operational information system used by a large U.S. corporation. Parametric studies with StreamGen's FTP traces applied to these applications are used to evaluate different load balancing strategies for the cluster machines manipulating these applications' data streams.

关键词： distributed computer systems

来源：评论

学校读者我要写书评

暂无评论

parallel Searching on Biological Networks 27

Parallel Searching on Biological Networks

引用

27th Euromicro international conference on parallel, distributed and Network-Based processing (PDP)

作者： Bombieri, Nicola Bonnici, Vincenzo Giugno, Rosalba Univ Verona Dipartimento Informat Strata Grazie 15 I-37134 Verona Italy

ISBN: (纸本)9781728116440

Software applications for biological networks analysis rely on graphs to model the structure interactions. A great part of them requires searching for subgraphs in a target graph or in collections of graphs. Even though very efficient algorithms have been defined to solve such a subgraph isomorphisms problem, the complexity of current real biological networks make their sequential execution time prohibitive. On the other hand, parallel architectures, from multi-core to many-core, have become pervasive to deal with the problem of the data size. Nevertheless, the sequential nature of the graph searching algorithms makes their implementation for parallel architectures very challenging. This paper presents three different parallel solutions for the graph searching problem. The first two target the exact search for multi-core CPUs and many-core GPUs, respectively. The third one targets the approximate search for GPUs, which handles node, edge, and node label mismatches. The paper shows how different techniques have been developed in all the solutions to reduce the search space complexity. The paper shows the performance of the proposed solutions on representative biological networks containing antiviral chemical compounds and protein interactions networks.

关键词： Biology Search problems Complexity theory Graphics processing units Indexes Topology

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：