Search Results - Inner Mongolia University Library

20th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2006

Authors: Picconi, Fabio; Sens, Pierre (Laboratoire d'Informatique de Paris 6, France; INRIA Rocquencourt, France)

ISBN: (print) 1424400546

Distributed Hash Tables (DHTs) provide a means to build a completely decentralized, large-scale persistent storage service from the individual storage capacities contributed by each node of the peer-to-peer overlay. However, persistence can only be achieved if nodes are highly available, that is, if they stay most of the time connected to the overlay. In this paper we present an incentives-based mechanism to increase the availability of DHT nodes, thereby providing better data persistence for DHT users. High availability increases a node's reputation, which translates into access to more DHT resources and a better Quality-of-Service. The mechanism required for tracking a node's reputation is completely decentralized, and is based on certificates reporting a node's availability which are generated and signed by the node's neighbors. An audit mechanism deters collusive neighbors from generating fake certificates to take advantage of the system. © 2006 IEEE.

Keywords: distributed computer systems


Exploiting parallelism of MPEG-4 decoder with dataflow programming on multicore processor


IEEE International Symposium on Parallel and Distributed Processing with Applications

Authors: Chen, Zhong-Ho; Chen, Ta-Chun; Chien, Jung-Yin; Su, Alvin; Shieh, Ce-Kuen (Dept. of CSIE, National Cheng-Kung University, Tainan, Taiwan; Dept. of EE, National Cheng-Kung University, Tainan, Taiwan)

ISBN: (print) 9780769541907

Multicore processors provide large computation capability but also involve complicated parallel programming. One of the major considerations in parallel programming is performance. Traditional design methodologies, which usually start a design on a selected platform, spend a lot of effort and time on performance tuning and debugging. When the platform is changed, even to one with a different number of cores, considerable redesign effort is required. Hence a flexible design methodology is necessary. In this paper, a design methodology is presented for video codecs on multicore processors, using an MPEG-4 SP decoder as an example. The parallelisms of the MPEG-4 decoder are discussed and exposed with the dataflow model. The dataflow model provides a high-level abstraction of the underlying hardware. Computation and communication of the MPEG-4 decoder are separated and represented as modules and channels, respectively. It is possible to synthesize the model targeting either dedicated hardware or software on a multiprocessor. To map the high-level dataflow model to the Cell processor, a mapping flow, including offline profiling, task allocation and runtime libraries, is developed. According to the profiling results, the allocation algorithm allocates tasks on multiprocessors as evenly as possible. An efficient synchronization mechanism on the Cell processor is also proposed. We also discuss the impact of the model and the mapping flow on decoding speed. The results show that the proposed methodology gets a considerable performance boost when the number of cores is increased. © 2010 IEEE.

Keywords: parallel programming


DPX10: An Efficient X10 Framework for Dynamic Programming Applications


44th Annual International Conference on Parallel Processing Workshops (ICPPW)

Authors: Wang, Chen; Yu, Ce; Sun, Jizhou; Meng, Xiangfei (Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China; Nankai Univ, Coll Comp & Control Engn; Natl Supercomp Ctr Tianjin, Tianjin, Peoples R China)

ISBN: (print) 9781467375887

The X10 language and the Asynchronous Partitioned Global Address Space (APGAS) model are an emerging mechanism for programming high-performance computers and commodity clusters. However, little work exists on distributed programming frameworks for dynamic programming (DP) problems based on X10 and the APGAS model. In this paper we present DPX10, an efficient distributed X10 framework for DP applications. DPX10 enables developers to write highly efficient DP programs without much effort. A DPX10 program is specified by a directed acyclic graph (DAG) pattern and a compute method for the vertices. DPX10 provides eight commonly used DAG patterns and a simple API to create custom patterns. The system handles all the tiresome work of implementing parallelization, including DAG distribution, vertex scheduling, and vertex communication. Moreover, a new recovery method for distributed arrays is developed to provide transparent fault tolerance. We describe the design of the framework and use four DP applications with up to a billion vertices on 120 cores to demonstrate its simplicity, efficiency, and scalability.

Keywords: X10; APGAS; dynamic programming; programming framework


Hector: automated task allocation for MPI


Proceedings of the 1996 10th International Parallel Processing Symposium

Authors: Russ, Samuel H.; Flachs, Brian; Robinson, Jonathan; Heckel, Bjorn (Mississippi State Univ, United States)

Many institutions already have networks of workstations, which could potentially be harnessed as a powerful parallel processing resource. A new, automatic task allocation system has been built on top of MPI, an environment that permits parallel programming by using the message-passing paradigm and implemented in extensions to C and FORTRAN. This system, known as 'Hector', supports dynamic migration of tasks and automatic run-time performance optimization. MPI programs can be run without modification under Hector, and can be run on existing networks of workstations. Thus Hector permits institutions to harness existing computational resources quickly and transparently.

Keywords: parallel processing systems


Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM


21st International Parallel and Distributed Processing Symposium, IPDPS 2007

ISBN: (print) 1424409101

The proceedings contain 448 papers. The topics discussed include: building the tree of life on terascale systems; efficient block device sharing over Myrinet with memory bypass; conserving memory bandwidth in chip multiprocessors with runahead execution; towards a better understanding of workload dynamics on data-intensive clusters and grids; energy-aware self-stabilization in mobile ad hoc networks: a multicasting case study; optimizing multiple distributed stream queries using hierarchical network partitions; fast failure detection in a process group; route table partitioning and load balancing for parallel searching with TCAMs; load balancing in the bulk-synchronous-parallel setting using process migrations; capacity sharing and stealing in dynamic server-based real-time systems; and power-aware routing for well-nested communications on the circuit switched tree.

Keywords: Trees (mathematics)


On the optimality of the likelihood-ratio test for local sensor decision rules in the presence of nonideal channels


IEEE Transactions on Information Theory, 2005, vol. 51, no. 2, pp. 693-699

Authors: Chen, B; Willett, PK (Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244, USA; Univ Connecticut, Dept Elect & Comp Engn, Storrs, CT 06269, USA)

Distributed detection has been intensively studied in the past. In this correspondence, we consider the design of local decision rules in the presence of nonideal transmission channels between the sensors and the fusion center. Under the conditional independence assumption among multiple sensor observations, we show that the optimal local decisions that minimize the error probability at the fusion center amount to a likelihood-ratio test (LRT) given a particular constraint on the fusion rule. This constraint turns out to be quite general and is easily satisfied for most sensible fusion rules. A design example using a parallel sensor fusion structure with binary-symmetric channels (BSCs) between local sensors and the fusion center is given to illustrate the usefulness of the result in obtaining optimal thresholds for local sensor observations. The study that incorporates the transmission channel in sensor system design may have potential applications in the emerging field of wireless sensor networks.

Keywords: distributed detection; likelihood-ratio quantizers; minimum error probability; sensor networks


Realization of Efficient and Low-Power Parallel Face-Detection with Massive-Parallel Memory-Embedded SIMD Matrix


53rd Midwest Symposium on Circuits and Systems (MWSCAS 2010)

Authors: Kumaki, Takeshi; Imai, Yuta; Hiramoto, Hirokazu; Koide, Tetsushi; Mattausch, Hans Juergen (Hiroshima Univ, Res Inst Nanodevices & Bio Syst (RNBS), Higashihiroshima 7398527, Japan)

ISBN: (print) 9781424477739

This paper presents an efficient and low-power-consumption parallel face-detection technology based on Haar-like features and implemented with a massive-parallel memory-embedded SIMD matrix. The massive-parallel memory-embedded SIMD matrix architecture has up to 2,048 2-bit processing elements, which are connected by a flexible switching network, and supports 2-bit 2,048-way bit-serial and word-parallel operations with a single command. For experimental verification of this matrix processing architecture, this parallel Haar-like-feature-based face-detection technique has been implemented on an evaluation board and tested in practice. Evaluation results show that a total processing time of about 313 ms at 162 MHz clock frequency and 150 mW power dissipation can be realized. Thus, the reported parallel face-detection method with the massive-parallel memory-embedded SIMD matrix is a practical technology and is a promising solution for real-time mobile multimedia applications.

Keywords: Face recognition


Time synchronization on SP1 and SP2 parallel systems


Proceedings of the IEEE 9th International Parallel Processing Symposium

Authors: Abali, Bulent; Stunkel, Craig B. (IBM Thomas J. Watson Research Cent, Yorktown Heights, United States)

We describe an experimental time utility for synchronizing the operating system clocks on the SP1 and SP2 parallel system nodes. It synchronizes the node clocks typically within 5 microseconds of each other utilizing the synchronous feature of the SP1 and SP2 interconnection network. This is 2 to 3 orders of magnitude better than what can be achieved by previous methods. Synchronized clocks are useful for parallel program performance measurement and tuning, parallel program tracing and debugging, and gang scheduling of parallel processes, to name a few. We also measure the performance of a widely used time synchronization utility using the SP1 and SP2 interconnection network.

Keywords: parallel processing systems


A study of meta-scheduling architectures for high throughput computing: Pull versus Push


4th International Symposium on Parallel and Distributed Computing (ISPDC 2005)

Authors: Garonne, V; Tsaregorodtsev, A; Caron, E (CNRS, CPPM, IN2P3, F-13288 Marseille 09, France)

ISBN: (print) 0769524346

In this paper we present a model and simulator for many clusters of heterogeneous PCs belonging to a local network. These clusters are assumed to be connected to each other through a global network, and each cluster is managed via a local scheduler which is shared by many users. We validate our simulator by comparing the experimental and analytical results of an M/M/4 queuing system. These studies indicate that the simulator is consistent. Next, we compare it with a real batch system and obtain an average error of 10.5% for the response time and 12% for the makespan. We conclude that the simulator is realistic and describes the behaviour of a large-scale system well. Thus we can study the scheduling of our system, called DIRAC, in a high throughput context. We justify our decentralized, adaptive and opportunistic approach in comparison to a centralized approach in such a context.

Keywords: parallel processing systems


CloudMapper: A Model-based Framework for Portability of Cloud Applications Consuming PaaS Services


25th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)

Authors: Munisso, Riccardo; Chis, Adriana E. (Natl Coll Ireland, Cloud Competency Ctr, Dublin, Ireland)

ISBN: (print) 9781509060580

More and more companies rely on cloud services to provide their online software solutions. Cloud services are offered by a multitude of providers, each of them offering services through proprietary, mostly incompatible interfaces. Developing applications employing these vendor-specific interfaces can create the "vendor lock-in" problem (i.e., the application is tightly coupled to the underlying cloud provider). Consequently, such applications cannot be ported without incurring significant costs and time delay. A cloud services consumer can decide to switch to a different cloud provider based on different criteria such as changes in business requirements, continuously evolving offerings from cloud providers, and cost control. Maintaining the flexibility to change cloud providers in an efficient way can be a challenging task. We propose an efficient model-driven framework for cloud application portability. Our approach enables applications consuming REST resources in the cloud to be transferred to different cloud providers without the need to refactor the applications. The framework supports a wide range of cloud resources. The framework produces an intermediation layer which translates the calls between the format of the initial cloud platform and the new target cloud platform. The intermediation layer can be consumed by any programming language. We demonstrate that cloud application portability can be achieved. Our solution successfully maps cloud-based services with an overall median of 100% for requests, and 74.8% for responses. Furthermore, we show that the intermediation layer introduces minimal additional latency.

Keywords: cloud computing; cloud applications; portability; service-oriented architecture; cloud services
