检索结果-内蒙古大学图书馆

23rd ieee International parallel and distributed processing symposium

作者： Kaiser, Christian Hoefler, Torsten Bierbaum, Boris Bemmerl, thomas Dolphin Interconnect Solut Siebengebirgsblick 26 D-53343 Wachtberg Germany Indiana Univ Open Syst Lab Bloomington IN 47405 USA Rhein Westfal TH Aachen Univ Chair Operating Syst D-52056 Aachen Germany

ISBN: (纸本)9781424437511

Nonblocking collective communication operations are currently being considered for inclusion into the MPI standard and are at? area of active research. the benefits of such operations are documented by several recent publications, but so jar research concentrates on InfiniBand clusters. this paper describes an implementation of nonblocking collectives for clusters with the Scalable Coherent Interface (SCI) interconnect. We use synthetic and application kernel benchmarks to show, that with nonblocking functions for collective communication performance enhancements can be achieved on SCI systems. Our results indicate that for the implementation of these nonblocking collectives data transfer methods other that? those usually used for the blocking version should be considered to realize such improvements.

关键词： Benchmarking

来源：评论

学校读者我要写书评

暂无评论

Intrusion detection and tolerance for transaction based applications in wireless environments

Intrusion detection and tolerance for transaction based appl...

引用

23rd ieee International parallel and distributed processing symposium

作者： Djemaiel, Yacine Boudriga, Noureddine Univ 7th November Carthage CN&S Res Lab Tunis Tunisia

ISBN: (纸本)9781424437511

Nowadays, many intrusion detection and tolerance systems have been proposed in order to detect attacks in both wired and wireless networks. Even if these solutions have shown some efficiency by detecting a set of complex attacks in wireless environments, they are unable to detect attacks using transaction bared traffic in wireless environments. In this context, we propose an intrusion detection and tolerance scheme that is able to monitor heterogeneous traffic and to detect and tolerate attacks targeting transaction based applications interoperating in wireless environments. A case study is given to illustrate the proposed system capabilities against a complex attack scenario targeting a multi-player wireless gaming service.

关键词： transaction intrusion detection intrusion tolerance wireless environment scheduler

来源：评论

学校读者我要写书评

暂无评论

Sequence-preserving parallel IP lookup using multiple SRAM-based pipelines

引用

JOURNAL OF parallel AND distributed COMPUTING 2009年第9期69卷 778-789页

作者： Jiang, Weirong Prasanna, Viktor K. Univ So Calif Ming Hsieh Dept Elect Engn Los Angeles CA 90089 USA

SRAM (static random access memory)-based pipelined algorithmic solutions have become competitive alternatives to TCAMs (ternary content addressable memories) for high-throughput IP lookup. Multiple pipelines can be utilized in parallel to improve the throughput further. However, several challenges must be addressed to make such solutions feasible. First, the memory distribution over different pipelines, as well as across different stages of each pipeline, must be balanced. Second, the traffic among these pipelines should be balanced. third, the intra-flow packet order (i.e. the sequence) must be preserved. In this paper, we propose a parallel SRAM-based multi-pipeline architecture for IP lookup. A two-level mapping scheme is developed to balance the memory requirement among the pipelines as well as across the stages in each pipeline. To balance the traffic, we propose an early caching scheme to exploit the data locality inherent in the architecture. Our technique uses neither a large reorder buffer nor complex reorder logic. Instead, a flow-aware queuing scheme exploiting the flow information is used to maintain the intra-flow sequence. Extensive simulation using real-life traffic traces shows that the proposed architecture with 8 pipelines can achieve a throughput of up to 10 billion packets per second, i.e. 3.2 Tbps for minimum size (40 bytes) packets, while preserving intra-flow packet order. (c) 2009 Elsevier Inc. All rights reserved.

关键词： IP lookup Pipeline SRAM Router

来源：评论

学校读者我要写书评

暂无评论

An Approach for parallel Interest Matching in distributed Virtual Environments 09

An Approach for Parallel Interest Matching in Distributed Vi...

引用

13th ieee/ACM symposium on distributed Simulation and Real-Time Applications (DS-RT 2009)

作者： Liu, Elvis S. theodoropoulos, Georgios K. Univ Birmingham Sch Comp Sci Birmingham B15 2TT W Midlands England

ISBN: (纸本)9780769538686

Interest management is essential for real-time large-scale distributed virtual environments (DVEs) which seeks to filter irrelevant messages on the network. Many existing interest management schemes such as HLA DDM focus on providing precise message filtering mechanisms. However, this leads to a second problem: the computational overhead of the interest matching process. If the CPU cost of interest matching is too high, it would be unsuitable for real-time applications such as multiplayer online games for which runtime performance is important. this paper evaluates the performance of existing interest matching algorithms and proposes a new algorithm based on parallel processing. the new algorithm is expected to have better computational efficiency than existing algorithms and maintain the same accuracy of message filtering as them. Experimental evidence shows that our approach works well in practice.

关键词： Computer Games distributed Virtual Environments Interest Management High Level Architecture Data Distribution Management

来源：评论

学校读者我要写书评

暂无评论

Switching to High Gear: Opportunities for Grand-scale Real-time parallel Simulations 09

Switching to High Gear: Opportunities for Grand-scale Real-t...

引用

13th ieee/ACM symposium on distributed Simulation and Real-Time Applications (DS-RT 2009)

作者： Perumalla, Kalyan S. Oak Ridge Natl Lab Oak Ridge TN 37831 USA

ISBN: (纸本)9780769538686

the recent emergence of dramatically large computational power, spanning desktops with multi-core processors and multiple graphics cards to supercomputers with 10(5) processor cores, has suddenly resulted in simulation-based solutions trailing behind in the ability to fully tap the new computational capacity. Here, we motivate the need for switching the parallel simulation research to a higher gear to exploit the new, immense levels of computational power. the potential for grand-scale real-time solutions is illustrated using preliminary results from prototypes in four example application areas: (a) state- or regional-scale vehicular mobility modeling, (b) very large-scale epidemic modeling, (c) modeling the propagation of wireless network signals in very large, cluttered terrains, and, (d) country- or world-scale social behavioral modeling. We believe the stage is perfectly poised for the parallel/distributed simulation community to envision and formulate similar grand-scale, real-time simulation-based solutions in many application areas.

关键词： Vehicular Traffic Social Behaviors Real-time Simulation parallel Simulation Epidemic Simulations Wireless Signal

来源：评论

学校读者我要写书评

暂无评论

Toward Adjoinable MPI

Toward Adjoinable MPI

引用

23rd ieee International parallel and distributed processing symposium

作者： Utke, Jean Hascoet, Laurent Heimbach, Patrick Hill, Chris Hovland, Paul Naumann, Uwe Univ Chicago Chicago IL 60637 USA Argonne Natl Lab Argonne IL USA INRIA Sophia Antipolis Valbonne France MIT EAPS Cambridge MA USA Rhein Westfal TH Aachen Dept Comp Sci Aachen Germany

ISBN: (纸本)9781424437511

Automatic differentiation is the primary means of obtaining analytic derivatives from a numerical model given as a computer program. therefore, it is an essential productivity tool in numerous computational science and engineering domains. Computing gradients with the adjoint (also called reverse) mode via source transformation is a particularly beneficial but also challenging use of automatic differentiation. To date only ad hoc solutions for adjoint differentiation of MPI programs have been available, forcing automatic differentiation tool users to reason about parallel communication dataflow and dependencies and manually develop adjoint communication code. Using the communication graph as a model we characterize the principal problems of adjoining the most frequently used communication idioms. We propose solutions to cover these idioms and consider the consequences for the MPI implementation, the MPI user and MPI-aware program analysis. the MIT general circulation model serves as a use case to illustrate the viability of our approach.

关键词： MPI automatic differentiation source transformation reverse mode

来源：评论

学校读者我要写书评

暂无评论

CaravelaMPI: Message Passing Interface for parallel GPU-based Applications

CaravelaMPI: Message Passing Interface for Parallel GPU-base...

引用

8th International symposium on parallel and distributed Computing

作者： Yamagiwa, Shinichi Sousa, Leonel INESC ID IST P-1000029 Lisbon Portugal

ISBN: (纸本)9780769536804

With the ever increasing demand for high quality 3D image processing on markets such as cinema and gaming, graphics processing units (GPUs) capabilities have shown tremendous advances. Although GPU-based cluster computing, which uses GPUs as the processing units, is one of the most promising high performance parallel computing platforms, currently there is no programming environment, interface or library designed to use these multiple computing resources to compute tasks in parallel. this paper proposes the CaravelaMPI, a new message passing interface targeted for GPU cluster computing, providing a unified and transparent interface to manage both communication and GPU execution. Experimental results show that the transparent interface of CaravelaMPI allows to efficiently program GPU-based clusters, not only decreasing the required programming effort but also increasing the performance of GPU-based cluster computing platforms.

关键词： Graphics processing unit

来源：评论

学校读者我要写书评

暂无评论

PIC detector for joint distributed STBC under imperfect synchronization

PIC detector for joint distributed STBC under imperfect sync...

引用

5th International Conference on Wireless Communications, Networking and Mobile Computing, WiCOM 2009

作者： Xin, Wang Zhuo, Wu Shanghai University China

ISBN: (纸本)9781424436934

In this paper, a new signal detection scheme using both parallel interference cancellation (PIC) and equalization for the efficient joint distributed space-time coding is proposed to suppress the impact of imperfect synchronisation in distributed cellular systems. Simulation results show that the proposed scheme outperforms the existing joint DSTBC decoded scheme in terms of BER. And also it achieves the same system capacity as the existing scheme. Meanwhile, a low structure and computational complexity has been retained. ©2009 ieee.

关键词： Synchronization

来源：评论

学校读者我要写书评

暂无评论

Scalability of efficient parallel K-Means

Scalability of efficient parallel K-Means

引用

2009 5th ieee International Conference on e-Science Workshops, e-science 2009

作者： Pettinger, David Fatta, Giuseppe Di School of Systems Engineering University of Reading Whiteknights Reading Berkshire RG6 6AY United Kingdom

ISBN: (纸本)9781424459452

Clustering is defined as the grouping of similar items in a set, and is an important process within the field of data mining. As the amount of data for various applications continues to increase, in terms of its size and dimensionality, it is necessary to have efficient clustering methods. A popular clustering algorithm is K-Means, which adopts a greedy approach to produce a set of K-clusters with associated centres of mass, and uses a squared error distortion measure to determine convergence. Methods for improving the efficiency of K-Means have been largely explored in two main directions. the amount of computation can be significantly reduced by adopting a more efficient data structure, notably a multi-dimensional binary search tree (KD-Tree) to store either centroids or data points. A second direction is parallel processing, where data and computation loads are distributed over many processing nodes. However, little work has been done to provide a parallel formulation of the efficient sequential techniques based on KD-Trees. Such approaches are expected to have an irregular distribution of computation load and can suffer from load imbalance. this issue has so far limited the adoption of these efficient K-Means techniques in parallel computational environments. In this work, we provide a parallel formulation for the KD-Tree based K-Means algorithm and address its load balancing issues. © 2009 ieee.

关键词： Clustering algorithms

来源：评论

学校读者我要写书评

暂无评论

Horizon - Exploiting Timing Information for parallel Network Simulation

Horizon - Exploiting Timing Information for Parallel Network...

引用

17th ieee International symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems

作者： Kunz, Georg Landsiedel, Olaf Wehrle, Klaus Rhein Westfal TH Aachen Distributed Syst Grp Aachen Germany

ISBN: (纸本)9781424449262

Network simulation faces an increasing demand for highly detailed simulation models which in turn require efficient handling of their inherent computational complexity. this demand for detailed models includes both accurate estimations of processing time and in-depth modeling of wireless technologies. For instance, one might want to investigate if a particular device can incorporate a computationally complex radio transmission technology while meeting the deadlines of a multi-media streaming application such as VoIP.

关键词： Media streaming

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：