the scalability of a parallel algorithm on a parallel architecture is a measure of its capability to effectively utilize an increasing number of processors. the scalability analysis may be used to select the best algo...
详细信息
the proceedings contain 59 papers. the special focus in this conference is on parallel Computing in Regular Structures. the topics include: Analytical modeling of parallel application in heterogeneous computing enviro...
ISBN:
(纸本)3540663630
the proceedings contain 59 papers. the special focus in this conference is on parallel Computing in Regular Structures. the topics include: Analytical modeling of parallel application in heterogeneous computing environments;skeletons and transformations in an integrated parallel programming environment;sequential unification and aggressive lookahead mechanisms for data memory accesses;a coordination model and facilities for efficient parallel computation;parallelizing of sequential programs on the basis of pipeline and speculative features of the operators;kinetic model of parallel data processing;PSA approach to population models for parallel genetic algorithms;highly accurate numerical methods for incompressible 3D fluid flows on parallelarchitectures;dynamic task scheduling with precedence constraints and communication delays;two-dimensional scheduling of algorithms with uniform dependencies;consistent lamport clocks for asynchronous groups with process crashes;comparative analysis of learning methods of cellular-neural associative memory;emergence and propagation of round autowave in cellular neural network;routing and embeddings in super cayley graphs;implementing cellular automata based models on parallelarchitectures;overview, design innovations, and preliminary results;implementing model checking and equivalence checking for time petri nets by the RT-MEC tool;learning concurrent programming;the speedup performance of an associative memory based logic simulator;a high-level programming environment for distributed memory architectures;virtual shared files;an object oriented environment to manage the parallelism of the FIIT applications;performance studies of shared-nothing parallel transaction processing systems;synergetic tool environments and logically instantaneous communication on top of distributed memory parallel machines.
this paper introduces a number of modifications that allow for significant improvements of parallel LLL reduction. Experiments show that these modifications result in an increase of the speed-up by a factor of more th...
详细信息
ISBN:
(纸本)9783642246494
this paper introduces a number of modifications that allow for significant improvements of parallel LLL reduction. Experiments show that these modifications result in an increase of the speed-up by a factor of more than 1.35 for SVP challenge type lattice bases in comparing the new algorithm withthe state-of-the-art parallel LLL algorithm.
Hierarchical clustering technology plays a very important role in image processing, intrusion detection and bioinformatics applications, which is one of the most extensively studied branch in data mining. Presently th...
详细信息
ISBN:
(纸本)9780769549323;9781467356527
Hierarchical clustering technology plays a very important role in image processing, intrusion detection and bioinformatics applications, which is one of the most extensively studied branch in data mining. Presently the parallel hierarchical algorithms aren't very good at processing large data. To overcome this shortcomings, a new parallel data preprocessing algorithm based on Hierarchical Clustering is proposed in this paper this algorithm can reduce the scale of data and runtime. accounting for one tenth of it in the best situation. the experiment proof the performance of our algorithm.
Associative processing based on content-addressable memories has been argued to be the natural solution for non-numerical information processing applications. Unfortunately, the implementation requirements of these ar...
详细信息
the space track catalog of satellites in orbit is generally maintained using analytic methods. New technology developments in the area of parallelprocessing provide the capability to apply more exact methods to deter...
详细信息
ISBN:
(纸本)0819411906
the space track catalog of satellites in orbit is generally maintained using analytic methods. New technology developments in the area of parallelprocessing provide the capability to apply more exact methods to determine and maintain the ephemerides of a larger number of space objects more precisely. Space object tracking accuracy is becoming increasingly important in space programs such as the Space Station, where the collision hazard is critical, and for military application requiring precise positioning of satellites. Affordable massively parallelprocessingarchitectures will soon be available to address this problem. We expect that before long the number of processing elements in a single affordable box will approach or exceed the number of satellites in orbit. In this paper we consider algorithms and architectures for processing a large number of space objects in a parallel sense. these improvements will enable the tracking of many small objects with precision and will improve the confidence with which collision hazards can be assessed. In addition, as sensor capabilities are improved through technology upgrades, the accuracy of these computational methods will continue to exceed the precision of the measurements.
Withthe rapid development of Internet and the continuous rise of network users, the network traffic in various regions is increasing rapidly. In the face of a large number of high speed and high throughput of the net...
详细信息
ISBN:
(纸本)9781538694039
Withthe rapid development of Internet and the continuous rise of network users, the network traffic in various regions is increasing rapidly. In the face of a large number of high speed and high throughput of the network environment, traditional packet capture methods and processing capabilities cannot reach the corresponding speed, which results in severe packet loss. this paper focuses on a high-performance packet acquisition and distribution method to break through the performance bottleneck of universal servers and network cards. this paper studies a packet capture method based on DPDK platform, and uses the processing of hash value in RSS to improve the efficiency of data packet distribution, which realizes the process from performance acquisition to efficiently multi-core parallelprocessing. this method can effectively reduce packet loss and improve the data packet processing rate. It can also reduce resource waste and network overhead for traffic capture and distribution. Preliminary experiments show that DPDK-based traffic processing has obvious advantages over PF-RING and Netmap in data processing speed.
the emergence and continuing use of multi-core architectures and graphics processing units require changes in the existing software and sometimes even a redesign of the established algorithms in order to take advantag...
详细信息
the emergence and continuing use of multi-core architectures and graphics processing units require changes in the existing software and sometimes even a redesign of the established algorithms in order to take advantage of now prevailing parallelism. parallel Linear Algebra for Scalable Multi-core architectures (PLASMA) and Matrix Algebra on GPU and Multics architectures (MAGMA) are two projects that aims to achieve high performance and portability across a wide range of multi-core architectures and hybrid systems respectively. We present in this document a comparative study of PLASMA's performance against established linear algebra packages and some preliminary results of MAGMA on hybrid multi-core and GPU systems.
this paper describes a new parallel Branch-and-Bound algorithm for solving the classical permutation flow shop scheduling problem as well as its implementation on a cluster of six computers. the experimental study of ...
详细信息
ISBN:
(纸本)9783642131356
this paper describes a new parallel Branch-and-Bound algorithm for solving the classical permutation flow shop scheduling problem as well as its implementation on a cluster of six computers. the experimental study of our distributed parallel algorithm gives promising results and shows clearly the benefit of the parallel paradigm to solve large-scale instances in moderate CPU time.
A concept for a future integer arithmetic unit as well as a first implementation of the arithmetic unit's core as smart pixel detector chip is presented. this architecture is well-suited for a realization with 3-D...
详细信息
ISBN:
(纸本)0818685727
A concept for a future integer arithmetic unit as well as a first implementation of the arithmetic unit's core as smart pixel detector chip is presented. this architecture is well-suited for a realization with 3-D optoelectronic very large scale integrated (VLSI) circuits. Due to the use of optical interconnections running vertically to the circuit's surface no pin limitation is given. this allows massively parallelism and a higher throughput performance than in all-electronic solutions. To exploit the potential of optical interconnections in VLSI systems efficiently well-adapted low-level algorithms and architectures have to be developed. this is demonstrated for a pipelined arithmetic unit using a redundant number representation. A gate layout for the optoelectronic circuits is given as well as a specification for the necessary optical interconnection scheme linking the circuits with free-space optics. It is shown that the throughput can be increased by a factor of 10 to 50 compared to current all-electronic processors by considering state-of-the-art optical and optoelectronic technolgy.
暂无评论