ISBN (print): 0769520804
Driven by the ever-increasing algorithm complexity in the field of mobile communication systems, SIMD DSP architectures have emerged as an approach that offers the necessary processing power at reasonable levels of die size and power consumption. However, this kind of DSP architecture imposes new challenges on programmers, since algorithms have to be designed to exploit the parallelism available on the processor. Taking as a starting point an algebraic framework that captures the SIMD computational model, we report in this paper on our efforts to design and automatically generate object code for our family of DSP architectures, independent of the available SIMD parallelism. We show how these algebraic structures can be used as a high-level programming language that offers a unified approach to designing and describing algorithms using SIMD parallelism. Moreover, we show how these algebraic structures offer concise rules for automatic code generation.
ISBN (print): 7506262096
The problem of load balancing is a key factor that influences the performance of a cluster computing system. In this paper, we describe a dynamic scheduling system and a load balancing system based on PVM. This system supports dynamic scheduling of parallel processes and balances load in a cluster computing environment, and the work is used in a CSCW application. The results of the performance test show that this dynamic load balancing system can reduce the execution time of some parallel applications.
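The abstract gives no detail of the scheduling policy, so the following is only a minimal sketch of one common dynamic load balancing rule: each arriving task goes to the node with the smallest accumulated load. The function name `assign_tasks`, the task cost estimates, and the greedy policy itself are assumptions made for illustration; nothing here uses the PVM API or reproduces the authors' system.

```python
import heapq

def assign_tasks(task_costs, num_nodes):
    """Greedy least-loaded assignment (hypothetical policy, not the paper's).

    task_costs: estimated cost of each task, in arrival order.
    Returns (assignment, loads): the node chosen for each task and the
    final accumulated load of every node.
    """
    # Min-heap of (current load, node id); ties go to the lower node id.
    heap = [(0, node) for node in range(num_nodes)]
    heapq.heapify(heap)

    assignment = []
    for cost in task_costs:
        load, node = heapq.heappop(heap)      # least-loaded node right now
        assignment.append(node)
        heapq.heappush(heap, (load + cost, node))

    loads = [0] * num_nodes
    for load, node in heap:
        loads[node] = load
    return assignment, loads


if __name__ == "__main__":
    print(assign_tasks([5, 3, 2, 4], num_nodes=2))
```

A real cluster scheduler would replace the static cost estimates with measured node load, which is presumably where the dynamic aspect of the described system comes in.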
ISBN (print): 0769522297
Irregular and dynamic memory reference patterns can cause performance variations for low-level algorithms in general and for parallel algorithms in particular. We present an adaptive algorithm selection framework which can collect and interpret the inputs of a particular instance of a parallel algorithm and select the best-performing one from an existing library. In this paper we present the dynamic selection of parallel reduction algorithms. First, we introduce a set of high-level parameters that can characterize different parallel reduction algorithms. Then we describe an off-line, systematic process to generate predictive models which can be used for run-time algorithm selection. Our experiments show that our framework: (a) selects the most appropriate algorithms in 85% of the cases studied, (b) overall delivers 98% of the optimal performance, (c) adaptively selects the best algorithms for dynamic phases of a running program (resulting in performance improvements otherwise not possible), and (d) adapts to the underlying machine architecture (tested on IBM Regatta and HP V-Class systems).
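The idea of characterizing an input and then choosing a reduction algorithm at run time can be sketched as follows. This is a toy illustration under stated assumptions: the two strategies, the `sparsity` and `connectivity` parameters, and the fixed threshold are hypothetical stand-ins, whereas the paper derives its predictive models off-line. The chunks are processed sequentially here and only stand in for parallel workers.

```python
from collections import defaultdict

def _chunks(indices, values, parts):
    """Split the (index, value) stream into at most `parts` contiguous chunks."""
    step = max(1, -(-len(indices) // parts))  # ceiling division, at least 1
    for start in range(0, len(indices), step):
        yield indices[start:start + step], values[start:start + step]

def reduce_replicated(indices, values, size, parts=4):
    """Each chunk accumulates into a private full-size buffer; buffers are merged."""
    partials = []
    for idx, val in _chunks(indices, values, parts):
        buf = [0] * size
        for i, v in zip(idx, val):
            buf[i] += v
        partials.append(buf)
    if not partials:
        return [0] * size
    return [sum(column) for column in zip(*partials)]

def reduce_hashed(indices, values, size, parts=4):
    """Each chunk accumulates only the elements it touches, then merges them."""
    out = [0] * size
    for idx, val in _chunks(indices, values, parts):
        local = defaultdict(int)
        for i, v in zip(idx, val):
            local[i] += v
        for i, v in local.items():
            out[i] += v
    return out

def characterize(indices, size):
    """Hypothetical high-level parameters of the reference pattern."""
    touched = len(set(indices))
    return {
        "sparsity": touched / max(1, size),              # fraction of elements touched
        "connectivity": len(indices) / max(1, touched),  # references per touched element
    }

def select_reduction(indices, size, sparsity_threshold=0.25):
    """Hand-written rule standing in for an off-line trained predictive model."""
    params = characterize(indices, size)
    if params["sparsity"] >= sparsity_threshold:
        return reduce_replicated
    return reduce_hashed


if __name__ == "__main__":
    idx, val = [0, 2, 2, 5], [1, 2, 3, 4]
    algo = select_reduction(idx, size=8)
    print(algo.__name__, algo(idx, val, 8))
```

Both strategies compute the same result; the selection only changes how the work is organized, which is what makes run-time switching safe.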
ISBN (print): 3540230920
The influence of different parallel genetic algorithm (PGA) architectures on the GA convergence properties is analysed. Next, two proposed versions of these PGA architectures, homogeneous and heterogeneous, are compared. Finally, the effect of re-initialisation in some partial populations on the PGA convergence is analysed. The proposed PGA modifications are useful mainly for the optimisation of non-smooth cost functions.
ISBN (print): 0769522297
By converting thread-level parallelism to instruction-level parallelism, Simultaneous Multithreaded (SMT) processors are emerging as effective ways to utilize the resources of modern superscalar architectures. However, the full potential of SMT has not yet been reached, as most modern operating systems use existing single-thread or multiprocessor algorithms to schedule threads, neglecting contention for resources between threads. To date, even the best SMT scheduling algorithms simply try to group threads for co-residency based on each thread's expected resource utilization but do not take into account variance in thread behavior. As such, we introduce architectural support that enables new thread scheduling algorithms to group threads for co-residency based on fine-grain memory system activity information. The proposed memory monitoring framework centers on the concept of a cache activity vector, which exposes runtime cache resource information to the operating system to improve job scheduling. Using this scheduling technique, we experimentally evaluate the overall performance improvement of workloads on an SMT machine compared against the most recent Linux job scheduler. This work is first motivated with experiments in a simulated environment, then validated on a Hyperthreading-enabled Intel Pentium-4 Xeon microprocessor running a modified version of the latest Linux kernel.
In a review paper about audio-visual (AV) fusion models in speech perception, we (Schwartz et al., 1998) proposed a taxonomy of models around two basic questions: architecture and control. Six years later, the proposals we made still seem rather convenient for discussing major questions about AV fusion. Moreover, and more importantly, recent experimental and theoretical progress seems to provide some elements of an answer in both respects. The aim of this paper is to review these elements and to incorporate them into the general architecture-and-control framework.
ISBN (print): 0780386531
This paper extends our previous work [1] on the island model parallel hybrid-genetic algorithm (PHGA) for large-scale Quadratic Assignment Problems (QAPs). Some issues concerning the control parameters of the migration process, and how they affect the quality of the solutions and the efficiency of the algorithm, deserve further evaluative study. In this paper, we investigate the effect of migration topology on the performance of the PHGA. Two topologies, a one-way ring topology and a random topology, are studied and analyzed. The empirical results show that the PHGA with the ring topology is better able to achieve an appropriate tradeoff between exploration and exploitation, and is hence more helpful in improving the performance of the PHGA for solving large-scale QAPs.
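The two migration topologies can be illustrated with a small sketch. It assumes each island sends a copy of its best individual to one target island, where it replaces the worst individual; that migration policy, the function names, and the use of plain integers as individuals (with cost equal to their value) are assumptions made for illustration, not details taken from the paper. In a QAP setting the individuals would be permutations and `cost` the assignment cost.

```python
import random

def ring_topology(n):
    """One-way ring: island i always sends to island (i + 1) mod n."""
    return {i: (i + 1) % n for i in range(n)}

def random_topology(n, rng):
    """Each island sends to a randomly chosen other island (requires n >= 2)."""
    return {i: rng.choice([j for j in range(n) if j != i]) for i in range(n)}

def migrate(islands, topology, cost):
    """Return new islands after one synchronous migration step.

    The best individual (lowest cost) of each source island is copied to its
    target island, where it replaces the current worst individual.
    """
    # Pick all emigrants first so the step does not depend on island order.
    emigrants = {src: min(pop, key=cost) for src, pop in enumerate(islands)}
    new_islands = [list(pop) for pop in islands]
    for src, dst in topology.items():
        target = new_islands[dst]
        worst = max(range(len(target)), key=lambda k: cost(target[k]))
        target[worst] = emigrants[src]
    return new_islands


if __name__ == "__main__":
    islands = [[5, 9], [3, 7], [1, 8]]   # toy individuals; cost is the value itself
    print(migrate(islands, ring_topology(3), cost=lambda x: x))
    print(random_topology(3, random.Random(0)))
```

With the ring, a good individual needs several migration steps to reach distant islands, which is one plausible reading of why the ring preserves more exploration than random links.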
ISBN (print): 0769521258
Implementing image-processing systems can require significant effort and resources due to information volume and algorithm complexity. Model Integrated Computing (MIC) based image-processing systems show promise in supporting solutions to these complex problems. While MIC has contributed to the advancement of performing complex image-processing tasks on parallel embedded systems, it has not addressed a challenging class of algorithms that adapt the image-processing algorithm based on the information or state of the image-processing system. This proposed effort addresses creating an adaptive image-processing environment based on MIC that allows solutions to complex image-processing problems to be built and executed rapidly. This effort will involve creating a new modeling representation for image-processing adaptation mechanisms. The proposed MIC-based adaptive image-processing environment will generate a solution given the modeling constraints and execute it on a number of hardware architectures.
ISBN (print): 7506262096
Dealing with changes in processes has become unavoidable in deploying workflow. Improving the ability of workflow management to deal with changes has emerged as a hot research topic in the area of workflow management. According to their changeable characteristics, workflow processes are categorized into dynamic, adaptive and flexible processes. In this paper, we primarily deal with the flexible processes. Flexibility is the ability of a workflow process to execute on the basis of a loosely or partially specified model, where the full specification of the model is made at runtime and may be unique to each instance. To provide full support for flexibility, we propose a simple but applicable flexible workflow model based on ECA rules and composition of activities at runtime. Furthermore, selection constraints and composition constraints are put forward to ensure the valid selection and composition of activities at runtime. We also design two algorithms: one automates the composition with the best concurrence, which is defined as the ability of activities to be executed in parallel, and the other verifies the validity of the manual selection and composition.
Tone is an essential component for word formation in all tone languages. Substantial work has been done on using tone information to improve speech recognition of tone languages. In this paper, a new method, called Parallel Tone Score Association (PTSA), for effectively and efficiently using tone in speech recognition is proposed. Experimental results show that the relative character error rates are reduced by as much as 20.94% for Cantonese and 20.49% for Mandarin compared with the recognition results without tone information. This relative reduction in error rates compares favorably with results reported for other recognition experiments on tone languages.