检索结果-内蒙古大学图书馆

Euromicro Workshop on Parallel and Distributed Processing

作者： E. Mehofer B. Scholz Institute for Software Science University of Technology Vienna Vienna Austria Institute of Computer Languages University of Technology Vienna Vienna Austria

ISBN: (纸本)0769509878

In high-performance systems execution time is of crucial importance justifying advanced optimization techniques. Traditionally, optimization is based on static program analysis. The quality of program optimizations, however, can be substantially improved by utilizing runtime information. Probabilistic data-flow frameworks compute the probability with what data-flow facts may hold at some program point based on representative profile runs. Advanced optimizations can use this information in order to produce highly efficient code. In this paper we introduce a novel optimization technique in the context of High Performance Fortran (HPF) that is based on probabilistic data-flow information. We consider statically undefined attributes which play an important role for parallelization and compute for those attributes the probabilities to hold some specific value during runtime. For the most probable attribute values highly-optimized, specialized code is generated. In this way significantly better performance results can be achieved. The implementation of our optimization is done in the context of VFC, a source-to-source parallelizing compiler for HPF/F90.

关键词： data analysis Runtime Concurrent computing Optimizing compilers Program processors Computer languages data flow computing Information analysis

来源：评论

学校读者我要写书评

暂无评论

Efficient synthesis of array intensive computations onto FPGA based accelerators

Efficient synthesis of array intensive computations onto FPG...

引用

International Conference on VLSI Design

作者： N. Shenoy P. Banerjee A. Choudhary M. Kandemir Electrical and Computer Engineering Northwestern University Evanston IL USA

ISBN: (纸本)0769508316

Array intensive computations are characterized by processing of large arrays stored in external memory in multiple loops. Synthesizing these computations onto FPGAs involves automatic translation of the behavioral description into state machines controlled by a clock such that the execution time of the program as a whole is the minimum and area requirement does not exceed a predefined limit. The synthesis algorithm also needs to efficiently sequence the array, accesses taking into account memory access requirements such as pipelining. In this paper we present two algorithms each with a specific emphasis to handle this synthesis problem. Our heuristic algorithm generates good solutions in a very short time (less than a second), while our mixed integer linear programming (MILP) based algorithm can generate optimal solution given sufficient time. Both try to minimize execution time and area. Our algorithms not only look at individual loops to exploit parallelism but also consider them together while deciding the clock. The overall execution time is minimized and not just the number of cycles or the cycle time. They also efficiently synthesize memory accesses to fully exploit the memory pipelining. We compare these two algorithms in terms of their relative strengths.

关键词： Field programmable gate arrays Acceleration Clocks Automatic control Parallel processing flow graphs data flow computing Pipeline processing Heuristic algorithms Mixed integer linear programming

来源：评论

学校读者我要写书评

暂无评论

Distributed evolutionary design of constant-coefficient multipliers

Distributed evolutionary design of constant-coefficient mult...

引用

IEEE International Conference on Electronics, Circuits and Systems (ICECS)

作者： D. Chen T. Aoki N. Homma T. Higuchi Higuchi Laboratory Graduate School of Information Sciences University of Tohoku Sendai Japan

A parallel version of the evolutionary graph generation (EGG) system, called the distributed EGG (DEGG) system, was developed on a cluster of PCs using a message-passing interface (MPI). To demonstrate the capability of DEGG, it is applied to seeking the optimal design of various multipliers. Experimental results substantially show that DEGG consistently performs better than the EGG and known conventional designs.

关键词： Circuits Personal communication networks Digital signal processing data flow computing Genetic mutations Laboratories Signal design Digital arithmetic Process design Design optimization

来源：评论

学校读者我要写书评

暂无评论

Enridged contour maps 01

Enridged contour maps

引用

IEEE Conference on Visualization

作者： J.J. van Wijk A. Telea Dept. of Mathematics and Computer Science Eindhoven University of Technology Eindhoven MB The Netherlands

ISBN: (纸本)0780372018

The visualization of scalar functions of two variables is a classic and ubiquitous application. We present a new method to visualize such data. The method is based on a nonlinear mapping of the function to a height field, followed by visualization as a shaded mountain landscape. The method is easy to implement and efficient, and leads to intriguing and insightful images: The visualization is enriched by adding ridges. Three types of applications are discussed: visualization of iso-levels, clusters (multivariate data visualization), and dense contours (flow visualization).

关键词： data visualization Chromium Computer displays data flow computing Pervasive computing Graphics Image generation Clustering algorithms Temperature Humans

来源：评论

学校读者我要写书评

暂无评论

A numerical analytical model for the continuous dynamic network equilibrium problem with limited capacity and spill back

A numerical analytical model for the continuous dynamic netw...

引用

International Conference on Intelligent Transportation

作者： J.M. Rubio-Ardanaz Jia Hao Wu M. Florian Center for Research on Transportation Université de Montréal Montreal Canada

The analytical approaches to dynamic traffic assignment did not consider until now the limited capacity of arcs. In this paper a model and an algorithm are developed for this problem. The method was coded and computat... 详细信息

关键词： Analytical models Telecommunication traffic Traffic control Transportation Queueing analysis Tail Tellurium data flow computing

来源：评论

学校读者我要写书评

暂无评论

Improved merging of datapath operators using information content and required precision analysis 01

Improved merging of datapath operators using information con...

引用

Design Automation Conference

作者： A. Mathur S. Saluja Cadence Design Systems San Jose CA USA

ISBN: (纸本)1581132972

We introduce the notions of required precision and information content of datapath signals and use them to define functionally safe transformations on data flow graphs. These transformations reduce widths of datapath operators and enhance their mergeability. Using efficient algorithms to compute required precision and information content of signals, we define a new algorithm for partitioning a data flow graph consisting of datapath operators into mergeable clusters. Experimental results indicate that use of our clustering algorithm for operator merging based synthesis of datapath intensive designs, can lead to significant improvement in the delay and area of the implementation.

关键词： Merging Information analysis flow graphs Signal processing algorithms Clustering algorithms Partitioning algorithms Delay Signal synthesis data flow computing Tree graphs

来源：评论

学校读者我要写书评

暂无评论

Determining the optimum extended instruction-set architecture for application specific reconfigurable VLIW CPUs

Determining the optimum extended instruction-set architectur...

引用

International Workshop on Rapid System Prototyping (RSP)

作者： C. Alippi W. Fornaciari L. Pozzi M. Sami Dipartimento di Elettronica e Informazione Politecnico di Milano Milan Italy

ISBN: (纸本)0769512062

Considers reconfigurable computing for application-specific systems, with particular reference to mixed-technology chips. A VLIW "core" is augmented by means of reconfigurable functional units (RFUs) and register files implemented via FPGA on to the same chip. The application is analyzed to extract segments of computation that could be usefully collapsed into complex instructions decoded and executed by the RFUs. In this paper, we focus on the problem of selecting the optimum extension to the native instruction set by means of the "best" segments of the computation that will become complex instructions. In particular, a genetic algorithm approach is introduced to analyze the population of candidates; modifications to the classic genetic operators are introduced to take into account the peculiarity of our problem. Applying the proposed methodology to some significant applications has validated the overall approach.

关键词： Field programmable gate arrays Coprocessors data flow computing VLIW CMOS technology High performance computing Computer architecture Paper technology Registers Decoding

来源：评论

学校读者我要写书评

暂无评论

The waveform description language: moving from implementation to specification

The waveform description language: moving from implementatio...

引用

MILCOM, Military Communications Conference

作者： E.D. Willink Thales Research Limited Reading UK

ISBN: (纸本)0780372255

Many current research and development activities make significant contributions to the quality of some particular implementation approach. We describe the waveform description language, in which the best characteristics of a variety of distinct programming approaches are exploited so that standard implementation domain practices can be applied in the specification domain. A single WDL specification may be refined to support semi-automated conversion to a variety of implementations. A WDL specification avoids the ambiguities and contradictions characteristic of many conventional specifications with an underlying formality that remains accessible and familiar to programmers.

关键词： Programming profession Specification languages Research and development Hazards Mathematics Safety Costs Libraries data flow computing Instruments

来源：评论

学校读者我要写书评

暂无评论

Conflicting criteria in embedded system design

引用

IEEE DESIGN & TEST OF COMPUTERS 2000年第2期17卷 51-59页

作者： Eisenring, M Thiele, L Zitzler, E Swiss Fed Inst Technol Comp Engn & Networks Lab CH-8092 Zurich Switzerland

This article presents a methodology to cope with the simultaneous optimization of multiple competing objectives and the different sources of heterogeneity in embedded system design.

关键词： Embedded system Computer interfaces data flow computing Embedded computing Signal synthesis databases Design methodology Computer architecture Field programmable gate arrays Signal design

来源：评论

学校读者我要写书评

暂无评论

The NA48 event-building PC farm

引用

IEEE TRANSACTIONS ON NUCLEAR SCIENCE 2000年第2期47卷 348-352页

作者： Wittgen, M Peters, A Marouelli, P Luitz, S Bal, F Boyle, O Gianoli, A Lacourt, A Panzer, B Vossnack, O Johannes Gutenberg Univ Mainz Inst Phys D-55099 Mainz Germany CERN CH-1211 Geneva 23 Switzerland

The NA48 experiment at the CERN SPS aims to measure the parameter R epsilon(epsilon'/epsilon) of direct CP violation in the neutral kaon system with an accuracy of 2 x 10(-4). Based on the requirements of: high event rates (up to 10 kHz) with negligible dead time support for a variety of detectors with very wide variation in the number of readout channels data rates of up to 150 MByte/s sustained over the beam burst. level-3 filtering and remote data logging in the CERN computer center the collaboration has designed and built a modular pipelined data how system with 40 MHz sampling rate. The architecture combines custom-designed components with commercially available hardware for cost effectiveness and flexibility. To increase the available data bandwidth and to add filtering and monitoring capabilities, the original custom-built event builder hardware has been replaced by a farm of 24 Intel PentiumII based PCs running the Linux operating system during the shutdown between the 1997 and 1998 data taking periods. During the data taking period 1998 the system has been successfully operated taking ca. 70 Terabyte of data.

关键词： Filtering Hardware Mesons Event detection data flow computing Collaboration Sampling methods Computer architecture Costs Bandwidth

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：