Search results - Inner Mongolia University Library

EUROMICRO Conference

Authors: A. Belloum, L.O. Hertzberger. Computer Architecture and Parallel Systems Group, Department of Computer Science, Universiteit van Amsterdam, Amsterdam, Netherlands

A proper initialization requires starting the process in a state close to the expected steady state. In Web caching, the initialization problem is faced each time a new document enters the cache. Independently of the method used to sort the documents in the cache, the newly referenced document is inserted in a so-called "removal list", from which documents are removed when storage space is needed. Often, undesirable documents are assigned a high priority; consequently, these documents remain in the cache for quite a long time, leading to a decrease in cache server performance. We investigate one category of undesirable documents, which pass the filters commonly used to control cache processing.

Keywords: Internet; Telecommunication traffic; Web server; Cache storage; Computer architecture; Computer science; Steady-state; Pattern analysis; IP networks
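
The removal list described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class name, the priority values, and the rule "evict the lowest priority first" are assumptions made only for illustration. It shows how a document that was given a high priority on insertion stays cached while better candidates are removed.

```python
import heapq
import itertools


class RemovalListCache:
    """Toy Web cache whose removal list is a min-heap ordered by priority."""

    def __init__(self, capacity_bytes):
        self.capacity = capacity_bytes
        self.used = 0
        self.removal_list = []              # heap of (priority, sequence, url)
        self.sizes = {}                     # url -> size in bytes
        self._sequence = itertools.count()  # tie-breaker for equal priorities

    def insert(self, url, size, priority):
        """Cache a newly referenced document; return the URLs evicted for it."""
        if size > self.capacity or url in self.sizes:
            return []                       # too large to cache, or already cached
        evicted = []
        # Remove the lowest-priority documents until the new one fits.
        while self.used + size > self.capacity:
            _, _, victim = heapq.heappop(self.removal_list)
            self.used -= self.sizes.pop(victim)
            evicted.append(victim)
        heapq.heappush(self.removal_list, (priority, next(self._sequence), url))
        self.sizes[url] = size
        self.used += size
        return evicted


# Hypothetical documents: /b.html is an undesirable document that was
# assigned a high priority when it entered the cache.
cache = RemovalListCache(capacity_bytes=100)
cache.insert("/a.html", 40, priority=1.0)
cache.insert("/b.html", 40, priority=5.0)
first = cache.insert("/c.html", 40, priority=2.0)
second = cache.insert("/d.html", 40, priority=3.0)
print(first, second, sorted(cache.sizes))
```

In this run `/b.html` outlives both `/a.html` and `/c.html`, which is the effect of a poorly initialized priority that the abstract refers to.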

Performance measurement using low perturbation and high precision hardware assists

Real-Time systems Symposium (RTSS)

Authors: A. Mink, W. Salamon, J.K. Hollingsworth, R. Arunachalam. Scalable Parallel Systems Group, Information Technology Laboratory, National Institute for Standards and Technology, Gaithersburg, MD, USA; Computer Science Department, University of Maryland, College Park, MD, USA

We present the design and implementation of MultiKron PCI, a hardware performance monitor that can be plugged into any computer with a free PCI bus slot. The monitor provides a series of high-resolution timers and the ability to monitor the utilization of the PCI bus. We also demonstrate how the monitor can be integrated with online performance monitoring tools, such as the Paradyn parallel performance measurement tools, to reduce the overhead of key timer operations by a factor of 25. In addition, we present a series of case studies using the MultiKron hardware performance monitor to measure and tune high-performance parallel computing applications. By using the monitor, we were able to find and correct a performance bug in a popular implementation of the MPI message passing library that caused some communication primitives to run at one half their potential speed.

Keywords: Measurement; Hardware; Computerized monitoring; Performance evaluation; NIST; Debugging; Timing; Runtime; Counting circuits; Clocks

Local Bayesian Regularisation of Parsimonious Neurofuzzy Models for Real World Dynamic Processes

IFAC Proceedings Volumes, 1998, vol. 31, no. 4, pp. 17-22

Authors: K.M. Bossley, M. Brown, C.J. Harris. Parallel Applications Centre, 2 Venture Road, Chilworth, Southampton SO16 7NP, UK; Image Speech and Intelligent Systems Research Group, Department of Electronics and Computer Science, University of Southampton, SO17 1BJ, UK

By combining properties of fuzzy systems and neural networks, neurofuzzy modelling is ideally suited to many system identification and data modelling applications. Recently, data-driven model construction algorithms have been developed to identify these models. These algorithms have proved essential for producing accurate parsimonious models. However, due to problems with sparse data and restricted model structures, models with high model variance are often produced, resulting in models which generalise poorly. In this paper, local Bayesian inference techniques are applied to neurofuzzy models: multiple prior probability density functions are placed on the weights and superfluous model variance is controlled. This gives a form of regularisation where Bayesian estimation produces simple re-estimation formulae which identify a suitable bias/variance balance from the data. This approach is considered a post-processing step to model construction, the merits of which are demonstrated by application to a real-world data set.

Keywords: System identification; Neural networks; Fuzzy logic; Regularisation

A methodology for guided behavioral-level optimization

Design Automation Conference

Authors: L. Guerra, M. Potkonjak, J. Rabaey. Advanced VLSI Architecture Group, Rockwell Semiconductor Systems Inc., Newport Beach, CA, USA; Computer Science Department, University of California, Los Angeles, CA, USA; EECS Department, University of California, Berkeley, CA, USA

Optimization at the early stages of design is crucial. However, due to an overwhelming number of design and optimization options, design exploration is often conducted in a qualitative, ad-hoc manner. This paper presents a methodology and interactive environment for guiding the exploration process. A prototype targeting behavioral-level optimization for datapath-intensive ASIC implementations has been developed. The key to the approach is encapsulated knowledge about the various optimizations and a set of techniques to automatically extract the "essence" of a design description. At each stage in the exploration process, the system suggests and ranks potential optimizations, in terms of both immediate and longer-term impact. It also provides evaluations of the design and of the likely effects each optimization will have on metrics like power and performance. In the new approach, the designer is responsible for making the actual optimization selections. However, using the provided guidance, designers can make decisions in a more informed manner, and therefore can explore the design solution space more effectively. The effectiveness of the approach is demonstrated on a number of designs.

Keywords: Optimization methods; Design optimization; Permission; Application specific integrated circuits; Space exploration; Very large scale integration; Computer architecture; Computer science; Prototypes; Clocks

Wavelet Packet Analysis in the Condition Monitoring of Rotating Machinery

IFAC Proceedings Volumes, 1998, vol. 31, no. 29, pp. 89-94

Authors: Kevin Bossley, R. Mckendrick, C.J. Harris, C. Mercer. Parallel Applications Centre, 2 Venture Road, Chilworth, Southampton SO16 7NP, UK; Image Speech and Intelligent Systems Research Group, Department of Electronics and Computer Science, University of Southampton, SO17 1BJ, UK; Prosig Ltd, Link House, High Street, Fareham PO16 7BQ, UK

In this paper, novel methods for performing condition monitoring for power station turbine shafts are presented. The objective of this work is to investigate methods for producing accurate turbine vibration fault alarms during turbine shaft rundowns. Wavelet packet analysis is employed to extract spectral features from healthy vibration signals and the probability density functions of these features are estimated. Both Gaussian models, using Bayesian inferencing, and mixture models are employed. Preliminary results show that the more computationally expensive mixture models produce more accurate density estimates and hence more reliable fault alarms.

Keywords: Condition monitoring; Conditional probability estimation; Neural networks

Wavelet Packet Analysis in the Condition Monitoring of Rotating Machinery

IFAC Proceedings Volumes, 1998, vol. 31, no. 29, pp. 37-39

Authors: K.M. Bossley, R.J. Mckendrick, C. Mercer, C.J. Harris. Parallel Applications Centre, 2 Venture Road, Chilworth, Southampton SO16 7NP, UK; Prosig Ltd, Link House, High Street, Fareham PO16 7BQ, UK; Image Speech and Intelligent Systems (ISIS) Research Group, Department of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, United Kingdom

Coordinated thread scheduling for workstation clusters under Windows NT

1st USENIX Windows NT Workshop

Authors: Buchanan, Matt; Chien, Andrew A. Concurrent Systems Architecture Group, Department of Computer Science, University of Illinois, United States

Coordinated thread scheduling is a critical factor in achieving good performance for tightly-coupled parallel jobs on workstation clusters. We are building a coordinated scheduling system that coexists with the Windows NT scheduler, provides coordinated scheduling, and can generalize to provide a wide range of resource abstractions. We describe the basic approach, called "demand-based coscheduling", and its implementation in the context of Windows NT. We report preliminary performance data characterizing the effectiveness of our approach and describe its benefits and limitations. © USENIX 1997.

Keywords: Scheduling

Automatic inline allocation of objects

SIGPLAN 97 Conference on Programming Language Design and Implementation

Authors: Dolby, J. Concurrent Systems Architecture Group, Department of Computer Science, University of Illinois, 1304 West Springfield Avenue, Urbana, IL

ISBN: (print) 9780897919074

Object-oriented languages like Java and Smalltalk provide a uniform object model that simplifies programming by providing a consistent, abstract model of object behavior. But direct implementations introduce overhead, removal of which requires aggressive implementation techniques (e.g. type inference, function specialization). In this paper, we introduce object inlining, an optimization that automatically inline-allocates objects within containers (as is done by hand in C++) within a uniform model. We present our technique, which includes novel program analyses that track how inlinable objects are used throughout the program. We evaluated object inlining on several object-oriented benchmarks. It produces performance up to three times as fast as a dynamic model without inlining and roughly equal to that of manually-inlined codes.

Keywords: Object oriented programming

Runtime mechanisms for efficient dynamic multithreading

Journal of Parallel and Distributed Computing, 1996, vol. 37, no. 1, pp. 21-40

Authors: Karamcheti, V; Plevyak, J; Chien, AA. Concurrent Systems Architecture Group, Department of Computer Science, University of Illinois at Urbana–Champaign, 1304 West Springfield Avenue, Urbana, Illinois 61801-2987

High performance on distributed memory machines for programming models with dynamic thread creation and multithreading requires efficient thread management and communication. Traditional multithreading runtimes, consisting of a few general-purpose, bundled mechanisms that assume minimal compiler and hardware support, are suitable for computations involving coarse-grained threads but provide low efficiency in the presence of small granularity threads and irregular communication behavior. We describe two mechanisms of the Illinois Concert runtime system which address this shortcoming. The first, hybrid stack-heap execution, exploits close coupling with the compiler to dynamically form coarse-grained execution units; threads are lazily created as required by runtime situations. The second, pull messaging, exploits hardware support to implement a distributed message queue with receiver-initiated data transfer, delivering robust performance across a wide range of dynamic communication characteristics. We measure their performance impact based on a Cray T3D implementation of the Concert system. Individually, the mechanisms increase absolute execution efficiency by up to 50%. Together, they increase the feasible space of efficient computations, enabling compute granularities an order of magnitude smaller. Performance results for two large irregular applications demonstrate that expressing programs using dynamic multithreading need not compromise on performance. © Academic Press, Inc.

Keywords: Programs; Legal; Executions; Compilers; Multithreading; Persuasive Communication; Memory machine; Robust control; High-performance; Runtime; Message Queue

Performance Prediction of Parallel Self Consistent Field Computation

Parallel Algorithms and Applications, 1996, vol. 10, no. 1-2, pp. 127-143

Authors: M. J. Zemerly, T. J. Atherton, J. Papay, G. R. Nudd. Parallel Systems Group, Department of Computer Science, University of Warwick, Coventry, UK

This paper presents a methodology for performance prediction of parallel algorithms and illustrates its use on a large-scale computational chemistry application. The performance prediction uses a component time characterization technique which splits up the sequential code into computational components and measures the time for each of them. The parallel algorithm is built from these components by adding communication routines. A "Processor Activity Graph" (PAG) providing a graphical representation of the parallel algorithm's runtime behaviour is used for predicting the execution time. As a case study, a Self Consistent Field (SCF) computation has been selected, which forms the basis of many computational chemistry packages [4, 5]. The performance model of the SCF computation has been built and the predictions have been compared with the results of measurements. The measurements were performed on a mesh-connected distributed memory parallel computer (128 T800 Parsytec SuperCluster). The prediction error is less than 10%. Performance optimisation of the application has been achieved by reducing the communication overhead and changing the data representation.

Keywords: Performance prediction; Processor activity graph; Self consistent field computations
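
The component-time idea in this abstract can be sketched roughly as follows. This is a deliberately simplified stand-in for the Processor Activity Graph: it assumes each processor's activities simply add up and ignores waiting caused by dependencies between processors. All component names, timings, and the measured value are hypothetical, not figures from the paper.

```python
def predict_execution_time(activities):
    """Return (predicted_time, per_processor_busy_time).

    activities maps a processor id to its ordered list of
    (label, seconds) entries, mixing computation and communication.
    """
    busy = {proc: sum(seconds for _, seconds in steps)
            for proc, steps in activities.items()}
    # The slowest processor determines the predicted run time.
    return max(busy.values()), busy


def relative_error(predicted, measured):
    """Relative prediction error with respect to a measured run time."""
    return abs(predicted - measured) / measured


# Hypothetical component timings (seconds), as if measured on sequential code.
COMPONENT = {"compute_a": 2.0, "compute_b": 1.5}
COMM = 0.25  # hypothetical cost of one message

activities = {
    0: [("compute_a", COMPONENT["compute_a"]), ("send", COMM),
        ("compute_b", COMPONENT["compute_b"])],
    1: [("compute_a", COMPONENT["compute_a"]), ("recv", COMM),
        ("compute_b", COMPONENT["compute_b"]), ("send", COMM)],
}

predicted, busy = predict_execution_time(activities)
error = relative_error(predicted, measured=4.25)  # 4.25 s is a made-up measurement
print(predicted, busy, error)
```

Comparing `error` against a threshold such as 0.10 mirrors the kind of check behind the reported prediction error, though the real model accounts for far more detail than this sum.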
