We conduct an experimental analysis of a distributed randomized algorithm for edge coloring simple undirected graphs. The algorithm is extremely simple yet, according to the probabilistic analysis, it computes nearly ...
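The abstract above is truncated at the source. For context only, the sketch below illustrates one classic round-based scheme in this family of algorithms, in which every uncolored edge tentatively picks a random color from its free palette and keeps the color only when no adjacent edge made the same pick that round. This is an assumed, generic variant, not necessarily the algorithm analyzed in the paper, and all names in it are illustrative.

```python
import random
from collections import defaultdict

def randomized_edge_coloring(edges, num_colors):
    """Round-based randomized edge coloring (illustrative sketch).

    Each round, every uncolored edge tentatively picks a random color
    not already used by an adjacent colored edge; the pick is kept only
    if no adjacent edge made the same tentative pick this round.
    """
    color = {}                   # edge -> final color
    incident = defaultdict(set)  # vertex -> incident edges
    for u, v in edges:
        incident[u].add((u, v))
        incident[v].add((u, v))

    def neighbors(e):
        u, v = e
        return (incident[u] | incident[v]) - {e}

    uncolored = set(edges)
    while uncolored:
        tentative = {}
        for e in uncolored:
            used = {color[n] for n in neighbors(e) if n in color}
            free = [c for c in range(num_colors) if c not in used]
            if free:
                tentative[e] = random.choice(free)
        for e, c in tentative.items():
            if all(tentative.get(n) != c for n in neighbors(e)):
                color[e] = c
                uncolored.discard(e)
    return color

# Example: color a 5-cycle; an odd cycle needs Delta + 1 = 3 colors.
cycle = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
print(randomized_edge_coloring(cycle, 3))
```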
Previously developed constitutive models and solution algorithms for continuum-level anisotropic elastoplastic material strength and an isotropic damage model, TEPLA, have been implemented in the three-dimensional Eulerian hydrodynamics code CONEJO. The anisotropic constitutive modeling is posed in an unrotated material frame of reference, using the polar decomposition theorem to compute the rigid-body rotation. TEPLA is based upon the Gurson flow surface (a potential function used in conjunction with the associated flow law). The original TEPLA equation set has been extended to include anisotropic elastoplasticity and recast into a new implicit solution algorithm based upon an eigenvalue scheme to accommodate the anisotropy. The algorithm solves a two-by-two system of nonlinear equations using Newton-Raphson iteration. Simulations of shaped-charge jet formation, a Taylor cylinder impact, and an explosively loaded hemishell were selected to demonstrate the utility of this modeling capability. The predicted deformation topology, plastic strain, and porosity distributions are shown for all three simulations.
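The abstract mentions a two-by-two nonlinear system solved by Newton-Raphson iteration. The sketch below shows the generic shape of such an iteration; the residual and Jacobian here are placeholder functions, not the actual TEPLA equation set.

```python
import numpy as np

def newton_2x2(residual, jacobian, x0, tol=1e-10, max_iter=50):
    """Newton-Raphson iteration for a 2x2 nonlinear system r(x) = 0."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        r = residual(x)
        if np.linalg.norm(r) < tol:
            return x
        # Solve J(x) dx = r(x), then take the Newton step x <- x - dx.
        dx = np.linalg.solve(jacobian(x), r)
        x = x - dx
    raise RuntimeError("Newton iteration did not converge")

# Placeholder residuals standing in for the coupled plasticity/porosity
# equations (NOT the actual TEPLA equation set):
r = lambda x: np.array([x[0]**2 + x[1] - 3.0, x[0] + x[1]**2 - 5.0])
J = lambda x: np.array([[2 * x[0], 1.0], [1.0, 2 * x[1]]])
print(newton_2x2(r, J, x0=[1.0, 1.0]))  # converges to (1, 2)
```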
ISBN: (Print) 9780818676413
Path-oriented scheduling methods, such as trace scheduling and hyperblock scheduling, use speculation to extract instruction-level parallelism from control-intensive programs. These methods predict important execution paths in the current scheduling scope using execution profiling or frequency estimation. Aggressive speculation is then applied to the important execution paths, possibly at the cost of degraded performance along other paths. Therefore, the speed of the output code can be sensitive to the compiler's ability to accurately predict the important execution paths. Prior work in this area has utilized Fisher's speculative yield function, coupled with dependence height, to distribute instruction priority among execution paths in the scheduling scope. While this technique provides more stable performance by attending to the needs of all paths, it does not directly address the mismatch between compile-time prediction and run-time behavior. The work presented in this paper extends the speculative yield and dependence height heuristic to explicitly minimize the penalty suffered by other paths when instructions are speculated along a path. Since the execution time of a path is determined by the number of cycles spent between a path's entrance and exit in the scheduling scope, the heuristic attempts to eliminate unnecessary speculation that delays any path's exit. Such control of speculation makes performance much less sensitive to the actual path taken at run time. The proposed method strongly emphasizes minimizing the delay to all exits, hence the name speculative hedge. This paper presents the speculative hedge heuristic and shows how it controls over-speculation in a superblock/hyperblock scheduler. The stability of output code performance in the presence of execution variation is demonstrated with six programs from the SPEC CINT92 benchmark suite.
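To make the flavor of a speculative-yield-style priority concrete, the sketch below weights an instruction's dependence height toward each exit by that exit's profile probability. The data structures and weighting are simplified assumptions for illustration, not the paper's exact heuristic.

```python
def priority(instr, exits, needed_by, dep_height):
    """Profile-weighted priority of one instruction (illustrative).

    instr:      instruction id
    exits:      {exit_id: profile probability of taking that exit}
    needed_by:  {exit_id: set of instructions required before that exit}
    dep_height: {(instr, exit_id): cycles on the critical path to exit}
    """
    total = 0.0
    for ex, prob in exits.items():
        if instr in needed_by[ex]:
            total += prob * dep_height.get((instr, ex), 0)
    return total

# Hypothetical scheduling scope with two exits:
exits = {"exit_A": 0.7, "exit_B": 0.3}
needed_by = {"exit_A": {"i1", "i2"}, "exit_B": {"i1", "i3"}}
dep_height = {("i1", "exit_A"): 4, ("i1", "exit_B"): 2,
              ("i2", "exit_A"): 1, ("i3", "exit_B"): 5}
for i in ("i1", "i2", "i3"):
    print(i, priority(i, exits, needed_by, dep_height))
```

Here `i1`, needed by both exits, scores highest (3.4), so it is scheduled early without delaying either exit; speculating an instruction needed by only one exit ahead of it would risk delaying the other exit, which is the over-speculation the heuristic penalizes.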
ISBN: (Print) 9781605589114
This paper describes a multi-FPGA-based platform for emulating the Loongson-2G microprocessor on different motherboards. The platform was developed for verification and evaluation of the Loongson-2G, the next generation of the Loongson-2 family, which consists of a four-issue, out-of-order 64-bit MIPS-compatible processor core named GS464, a 1 MB secondary cache, a HyperTransport IO interface, a DDR2/3 memory interface, and several other low-speed IO interfaces. Most parts of the microprocessor are mapped onto the platform, which consists of two Virtex-5 330 FPGA chips. Semi-custom partitioning tactics spanning the entire design flow were developed to synthesize the whole design onto the multi-FPGA platform, and architectural modifications were applied to the original chip design to make it easier to partition into two parts. The high-speed SerDes of the HyperTransport IO link and the DDR2/3 memory interface are emulated using several clocks with different phases. To address the difficulty of debugging within an FPGA system, a software-probe method assisted by hardware modules injected into the FPGA was developed; it was used to debug a problem caused by behavioral mismatches between the ASIC RAM block and the FPGA RAM block. Performance evaluation of the Loongson-2G was carried out on the platform as a pre-silicon test. To the authors' knowledge, no previous work has applied a design of this size to verification and evaluation in this way.
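The multi-phase-clock idea can be pictured as follows: N clocks, each at 1/N of the link rate and offset in phase, together sample every bit of the high-speed stream, and re-interleaving the per-phase samples recovers the full-rate data. The sketch below is a purely conceptual software model of that idea, not the platform's actual implementation.

```python
def emulate_serdes(bitstream, num_phases):
    """Model N phase-shifted slow clocks sampling one fast serial stream."""
    # Each "phase domain" samples every num_phases-th bit, offset by its phase.
    lanes = [bitstream[p::num_phases] for p in range(num_phases)]
    # Re-interleave the per-phase samples to recover the full-rate stream.
    recovered = []
    for i in range(max(len(lane) for lane in lanes)):
        for lane in lanes:
            if i < len(lane):
                recovered.append(lane[i])
    return lanes, recovered

bits = [1, 0, 1, 1, 0, 0, 1, 0]
lanes, recovered = emulate_serdes(bits, num_phases=4)
print(lanes)              # four quarter-rate phase-domain streams
assert recovered == bits  # interleaving the phases recovers the link data
```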
The increasing software content of battery-powered embedded systems has fueled much interest in techniques for developing energy-efficient embedded software. Source code transformations have previously been considered for application software to reduce its energy consumption. For complex embedded software applications, which consist of multiple concurrent processes running with the support of an embedded operating system (OS), it is known that the OS and the application-OS interaction significantly affect energy consumption. However, source code transformations explicitly targeting these effects have not been sufficiently studied. This paper proposes novel transformations for the source code of OS-driven multi-process embedded software programs in order to reduce their energy consumption. The key features of our optimizations are that they span process boundaries and that they minimize the energy consumed in the execution of OS functions and services, opportunities that are beyond the reach of conventional compiler optimizations and source code transformation techniques. We propose four types of transformations: process-level concurrency management, message vectorization, computation migration, and inter-process communication mechanism selection. We discuss how to systematically identify opportunities for the proposed transformations and apply them directly to the program source code. We have applied the proposed techniques to several multi-process software benchmark programs and evaluated their applicability in the context of an embedded system containing an Intel StrongARM processor and the embedded Linux OS. Our techniques achieve up to 37.9% (23.8% on average) energy reduction compared to highly compiler-optimized implementations.
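Of the four transformations, message vectorization is the easiest to illustrate in isolation: several small inter-process sends are accumulated and flushed as one larger transfer, so the per-call OS overhead is paid once per batch rather than once per message. The sketch below is a minimal, assumed illustration (the pipe transport, class name, and batch size are all hypothetical), not the paper's implementation.

```python
import os

class VectorizedSender:
    """Batch small IPC messages into one write() per batch (illustrative)."""

    def __init__(self, fd, batch_size=16):
        self.fd = fd
        self.batch_size = batch_size
        self.pending = []

    def send(self, msg: bytes):
        self.pending.append(msg)
        if len(self.pending) >= self.batch_size:
            self.flush()

    def flush(self):
        if self.pending:
            # One system call for the whole batch instead of one per message.
            os.write(self.fd, b"".join(self.pending))
            self.pending = []

r, w = os.pipe()
tx = VectorizedSender(w, batch_size=4)
for i in range(4):
    tx.send(f"msg{i};".encode())
print(os.read(r, 1024))  # b'msg0;msg1;msg2;msg3;' delivered in one write
```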
Stability analysis is a fundamental problem in power system operation and control. Traditional stability analysis methods are based on deterministic initial parameters of the power system. In practice, deterministic values of initial parameters such as load often cannot be determined; frequently only their probability distributions are available, and the traditional methods cannot be applied under these conditions. This paper proposes a method to compute the probability distribution of the fault critical clearing time. The method uses the Gram-Charlier expansion of a random variable and the properties of cumulants and, on the basis of sensitivity computation, converts the computation of the probability distribution of the fault critical clearing time into the computation of the cumulants of the initial parameters. The method was tested on a 39-machine system. Compared with Monte-Carlo simulation, it does not need a large number of simulations as statistical samples; only a single simulation is required to compute the sensitivity of the fault critical clearing time. The results show that the method accurately approximates the cumulative distribution function of the fault critical clearing time while reducing the computational burden and improving computation speed. Using this method, the probability that the fault critical clearing time lies in an interval close to its expected value can be determined, and it can serve as a tool for stability analysis.
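For orientation, the sketch below shows a Gram-Charlier A-series approximation of a CDF built from the first few cumulants. The cumulant values are hypothetical and the code is illustrative only, not the paper's implementation.

```python
import math

def gram_charlier_cdf(x, k1, k2, k3, k4=0.0):
    """Gram-Charlier A-series CDF approximation from cumulants k1..k4.

    F(x) ~ Phi(z) - phi(z) * [g1/6 * He2(z) + g2/24 * He3(z)], where
    z = (x - k1)/sqrt(k2), skewness g1 = k3/k2**1.5, excess kurtosis
    g2 = k4/k2**2, and He_n are the probabilists' Hermite polynomials.
    """
    sigma = math.sqrt(k2)
    z = (x - k1) / sigma
    phi = math.exp(-z * z / 2) / math.sqrt(2 * math.pi)
    Phi = 0.5 * (1 + math.erf(z / math.sqrt(2)))
    g1, g2 = k3 / k2**1.5, k4 / k2**2
    he2, he3 = z * z - 1, z**3 - 3 * z
    return Phi - phi * (g1 / 6 * he2 + g2 / 24 * he3)

# Hypothetical cumulants of a critical clearing time (seconds):
# mean 0.25 s, variance 0.0004 s^2, slight positive skew.
for t in (0.22, 0.25, 0.28):
    print(t, round(gram_charlier_cdf(t, 0.25, 0.0004, 1e-6), 4))
```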
Reservoir lifetime can be interpreted as the number of years a reservoir can be used to fulfil its purpose. This study proposes an approach to predict the remaining life of the Semenyih reservoir using an empirical method; the result can support important decisions about the water supply. The paper focuses on Semenyih dam, one of the Klang Valley's major dams in Selangor, Malaysia, built in 1985 with a planned lifetime of 100 years. The watershed contains one of the main rivers in the state of Selangor, which has been negatively affected by industrial and urban wastes since the early 1990s. The estimated sediment volume in Semenyih Reservoir grew from 13,165,400 m³ in 2004 to 13,511,900 m³ in 2010; as calculated in 2016, sediment delivery in the Semenyih catchment had reached 13,665,450 m³. With the empirical method, the remaining lifetime of Semenyih Reservoir can be estimated in years, and this figure can then be used as a reference to predict the remaining dead-storage volume of the reservoir using the sediment approach. In this paper the remaining lifetime of the reservoir was estimated at 65 years.
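The rate-based arithmetic behind such an estimate can be illustrated as follows. Only the sediment volumes come from the abstract; the remaining dead-storage capacity below is a hypothetical placeholder chosen purely for illustration.

```python
# Worked example of the rate-based lifetime arithmetic. The sediment
# volumes are from the abstract; REMAINING_CAPACITY_M3 is a hypothetical
# placeholder (the actual dead-storage figure is not given above).

sediment_2004 = 13_165_400   # m^3, estimated cumulative sediment
sediment_2016 = 13_665_450   # m^3, calculated cumulative sediment
annual_rate = (sediment_2016 - sediment_2004) / (2016 - 2004)
print(f"average sedimentation rate: {annual_rate:,.0f} m^3/year")

REMAINING_CAPACITY_M3 = 2_700_000    # hypothetical remaining dead storage
remaining_life = REMAINING_CAPACITY_M3 / annual_rate
print(f"estimated remaining life: {remaining_life:.0f} years")
```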