检索结果-内蒙古大学图书馆

IEEE/ACM 11th Workshop on Fault Tolerance for HPC at Extreme Scale (FTXS)

作者： Johnson, Trokon Lam, Herman Univ Florida Elect & Comp Engn Dept Gainesville FL 32611 USA

ISBN: (纸本)9781665420594

As the design space for high-performance computer (HPC) systems grows larger and more complex, modeling and simulation (MODSIM) techniques become more important to better optimize systems. Furthermore, recent extreme-scale systems and newer technologies can lead to higher system fault rates, which negatively affect system performance and other metrics. Therefore, it is important for system designers to consider the effects of faults and fault-tolerance (FT) techniques on system design through MODSIM. BE-SST is an existing MODSIM methodology and workflow that facilitates preliminary exploration & reduction of large design spaces, particularly by highlighting areas of the space for detailed study and pruning less optimal areas. This paper presents the overall methodology for adding fault-tolerance awareness (FT-awareness) into BE-SST. We present the process used to extend BE-SST, enabling the creation of models that predict the time needed to perform a checkpoint instance for the given system configuration. Additionally, this paper presents a case study where a full HPC system is simulated using BE-SST, including application, hardware, and checkpointing. We validate the models and simulation against actual system measurements, finding an average percent error of less than 17% for the instance models and about 20% for system simulation, a level of accuracy acceptable for initial exploration and pruning of the design space. Finally, we show how FT-aware simulation results are used for comparing FT levels in the design space.

关键词： fault-tolerance aware system design and evaluation system-level modeling and simulation high-performance computing design space exploration

来源：评论

学校读者我要写书评

暂无评论

A data-driven framework for error estimation and mesh-model optimization in system-level thermal-hydraulic simulation

引用

NUCLEAR ENGINEERING AND DESIGN 2019年第Aug.期349卷 27-45页

作者： Bao, Han Dinh, Nam T. Lane, Jeffrey W. Youngblood, Robert W. Idaho Natl Lab Syst Integrat Dept POB 1625MS 3860 Idaho Falls ID 83415 USA North Carolina State Univ Dept Nucl Engn 3140 Burlington Engn Labs2500 Stinson Dr Raleigh NC 27695 USA Zachry Nucl Engn Inc 200 Regency Forest Dr Cary NC 27518 USA Idaho Natl Lab Risk Assessment & Management Serv POB 1625MS 3870 Idaho Falls ID 83415 USA

Over the past decades, several computer codes have been developed for simulation and analysis of thermal-hydraulics and system response in nuclear reactors under operating, abnormal transient, and accident conditions. However, simulation errors and uncertainties still inevitably exist even while these codes have been extensively assessed and used. In this work, a data-driven framework (Optimal Mesh/Model Information system, OMIS) is formulated and demonstrated to estimate simulation error and suggest optimal selection of computational mesh size (i.e., nodalization) and constitutive correlations (e.g., wall functions and turbulence models) for low-fidelity, coarse-mesh thermal-hydraulic simulation, in order to achieve accuracy comparable to that of high-fidelity simulation. Using results from high-fidelity simulations and experimental data with many fast-running low-fidelity simulations, an error database is built and used to train a machine learning model that can determine the relationship between local simulation error and local physical features. This machine learning model is then used to generate insight and help correct low-fidelity simulations for similar physical conditions. The OMIS framework is designed as a modularized six-step procedure and accomplished with state-of-the-art methods and algorithms. A mixed-convection case study was performed to illustrate the entire framework.

关键词： Coarse mesh Error estimation system-level modeling and simulation Machine learning Physical feature

来源：评论

学校读者我要写书评

暂无评论

Assessing system Software Performance in Complex system of systems Environments

Assessing System Software Performance in Complex System of S...

引用

MILCOM Military Communications Conference

作者： Wessel, James T. Meyer, Bryce L. Carnegie Mellon Univ Inst Software Engn Pittsburgh PA 15213 USA Carnegie Mellon Univ Inst Software Engn St Louis MO USA

ISBN: (纸本)9781424481804

The characterization of software performance (SWP) in complex, service-oriented architecture (SOA)-based system of systems (SoS) environments is an emergent study area. This report focuses on both qualitative and quantitative ways of determining the current state of SWP in terms of both test coverage (what has been tested) and confidence (degree of testing) for SOA-based SoS environments. Practical tools and methodologies are offered to aid technical and programmatic managers in the form of a stepwise methodology toward SWP selection. Included are system architecture design considerations, resource limiters of SWP, test event design considerations, organizational and process suggestions toward improved SWP management and a matrix of measurement suggestions.

关键词： Assuring mission success Network-centric systems and technologies system-level modeling and simulation system of systems (SoS) Service Oriented Architecture (SOA)

来源：评论

学校读者我要写书评

暂无评论

Calibration of abstract performance models for system-level design space exploration

引用

JOURNAL OF SIGNAL PROCESSING systemS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 2008年第2期50卷 99-114页

作者： Pimentel, Andy D. Thompson, Mark Polstra, Simon Erbas, Cagkan Univ Amsterdam Inst Informat Comp Syst Architecture Grp NL-1098 SJ Amsterdam Netherlands

High-level performance modeling and simulation have become a key ingredient of system-level design as they facilitate early architectural design space exploration. An important precondition for such high-level modeling and simulation methods is that they should yield trustworthy performance estimations. This requires validation ( if possible) and calibration of the simulation models, which are two aspects that have not yet been widely addressed in the system-level community. This article presents a number of mechanisms for both calibrating isolated model components as well as a system-level performance model as a whole. We discuss these model calibration mechanisms in the context of our Sesame system-level simulation framework. Two illustrative case studies will also be presented to indicate the merits of model calibration.

关键词： system-level modeling and simulation performance analysis model calibration

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：