检索结果-内蒙古大学图书馆

What’s Missing in Agile Hardware Design? Verification!

Journal of computer Science & Technology 2023年第4期38卷 735-736页

作者： Babak Falsafi Parallel Systems Architecture Laboratory Institute of Computer and Communication SciencesSchool of Computer andCommunication SciencesEcole Polytechnique Fédérale de LausanneLausanneCH-1015Switzerland

Agile hardware design is an approach to developing hardware systems that draws inspiration from the principles and practices of agile software *** emphasizes collaboration,flexibility,iterative development,and quick adaptation to changing *** agile hardware design,the focus is on delivering functionalhardware systems in shorter development cycles while maintaining high-quality and customer *** particular,agile hardware design is of great interest in the open-source hardware ***-sourcehardware development—such as RISC-V—is at the forefront of initiatives to democratize hardware and drive innovation in chip design *** design is instrumental for the RISC-V community because it supportsrapid iteration,accommodates the evolving RISC-V standard and the addition of custom extensions,improvescommunity collaboration and time-to-market,and addresses the design challenges associated with complex architectural features.

关键词： hardware agile architectural

来源：评论

学校读者我要写书评

暂无评论

parallel-in-Time Multi-Level Integration of the Shallow-Water Equations on the Rotating Sphere

arXiv

引用

arXiv 2019年

作者： Hamon, François P. Schreiber, Martin Minion, Michael L. Center for Computational Sciences and Engineering Lawrence Berkeley National Laboratory Berkeley United States Chair of Computer Architecture and Parallel Systems Technical University of Munich Germany Department of Applied Mathematics Lawrence Berkeley National Laboratory Berkeley United States

The modeling of atmospheric processes in the context of weather and climate simulations is an important and computationally expensive challenge. The temporal integration of the underlying PDEs requires a very large number of time steps, even when the terms accounting for the propagation of fast atmospheric waves are treated implicitly. Therefore, the use of parallel-in-time integration schemes to reduce the time-to-solution is of increasing interest, particularly in the numerical weather forecasting field. We present a multi-level parallel-in-time integration method combining the parallel Full Approximation Scheme in Space and Time (PFASST) with a spatial discretization based on Spherical Harmonics (SH). The iterative algorithm computes multiple time steps concurrently by interweaving parallel high-order fine corrections and serial corrections performed on a coarsened problem. To do that, we design a methodology relying on the spectral basis of the SH to coarsen and interpolate the problem in space. The methods are evaluated on the shallow-water equations on the sphere using a set of tests commonly used in the atmospheric flow community. We assess the convergence of PFASST-SH upon refinement in time. We also investigate the impact of the coarsening strategy on the accuracy of the scheme, and specifically on its ability to capture the high-frequency modes accumulating in the solution. Finally, we study the computational cost of PFASST-SH to demonstrate that our scheme resolves the main features of the solution multiple times faster than the serial schemes. Copyright © 2019, The Authors. All rights reserved.

关键词： Harmonic analysis

来源：评论

学校读者我要写书评

暂无评论

Multi-level spectral deferred corrections scheme for the shallow water equations on the rotating sphere

arXiv

引用

arXiv 2018年

作者： Hamona, François P. Schreiberb, Martin Miniond, Michael L. Center for Computational Sciences and Engineering Lawrence Berkeley National Laboratory Berkeley United States Department of Mathematics/Computer Science University of Exeter Exeter United Kingdom Computer Architecture and Parallel Systems Technical University of Munich Germany Department of Applied Mathematics Lawrence Berkeley National Laboratory Berkeley United States

Effcient time integration schemes are necessary to capture the complex processes involved in atmospheric ows over long periods of time. In this work, we propose a high-order, implicit-explicit numerical scheme that combines Multi-Level Spectral Deferred Corrections (MLSDC) and the Spherical Harmonics (SH) transform to solve the wave-propagation problems arising from the shallow-water equations on the rotating *** iterative temporal integration is based on a sequence of corrections distributed on coupled spacetime levels to perform a significant portion of the calculations on a coarse representation of the problem and hence to reduce the time-to-solution while preserving accuracy. In our scheme, referred to as MLSDCSH, the spatial discretization plays a key role in the efficiency of MLSDC, since the SH basis allows for consistent transfer functions between space-time levels that preserve important physical properties of the *** study the performance of the MLSDC-SH scheme with shallow-water test cases commonly used in numerical atmospheric modeling. We use this suite of test cases, which gradually adds more complexity to the nonlinear system of governing partial differential equations, to perform a detailed analysis of the accuracy of MLSDC-SH upon renement in time. We illustrate the stability properties of MLSDC-SH and show that the proposed scheme achieves up to eighth-order convergence in time. Finally, we study the conditions in which MLSDC-SH achieves its theoretical speedup, and we show that it can significantlyreduce the computational cost compared to single-level Spectral Deferred Corrections (SDC). Copyright © 2018, The Authors. All rights reserved.

关键词： Harmonic analysis

来源：评论

学校读者我要写书评

暂无评论

Asynchronous Runtimes in Action: An Introspective Framework for a Next Gen Runtime

Asynchronous Runtimes in Action: An Introspective Framework ...

引用

IEEE International Symposium on parallel and Distributed Processing Workshops and Phd Forum (IPDPSW)

作者： Joshua Suetterlein Joshua Landwehr Andrés Márquez Joseph B. Manzano Guang R. Gao The Computer Architecture and Parallel Systems Laboratory University of Delaware Newark Delaware Pacific Northwest National Laboratory Richland Washington

ISBN: (纸本)9781509036837

One of the most critical challenges that new highperformance systems face is the lack of system software supportfor these large scale systems. Investment on system stack componentsis essential in the development, debugging and optimizationof the new emerging programming models. These emergingmodels have the promise to better utilize the vast hardwareresources available in current and future systems. To aid in thedevelopment of applications and new system stacks, runtimes, asinstances of their respective execution models, need to producefacilities to introspect their inner workings and allow an indepthattribution of performance bottlenecks and computationalpatterns. In other words, the runtime systems need to reducetheir opacity to observers so that users of a novel programexecution model can adapt their designs to fit the intended modelusage, regardless of the layer that they are working on. Thisdesign/development loop (akin to co-design) enables synergisticopportunities across the entire computational stack. This paper presents the design and implementation of a simple"gray" box performance attribution harness running inside a finegrain runtime system: the Open Community Runtime (OCR). We showcase what such a framework can indicate regarding theruntime behavior while running at scale. To this end, we havedesigned a set of synthetic scenarios aimed to test the runtime attheir best and worst cases. We present an analysis of the mostimportant runtime features, properties and idiosyncrasies thatwill affect the development of new runtime features, algorithmicselection, and application development.

关键词： Runtime Optical character recognition software Instruction sets Computational modeling Adaptation models Message systems Protocols

来源：评论

学校读者我要写书评

暂无评论

Codelet Scheduling by Genetic Algorithm

Codelet Scheduling by Genetic Algorithm

引用

IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)

作者： Songwen Pei Jinkai Wang Wenyang Cui Linhua Jiang Tongsheng Geng Jean-Luc Gaudiot Stéphane Zuckerman Shanghai Key Lab of Modern Optical Systems University of Shanghai for Science and Technology Shanghai China Parallel Systems and Computer Architecture Lab University of California Irvine CA USA Computer Architecture & Parallel Systems Laboratory University of Delaware Newark DE

ISBN: (纸本)9781509032068

Codelet model is a fine-grained, event-driven hybrid parallel model inspired by dataflow, whose performance depends on the scheduling policy. How to design optimal codelet scheduling policy based on the features of tasks is important to the codelet-based system performance. In this paper, we propose an adaptive codelet scheduling policy by combing "pure" genetic algorithm for tasks with complex dependencies. It is verified that the policy is effective based on bunches of experimental results.

关键词： Computational modeling Schedules Genetic algorithms Scheduling Runtime Optimal scheduling Sociology

来源：评论

学校读者我要写书评

暂无评论

PreCrime to the rescue: Defeating mobile malware one-step ahead 14

PreCrime to the rescue: Defeating mobile malware one-step ah...

引用

5th ACM Asia-Pacific Workshop on systems, APSYS 2014

作者： Tan, Cheng Li, Haibo Xia, Yubin Zang, Binyu Chu, Cheng-Kang Li, Tieyan Institute of Parallel and Distributed Systems Shanghai Jiao Tong University China Software School Fudan University China State Key Laboratory of Computer Architecture ICT Chinese Academy of Sciences China Huawei Technologies Pte Ltd. Singapore Singapore

ISBN: (纸本)9781450330244

Prior mobile malware defensive means is usually retroactive, which may either lead to high false negatives or can hardly recover systems states from malware activities. PreCrime is a proactive malware detection scheme that detects and stops malware activities from happening. PreCrime creates mirrors of a mobile device in a resource-rich and trusted cloud, which speculatively executes multiple likely user operations concurrently to detect potential tampering and information leakage. Our preliminary evaluation shows that PreCrime introduces small performance overhead on smartphones and feasible delay during speculative execution on the cloud. © 2014 ACM.

关键词： Malware

来源：评论

学校读者我要写书评

暂无评论

Massively parallel breadth first search using a tree-structured memory model

Massively parallel breadth first search using a tree-structu...

引用

2012 International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2012

作者： St. John, Tom Dennis, Jack B. Gao, Guang R. Computer Architecture and Parallel Systems Laboratory University of Delaware United States Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology United States

ISBN: (纸本)9781450312110

Analysis of massive graphs has emerged as an important area for massively parallel computation. In this paper, it is shown how the Fresh Breeze trees-of-chunks memory model may be used to perform breadth-first search of large undirected graphs. Overall, the computation can be expressed as a data flow process wherein a set of vertices to be searched is partitioned into a set of sub-domains and processed independently by many concurrent tasks. The main contributions of the paper are listed below. • We present the first case study demonstrating the power of the Fresh Breeze program execution model (PXM) in the exploitation of fine-grain parallelism found in irregular applications such as graph algorithms. • We present a novel parallel breadth-first search algorithm which is fully determinate. • We describe a unique sparse vector representation that represents the set of adjacencies for each vertex. • We provide an experimental study and analysis of our implementation. An estimate is also made of the performance that might be achieved with a massively parallel system built according to Fresh Breeze principles. © 2012 ACM.

关键词： Forestry

来源：评论

学校读者我要写书评

暂无评论

Combining gait research of the quadruped/biped reconfigurable walking chair with parallel leg mechanism

引用

4th International Conference on Social Robotics, ICSR 2012

作者： Hu, Xing Wang, Hongbo Sang, Lingfeng Gu, Qifang Yuan, Lin School of Mechanical and Electrical Engineering Xi'an University of Architecture and Technology Xi'an Shanxi 710055 China Ministry of Education Key Laboratory of Advanced Forging Technology and Science Hebei Province Key Laboratory of Parallel Robot and Mechatronics Systems Yanshan University Qinhuangdao Hebei 066004 China Department of Computer Wuxi City College of Vocational Technology Wuxi 214063 China

ISBN: (纸本)9783642341021

The quadruped/biped reconfigurable walking robot with parallel leg mechanism can realize not only the quadruped walking, but also the biped walking. The converting process from the quadruped to the biped includes locking the vertical revolute pair hinged with the upper platform and combining the corresponding lower platforms. Based on the previous study, the combining schemes of walking chair are researched in this paper, and then the correctness of the combining schemes is analyzed by using the position workspace of the swing leg and the body mechanism in different states which are obtained by the MATLAB software and anti-solution search method. Compared with the stability margin and the adjustment coordination of the body in the different combining schemes, the optimal combining gaits of walking chair are selected, which lays the theoretical foundation for the quadruped/biped converting control of walking chair. © 2012 Springer-Verlag.

关键词： MATLAB

来源：评论

学校读者我要写书评

暂无评论

Flexible hardware acceleration for instruction-grain lifeguards

Flexible hardware acceleration for instruction-grain lifegua...

引用

作者： Chen, Shimin Kozuch, Michael Gibbons, Phillip B. Ryan, Michael Strigkos, Theodoros Mowry, Todd C. Ruwase, Olatunji Vlachos, Evangelos Falsafi, Babak Ramachandran, Vijaya Intel Research Pittsburgh 4720 Forbes Ave. Pittsburgh PA 15213 United States Computer Science Department Carnegie Mellon University Pittsburgh PA United States Parallel Systems Architecture Laboratory École Polytechnique Fédérale de Lausanne Lausanne Switzerland Deartment of Computer Science University of Texas at Austin Austin TX United States

Instruction-grain lifeguards monitor executing programs at the granularity of individual instructions to quickly detect bugs and security attacks, but their fine-grain nature incurs high monitoring overheads. This article identifies three common sources of these overheads and proposes three techniques that together constitute a general-purpose hardware acceleration framework for lifeguards. © 2009 IEEE.

关键词： Data mining

来源：评论

学校读者我要写书评

暂无评论

Chip-Level Redundancy in Distributed Shared-Memory Multiprocessors

Chip-Level Redundancy in Distributed Shared-Memory Multiproc...

引用

Pacific Rim International Symposium on Dependable Computing

作者： Brian T. Gold Babak Falsafi James C. Hoe Computer Architecture Laboratory Carnegie Mellon University USA Sun MicroSystems Laboratories Inc. Menlo Park CA USA Parallel Systems Architecture Laboratory École Polytechnique Fédérale de Lausanne Switzerland

Distributed shared-memory (DSM) multiprocessors provide a scalable hardware platform, but lack the necessary redundancy for mainframe-level reliability and availability. Chip-level redundancy in a DSM server faces a key challenge: the increased latency to check results among redundant components. To address performance overheads, we propose a checking filter that reduces the number of checking operations impeding the critical path of execution. Furthermore, we propose to decouple checking operations from the coherence protocol, which simplifies the implementation and permits reuse of existing coherence controller hardware. Our simulation results of commercial workloads indicate average performance overhead is within 4% (9% maximum) of tightly coupled DMR solutions.

关键词： Hardware Redundancy Filters Protocols Protection computer architecture Delay Clocks Circuit faults Availability

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：