ISBN: 9788412110104 (print)
In this study, high-performance computing (HPC) on the K computer (a Japanese supercomputer) was used to estimate wind-induced pressures on various kinds of dome. Focusing on the supercritical Reynolds number (Re) regime and the complexity of the long-span roof of an actual open stadium, real conditions are set up for discussing the aerodynamic characteristics of a dome. Using the LES model for strong wind, this study numerically elucidates wind flow patterns and unsteady pressures around domes of various shapes and provides information on the wind load acting on a dome under real conditions for wind-resistant design.
In this paper we introduce a novel advertisement system, dedicated to multimedia documents broadcast over the Internet. The proposed approach takes into account the consumer’s perspective and inserts contextual rel...
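To address the speed bottleneck and power constraints that separable convolutional neural networks face when classifying aircraft target types on board satellites, a floating-point depthwise separable convolutional neural network acceleration method based on field-programmable gate array (FPGA) dataflow scheduling is proposed, which accelerates the general-purpose MobileNet image classification model. A depthwise separable convolution compute array built on a multiplier matrix and a forward adder tree resolves the line-rate throughput bottleneck of floating-point depthwise separable convolution acceleration. Experimental results show that the FPGA-based target classification runs at 633 FPS with a power consumption of 22.226 W and a computing performance of 236.04 GFLOPS; its computing speed is 1.10 to 2.61 times that of a Titan Xp GPU, and its computing efficiency is 7.44 to 18.66 times that of a Titan Xp GPU. Among comparable FPGA-based floating-point convolution acceleration schemes, the method achieves the best computing performance and energy efficiency. The method also delivers image classification accuracy consistent with the original model, decouples the hardware/software co-development flow, and lowers the barrier for application developers to use FPGA-accelerated computing.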
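As a rough check on the reported figures, the energy efficiency and the energy spent per classified frame can be derived from the three numbers given in the abstract. This is only an arithmetic sketch: the function names are hypothetical, and it assumes the reported power and throughput refer to the same sustained operating point, which the abstract does not state explicitly.

```python
# Figures as reported in the abstract above.
FPS = 633.0            # classified frames per second
POWER_W = 22.226       # power consumption in watts
PERF_GFLOPS = 236.04   # computing performance in GFLOPS


def energy_efficiency(gflops: float, watts: float) -> float:
    """Energy efficiency in GFLOPS per watt."""
    return gflops / watts


def energy_per_frame(watts: float, fps: float) -> float:
    """Energy in joules spent per classified frame (watts = joules/second)."""
    return watts / fps


if __name__ == "__main__":
    print(f"{energy_efficiency(PERF_GFLOPS, POWER_W):.2f} GFLOPS/W")
    print(f"{energy_per_frame(POWER_W, FPS) * 1000:.1f} mJ per frame")
```

Under that assumption the design works out to roughly 10.6 GFLOPS/W and about 35 mJ per frame.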
With the increasing demand for indoor navigation applications, indoor navigation has become a research hotspot in many technical fields. High-precision sensors are expensive, and they are often used in industrial and ...
ISBN: 9781728174433 (digital)
ISBN: 9781728174440 (print)
The need to create a cloud architecture arises because most people do not pay proper attention to their health, often putting off an appointment with a doctor indefinitely. In some cases this can lead to grave consequences. Using the remote cloud monitoring model will allow patients registered in a medical institution to receive medical care in a timely manner thanks to remote automated health monitoring. The provision of services and emergency decisions with various levels of complexity is realized through the interaction of the web interface, the layer of intelligent data processing, the agent unit, and the accumulation and analysis of experience, as well as by adapting the neural network data processing for high-performance computing systems.
Due to the limited search space of existing performance optimization approaches at the software architecture of cloud applications (SAoCA) level, it is difficult for these methods to obtain the cloud resource usage s...
Several devices are capable of capturing images with a large number of people, including those of high resolution known as gigapixel images. These images can be helpful for studies and investigations, such as finding ...
Electronic medical discharge summaries provide a wealth of information. Extracting useful structured information from such unstructured text is challenging. However, supervised machine learning (ML) algorithms can ach...
ISBN: 9781728114446 (print)
Over a billion mobile consumer system-on-chip (SoC) chipsets ship each year. Of these, the mobile consumer market, undoubtedly involving smartphones, has a significant market share. Most modern smartphones comprise advanced SoC architectures that are made up of multiple cores, GPS, and many different programmable and fixed-function accelerators connected via a complex hierarchy of interconnects with the goal of running a dozen or more critical software use cases under strict power, thermal and energy constraints. The steadily growing complexity of a modern SoC challenges hardware computer architects on how best to do early-stage ideation. Late SoC design typically relies on detailed full-system simulation once the hardware is specified and accelerator software is written or ported. However, early-stage SoC design must often select accelerators before a single line of software is written. To help frame SoC thinking and guide early-stage mobile SoC design, in this paper we contribute the Gables model, which refines and retargets the Roofline model (designed originally for the performance and bandwidth limits of a multicore chip) to model each accelerator on a SoC, to apportion work concurrently among different accelerators (justified by our use case analysis), and to calculate a SoC performance upper bound. We evaluate the Gables model with an existing SoC and develop several extensions that allow Gables to inform early-stage mobile SoC design.
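The idea of applying a Roofline bound per accelerator and then combining the bounds for concurrently running units can be sketched as follows. The abstract does not give the Gables equations, so this is a simplified illustration under stated assumptions rather than the paper's formulation: the function names, the work-fraction scheme, the optional shared-memory term, and every number in the usage example are hypothetical.

```python
from typing import List, Optional, Tuple

# (peak ops/s, local bandwidth in bytes/s, operational intensity in ops/byte)
Unit = Tuple[float, float, float]


def roofline(peak_ops: float, peak_bw: float, intensity: float) -> float:
    """Classic Roofline bound: min(compute roof, bandwidth * intensity)."""
    return min(peak_ops, peak_bw * intensity)


def concurrent_bound(units: List[Unit],
                     fractions: List[float],
                     shared_bw: Optional[float] = None) -> float:
    """Upper bound on ops/s when units work concurrently on fixed work shares.

    ASSUMPTION (not taken from the abstract): each unit i handles a fraction
    f_i of the total work, and the slowest participant sets the pace.
    An optional shared memory bandwidth adds one more potential bottleneck.
    """
    if len(units) != len(fractions):
        raise ValueError("one work fraction per unit is required")
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("work fractions must sum to 1")
    times = []      # seconds each bottleneck needs per unit of total work
    traffic = 0.0   # bytes moved per unit of total work
    for (peak, bw, intensity), f in zip(units, fractions):
        if f == 0.0:
            continue  # an idle unit cannot be the bottleneck
        times.append(f / roofline(peak, bw, intensity))
        traffic += f / intensity
    if shared_bw is not None:
        times.append(traffic / shared_bw)
    return 1.0 / max(times)


if __name__ == "__main__":
    # Hypothetical CPU and accelerator, purely for illustration.
    cpu: Unit = (40e9, 10e9, 8.0)
    acc: Unit = (200e9, 15e9, 8.0)
    print(concurrent_bound([cpu, acc], [0.25, 0.75]))
    print(concurrent_bound([cpu, acc], [0.25, 0.75], shared_bw=10e9))
```

In this sketch the split between units matters as much as the raw peaks: shifting work toward a unit that is already bandwidth-limited lowers the combined bound even when its compute roof is high.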
ISBN: 9781728106557 (print)
In recent years, with the progress of VLSI technology, artificial intelligence / deep learning has become a major trend. To satisfy the demand for high-performance computing, many AI ASICs adopt a multi-core architecture. The clocking challenges of this architecture consist of timing closure issues and the implementation of a low-latency, low-skew and low-OCV top-level clock tree for high-speed operation (> 1 GHz). In this paper, we share our experiences with clock tree synthesis for HPC ASICs. Two different clocking strategies are introduced: H-tree planning with customized big drivers, and clock mesh.