ISBN (Print): 9781665478960
To address the problems of low model accuracy, limited computing power, poor parallel capability, and excessive power consumption when deploying RGBD-based 3D target detection models on embedded devices, this paper first proposes an improved RGBD 3D target detection model based on the ENet semantic segmentation model: ENet is used as the semantic segmentation network, and RGB images and depth information are fused to realize 3D target detection. Second, to make the model deployable at the edge, this paper constructs a lightweight network by pruning the ENet model in its down-sampling stages. Finally, this paper uses the Xilinx ZCU104 as the hardware development kit, with the FPGA serving as the auxiliary parallel processing unit and the ARM processor as the main processing unit, forming a heterogeneous computing architecture capable of handling complex operations. The architecture uses the FPGA to accelerate the deep model in parallel, which increases operating speed and reduces power consumption. The test results of the model on the ZCU104 are compared with other hardware. The results show that, while maintaining accuracy, the heterogeneous computing architecture used in this paper consumes 93% less power than an Intel Xeon E5-2620 v4 CPU, runs 12 times faster than the CPU, and runs more than 180 times faster than the ARM Cortex-A53 commonly used at the edge.
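For a rough sense of what the quoted figures imply together, the sketch below combines the reported 93% power reduction and 12x speedup into an energy-per-inference comparison. The absolute CPU power and latency values are placeholder assumptions, not numbers from the paper; only the two ratios come from the abstract.

```python
# Back-of-the-envelope energy comparison derived from the figures quoted in the
# abstract above (93% lower power, 12x faster than an Intel Xeon E5-2620 v4).
# The absolute CPU numbers below are illustrative placeholders, not measurements.

cpu_power_w = 85.0        # assumed Xeon E5-2620 v4 package power (placeholder)
cpu_latency_s = 1.0       # assumed per-frame inference time on the CPU (placeholder)

fpga_power_w = cpu_power_w * (1.0 - 0.93)   # "93% lower power consumption"
fpga_latency_s = cpu_latency_s / 12.0       # "12 times higher speed"

cpu_energy_j = cpu_power_w * cpu_latency_s
fpga_energy_j = fpga_power_w * fpga_latency_s

print(f"Energy per inference: CPU {cpu_energy_j:.1f} J, FPGA+ARM {fpga_energy_j:.2f} J")
print(f"Implied energy-efficiency gain: {cpu_energy_j / fpga_energy_j:.0f}x")
```

Whatever placeholder values are chosen, the combined ratio is fixed by the abstract's two figures: 0.07 of the power at 1/12 of the time works out to roughly 170x less energy per inference.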
Memristor-based accelerators (MBAs) have demonstrated their capability in accelerating matrix-vector multiplication (MVM) with high performance and energy efficiency. However, it is hard to determine whether and how well an application can benefit from MBAs in a heterogeneous computing architecture. In this article, we propose a simulation framework called MHSim to evaluate the energy efficiency and performance of applications running with both MBAs and CPUs. MHSim provides flexible system-level interfaces and circuit-level simulation models for designers to configure heterogeneous computing architectures. We design a general-purpose MBA that enables floating-point computation models for general matrix-matrix multiplication (GEMM). Our simulation framework can quantify the performance and energy efficiency of different MBA architectures for various applications. We validate our simulation framework with SPICE and evaluate the accuracy and performance of MBAs via several case studies. Experimental results demonstrate that the deviations of energy consumption and latency are only 0.47% and 0.49% on average compared with SPICE-based simulation.
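As background on how an MBA carries out MVM, the sketch below models a crossbar in which weights are stored as conductances and the bit-line currents realize the product. It is a conceptual illustration only; the conductance range and the simple shift-based weight mapping are assumptions, and none of this reflects MHSim's actual interfaces or circuit models.

```python
import numpy as np

# Conceptual memristor-crossbar MVM: each weight is stored as a conductance,
# the input vector is applied as word-line voltages, and the bit-line currents
# (column-wise sums of G * V per Kirchhoff's current law) give the matrix-vector
# product in one analog step.

rng = np.random.default_rng(0)
weights = rng.standard_normal((4, 8))          # logical weight matrix (word lines x bit lines)

g_min, g_max = 1e-6, 1e-4                      # assumed conductance range in siemens
w_min, w_max = weights.min(), weights.max()
# Simple shift-and-scale mapping to positive conductances; real designs typically
# use differential conductance pairs to represent signed weights.
conductance = g_min + (weights - w_min) / (w_max - w_min) * (g_max - g_min)

voltages = rng.uniform(0.0, 0.2, size=4)       # word-line input voltages in volts
bitline_currents = voltages @ conductance      # analog MVM: current summed per column

print("bit-line currents (A):", bitline_currents)
```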
Deployment of modern TinyML tasks on small battery-constrained IoT devices requires high computational energy efficiency. Analog in-memory computing (IMC) using non-volatile memory (NVM) promises major efficiency improvements in deep neural network (DNN) inference and serves as on-chip memory storage for DNN weights. However, IMC's functional flexibility limitations and their impact on performance, energy, and area efficiency are not yet fully understood at the system level. To target practical end-to-end IoT applications, IMC arrays must be enclosed in heterogeneous programmable systems, introducing new system-level challenges which we aim to address in this work. We present a heterogeneous tightly-coupled clustered architecture integrating 8 RISC-V cores, an in-memory computing accelerator (IMA), and digital accelerators. We benchmark the system on a highly heterogeneous workload such as the Bottleneck layer from a MobileNetV2, showing 11.5x performance and 9.5x energy efficiency improvements compared to highly optimized parallel execution on the cores. Furthermore, we explore the requirements for end-to-end inference of a full mobile-grade DNN (MobileNetV2) in terms of IMC array resources by scaling up our heterogeneous architecture to a multi-array accelerator. Our results show that our solution, on end-to-end inference of MobileNetV2, is one order of magnitude better in terms of execution latency than existing programmable architectures and two orders of magnitude better than state-of-the-art heterogeneous solutions integrating in-memory computing analog cores.
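To see why the Bottleneck layer is a natural heterogeneity benchmark, the sketch below breaks one standard MobileNetV2 bottleneck block into its MAC counts. The channel and resolution figures correspond to one of MobileNetV2's published 14x14 stages; the comments about which kernel suits which engine describe a typical IMC/digital split and are an assumption, not the paper's exact partitioning.

```python
# Rough MAC-count breakdown of one MobileNetV2 bottleneck block. The 1x1
# (pointwise) convolutions are dense matrix products that map naturally onto an
# IMC array, while the 3x3 depthwise convolution has little weight reuse and is
# commonly left to the digital cores, which is what makes the block heterogeneous.

h = w = 14                 # spatial resolution of the feature map
c_in, c_out = 64, 64       # input/output channels of this block
expansion = 6              # MobileNetV2 expansion factor t
c_mid = c_in * expansion

macs_expand = h * w * c_in * c_mid      # 1x1 pointwise expansion
macs_dw = h * w * c_mid * 3 * 3         # 3x3 depthwise convolution
macs_project = h * w * c_mid * c_out    # 1x1 pointwise projection

total = macs_expand + macs_dw + macs_project
for name, macs in [("1x1 expand", macs_expand),
                   ("3x3 depthwise", macs_dw),
                   ("1x1 project", macs_project)]:
    print(f"{name:14s} {macs / 1e6:6.2f} MMAC ({100 * macs / total:4.1f}%)")
```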
ISBN (Print): 9781424483556
In modern online multi-player games, game providers are struggling to keep up with the many different types of cheating. Cheat detection is a task that requires a lot of computational resources. Advances made within the field of heterogeneous computing architectures, such as graphics processing units (GPUs), have given developers easier access to considerably more computational resources, enabling a new approach to solving this issue. In this paper, we have developed a small game simulator that includes a customizable physics engine and a cheat detection mechanism that checks the physical model used by the game. To make sure that the mechanisms are fair to all players, they are executed on the server side of the game system. We investigate the advantages of implementing physics cheat detection mechanisms on a GPU using the Nvidia CUDA framework, and we compare the GPU implementation of the cheat detection mechanism with a CPU implementation. The results obtained from the simulations show that offloading the cheat detection mechanisms to the GPU reduces the time spent on cheat detection, enabling the servers to support a larger number of clients.
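As a concrete illustration of a server-side physics check, the sketch below flags clients whose reported movement exceeds what the movement model allows in one update interval. It is a simplified NumPy stand-in for the kind of data-parallel test the paper offloads to the GPU, not the authors' CUDA implementation; the function name, speed limit, and tolerance are hypothetical.

```python
import numpy as np

# Simplified server-side speed-hack check: for every client, compare the distance
# moved between two consecutive state updates with the maximum distance the
# physics model allows in that interval. The per-client independence is what makes
# this kind of check amenable to GPU offloading.

def detect_speed_cheats(prev_pos, curr_pos, dt, max_speed, tolerance=1.05):
    """Return a boolean mask of clients whose movement exceeds the physics limit."""
    displacement = np.linalg.norm(curr_pos - prev_pos, axis=1)
    return displacement > max_speed * dt * tolerance

rng = np.random.default_rng(42)
n_clients = 10_000
prev_pos = rng.uniform(0, 1000, size=(n_clients, 3))
step = rng.normal(0, 0.1, size=(n_clients, 3))     # legitimate small movements
step[:5] *= 100.0                                  # a few teleporting cheaters
curr_pos = prev_pos + step

flags = detect_speed_cheats(prev_pos, curr_pos, dt=0.05, max_speed=10.0)
print(f"Flagged {flags.sum()} of {n_clients} clients")
```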