检索结果-内蒙古大学图书馆

Optimized training for convolutional neural network using enhanced grey wolf optimization algorithm

Informatica (Slovenia) 2021年第5期45卷 731-739页

作者： Guernine, Akram Kimour, Mohamed Tahar Laboratory of Embedded Systems Department of Computer Science University of Badji Mokhtar-Annaba Po Box.12 Annaba Sidi Amar Algeria Center of research on environment Annaba Algeria

Convolutional Neural Networks (CNNs) are widely used in image classification tasks and have achieved significant performance. They have different applications with great success, especially in the medical field. The choice of architecture and hyperparameter settings of the CNN, highly influences both accuracy and its convergence speed. The empirical design and optimization of a new CNN architecture require a lot of expertise and can be very time-consuming. This paper proposes an enhanced Grey Wolf Optimization (GWO) algorithm to efficiently explore a defined space of potentially suitable CNN architectures, and simultaneously optimize their hyperparameters. Moreover, we introduce a spatial resolution reduction for a given image processing task, while taking skin cancer detection as a practical application. Through conducted experiments, we have shown that the obtained results are better than other classification methods in terms of accuracy and convergence speed. © 2021 Slovene Society Informatika. All rights reserved.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Exploration of Optimizing FPGA-based Qubit Controller for Experiments on Superconducting Quantum Computing Hardware

Exploration of Optimizing FPGA-based Qubit Controller for Ex...

引用

IEEE International Conference on Electro-Information Technology

作者： Hans Johnson Silvia Zorzetti Jafar Saniie Department of Electrical and Computer Engineering Embedded Computing and Signal Processing (ECASP) Research Laboratory Illinois Institute of Technology Chicago IL U.S.A. Superconducting Quantum Materials and Systems Center (SQMS) Fermi National Accelerator Laboratory Batavia IL U.S.A.

This work explores avenues and target areas for optimizing FPGA-based control hardware for experiments conducted on superconducting quantum computing systems and serves as an introduction to some of the current research at the intersection of classical and quantum computing hardware. With the promise of building larger-scale error-corrected quantum computers based on superconducting qubit architecture, innovations to room-temperature control electronics are needed to bring these quantum realizations to fruition. The QICK (Quantum Instrumentation Control Kit) is one leading experimental FPGA-based implementations. However, its integration into other experimental quantum computing architectures, especially those using superconducting radiofrequency (SRF) cavities, is largely unexplored. We identify some key target areas for optimizing control electronics for superconducting qubit architectures and provide some preliminary results to the resolution of a control pulse waveform. With optimizations targeted at 3D superconducting qubit setups, we hope to bring to light some of the requirements in classical computational methodologies to bring out the full potential of this quantum computing architecture, and to convey the excitement of progress in this research.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Demonstrating the Potential of Adaptive LMS Filtering on FPGA-Based Qubit Control Platforms for Improved Qubit Readout in 2D and 3D Quantum Processing Units

Demonstrating the Potential of Adaptive LMS Filtering on FPG...

引用

Quantum Computing and Engineering (QCE), IEEE International Conference on

作者： Hans Johnson Nicholas Bornman Taeyoon Kim David Van Zanten Silvia Zorzetti Jafar Saniie Department of Electrical and Computer Engineering Embedded Computing and Signal Processing (ECASP) Research Laboratory Illinois Institute of Technology Chicago IL U.S.A. Fermi National Accelerator Laboratory Superconducting Quantum Materials and Systems (SQMS) Center Batavia IL U.S.A. Department of Physics and Astronomy Center for Applied Physics and Superconducting Technologies (CAPST) Northwestern University Evanston IL U.S.A.

ISBN: (数字)9798331541378

ISBN: (纸本)9798331541385

Advancements in quantum computing underscore the critical need for sophisticated qubit readout techniques to accurately discern quantum states. This abstract presents our research intended for optimizing readout pulse fidelity for 2D and 3D Quantum Processing Units (QPUs), the latter coupled with Superconducting Radio Frequency (SRF) cavities. Focusing specifically on the application of the Least Mean Squares (LMS) adaptive filtering algorithm, we explore its integration into the FPGA-based control systems to enhance the accuracy and efficiency of qubit state detection by improving Signal-to-Noise Ratio (SNR). Implementing the LMS algorithm on the Zynq UltraScale+ RFSoC Gen 3 devices (RFSoC 4x2 FPGA and ZCU216 FPGA) using the Quantum Instrumentation Control Kit (QICK) open-source platform, we aim to dynamically test and adjust the filtering parameters in real-time to characterize and adapt to the noise profile presented in quantum computing readout signals. Our preliminary results demonstrate the LMS filter's capability to maintain high readout accuracy while efficiently managing FPGA resources. These findings are expected to contribute to developing more reliable and scalable quantum computing architectures, highlighting the pivotal role of adaptive signal processing in quantum technology advancements.

关键词： Three-dimensional displays Accuracy Filtering Qubit Signal processing algorithms Superconducting filters Real-time systems Reliability Field programmable gate arrays Signal to noise ratio

来源：评论

学校读者我要写书评

暂无评论

ARB-NET: A novel adaptive monitoring platform for stacked mesh 3D NoC architectures

ARB-NET: A novel adaptive monitoring platform for stacked me...

引用

Asia and South Pacific Design Automation Conference

作者： Amir-Mohammad Rahmani Khalid Latif Kameswar Rao Vaddina Pasi Liljeberg Juha Plosila Hannu Tenhunen Turku Center for Computer Science Finland Department of Information Technology Embedded Computer Systems Laboratory University of Turku Finland

The emerging three-dimensional integrated circuits (3D ICs) offer a promising solution to mitigate the barriers of interconnect scaling in modern systems. In order to exploit the intrinsic capability of reducing the wire length in 3D ICs, 3D NoC-Bus Hybrid mesh architecture was proposed. Besides its various advantages in terms of area, power consumption, and performance, this architecture has a unique and hitherto previously unexplored way to implement an efficient system-wide monitoring network. In this paper, an integrated low-cost monitoring platform for 3D stacked mesh architectures is proposed which can be efficiently used for various system management purposes. The proposed generic monitoring platform called ARB-NET utilizes bus arbiters to exchange the monitoring information directly with each other without using the data network. As a test case, based on the proposed monitoring platform, a fully congestion-aware adaptive routing algorithm named AdaptiveXYZ is presented taking advantage from viable information generated within bus arbiters. Our extensive simulations with synthetic and real benchmarks reveal that our architecture using the AdaptiveXYZ routing can help achieving significant power and performance improvements compared to recently proposed stacked mesh 3D NoCs.

关键词： Monitoring Three dimensional displays Stress Routing computer architecture Measurement units System-on-a-chip

来源：评论

学校读者我要写书评

暂无评论

An Efficient Hybridization Scheme for Stacked Mesh 3D NoC Architecture

An Efficient Hybridization Scheme for Stacked Mesh 3D NoC Ar...

引用

Euromicro Conference on Parallel, Distributed and Network-Based Processing

作者： Amir-Mohammad Rahmani Pasi Liljeberg Juha Plosila Hannu Tenhunen Turku Center for Computer Science Finland Embedded Computer Systems Laboratory Department of Information Technology University of Turku Finland

Three-dimensional (3D) integration is a viable design paradigm to overcome the existing interconnect bottleneck in integrated systems and enhance system power/performance characteristics. In order to exploit the intrinsic capability of reducing the wire length in 3D ICs, stacked mesh 3D NoC architecture was proposed. However, this architecture suffers from naive and straightforward hybridization between NoC and bus media. In this paper, an efficient hybridization scheme is presented to enhance system performance, power consumption, and area of stacked mesh 3D NoC architectures. By utilizing a routing rule called LastZ the proposed hybridization scheme offers many advantages investigated in detail to emphasize the significant achievements. Our extensive simulations with synthetic and real benchmarks, including an integrated videoconference application show that compared to a typical 3D NoC-Bus Hybrid Mesh architecture, our hybridization scheme achieves significant power, performance, and area improvements.

关键词： Three dimensional displays Routing computer architecture Power demand Through-silicon vias Throughput Hybrid power systems

来源：评论

学校读者我要写书评

暂无评论

OpenMP directive extension for BlackFin 561 dual core processor

OpenMP directive extension for BlackFin 561 dual core proces...

引用

6th IEEE International Conference on computer and Information Technology, CIT 2006

作者： Seo, Hee Kim, Seon Wook Compilers and Embedded Systems Laboratory Department of Electronics and Computer Engineering Korea University Seoul Korea Republic of

ISBN: (纸本)076952687X

Many researchers and vendors are exploiting the increasing number of transistors to build chip multiprocessors (CMPs) by partitioning a chip into multiple simple ILP cores. As in traditional multiprocessors, CMPs extract thread-level parallelism (TLP) from programs by running multiple independent program segments, i.e., threads, in parallel. Currently CMPs are used widely in high performance servers, and even in embedded systems. In this paper, we present an extension of the OpenMP shared directive for performance optimization on BlackFin 561 (ADSP-BF561) dual core processors. In order to support memory consistency between multiple cores, many architectures have been proposed. On the dual core processor, like ADSP-BF561, each core has its own private LI cache, and a shared L2 cache. In order to execute multithreaded parallel programs, we need to consider carefully where to allocate shared variables on targeted memory architecture. We could improve the speedup by up to 107% and reduce the energy consumption by up to 108% in our measured benchmarks with respect to no use of our extension. © 2006 IEEE.

关键词： Program processors

来源：评论

学校读者我要写书评

暂无评论

OpenMP Directive Extension for BlackFin 561 Dual Core Processor

OpenMP Directive Extension for BlackFin 561 Dual Core Proces...

引用

International Conference on computer and Information Technology (CIT)

作者： Hee Seo Seon Wook Kim Compilers and Embedded Systems Laboratory Department of Electronics and Computer Engineering Korea University of Technology and Education Seoul South Korea

Many researchers and vendors are exploiting the increasing number of transistors to build chip multiprocessors (CMPs) by partitioning a chip into multiple simple ILP cores. As in traditional multiprocessors, CMPs extract thread-level parallelism (TLP) from programs by running multiple independent program segments, i.e., threads, in parallel. Currently CMPs are used widely in high performance servers, and even in embedded systems. In this paper, we present an extension of the OpenMP shared directive for performance optimization on BlackFin 561 (ADSPBF561) dual core processors. In order to support memory consistency between multiple cores, many architectures have been proposed. On the dual core processor, like ADSP-BF561, each core has its own private L1 cache, and a shared L2 cache. In order to execute multithreaded parallel programs, we need to consider carefully where to allocate shared variables on targeted memory architecture. We could improve the speedup by up to 107% and reduce the energy consumption by up to 108% in our measured benchmarks with respect to no use of our extension.

关键词： embedded system Yarn Energy consumption Surface-mount technology Parallel processing Energy measurement Velocity measurement Memory management Prefetching Laboratories

来源：评论

学校读者我要写书评

暂无评论

Static Energy Saving Through Multi-Bank Memory Architecture

Static Energy Saving Through Multi-Bank Memory Architecture

引用

International Conference on embedded computer systems: architectures, Modeling and Simulation (IC-SAMOS)

作者： Sebastien Lafond Johan Lilius Embedded Systems Laboratory Turku Center for Computer Science Turku Finland Department of Information Technologies Abo Akademi University Abohar Finland

Managing the energy consumption of embedded systems has become a major problem with the increasing demand for portable electronic devices. This paper proposes a multi-bank memory architecture as a solution to decrease the static energy cost in memory. We set up the equations ruling the optimization problem for decreasing the memory static energy cost, analyze the impact of different parameters on the energy cost and finally present some case study results

关键词： Memory architecture Energy consumption Cost function Consumer electronics Equations computer architecture Application software computer science embedded system Laboratories

来源：评论

学校读者我要写书评

暂无评论

Functional validation of programmable architectures

Functional validation of programmable architectures

引用

Euromicro Symposium on Digital System Design

作者： P. Mishra N. Dutt Architectures and Compilers for Embedded Systems (ACES) Laboratory Center for Embedded Computer Systems University of California Irvine USA

Validation of programmable architectures, consisting of processor cores, coprocessors, and memory subsystems, is one of the major bottlenecks in current system-on-chip design methodology. A critical challenge in validation of such systems is the lack of a golden reference model. Traditional validation techniques employ different reference models depending on the abstraction level and verification task (e.g., functional simulation or property checking), resulting in potential inconsistencies between multiple reference models. This paper presents a validation methodology that uses an architecture description language (ADL) based specification as a golden reference model for validation of programmable architectures, and generation of executable models such as simulators and hardware prototypes. We present a validation framework that uses the generated hardware as a reference model to verify the hand-written implementation using a combination of symbolic simulation and equivalence checking. We also present functional coverage based test generation techniques for validation of pipelined processor architectures. Finally, the generated simulator and hardware models are also used for early exploration of programmable architectures.

关键词： Pervasive computing computer architecture embedded computing computer bugs Hardware Personal digital assistants Coprocessors Computational modeling Handheld computers North America

来源：评论

学校读者我要写书评

暂无评论

Synthesis-driven exploration of pipelined embedded processors

Synthesis-driven exploration of pipelined embedded processor...

引用

International Conference on VLSI Design

作者： P. Mishra A. Kejariwal N. Dutt Architectures and Compilers for Embedded Systems (ACES) Laboratory Center for Embedded Computer Systems University of California Irvine CA USA

Recent advances on language based software toolkit generation enables performance driven exploration of embedded systems by exploiting the application behavior. There is a need for an automatic generation of hardware to determine the required silicon area, clock frequency, and power consumption of the candidate architectures. In this paper, we present a language based exploration framework that automatically generates synthesizable RTL models for pipelined processors. Our framework allows varied micro-architectural modifications, such as, addition of pipeline stages, pipeline paths, opcodes and new functional units. The generated RTL is synthesized to determine the area, power, and clock frequency of the modified architectures. Our exploration results demonstrate the power of reuse in composing heterogeneous architectures using functional abstraction primitives allowing for a reduction in the time for specification and exploration by at least an order of magnitude.

关键词： Power generation Clocks Pipelines embedded software Software performance Software tools Distributed power generation embedded system Application software Hardware

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：