Search Results - Inner Mongolia University Library

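Help

Field codes:

T = title (book title, article title), A = author (responsible party), K = subject term, P = publication name, PU = publisher name, O = organization (author affiliation, degree-granting institution, patent applicant), L = Chinese Library Classification number, C = discipline classification number, U = all fields, Y = year (year of publication, degree year, standard release year)

Search rules:

AND means "and"; OR means "or"; NOT means "does not contain". Operators must be written in uppercase, with one space on each side.

Search examples:

Example 1: (K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
Example 2: P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016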
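
The search rules above can be sketched as a small query builder. This is only an illustration, not part of the library's system: the helper names `term`, `join`, and `group` are hypothetical. It assembles a string in the documented syntax (field code, `=`, value, uppercase operators with a single space on each side).

```python
# Field codes listed in the help text above.
FIELD_CODES = {"T", "A", "K", "P", "PU", "O", "L", "C", "U", "Y"}
# Operators must be uppercase and surrounded by single spaces.
OPERATORS = {"AND", "OR", "NOT"}


def term(field: str, value: str) -> str:
    """Build one 'FIELD=value' term, rejecting unknown field codes."""
    if field not in FIELD_CODES:
        raise ValueError(f"unknown field code: {field}")
    return f"{field}={value}"


def join(left: str, op: str, right: str) -> str:
    """Join two expressions with an operator padded by one space on each side."""
    if op not in OPERATORS:
        raise ValueError(f"operator must be one of {sorted(OPERATORS)}: {op}")
    return f"{left} {op} {right}"


def group(expr: str) -> str:
    """Wrap an expression in parentheses."""
    return f"({expr})"


# Rebuild the tail of Example 2 from the help text.
either_language = group(join(term("U", "C++"), "OR", term("U", "Basic")))
query = join(join(either_language, "NOT", term("K", "Visual")),
             "AND", term("Y", "2011-2016"))
print(query)
```

Building the string through helpers keeps the uppercase and spacing rules in one place, so a lowercase `and` or an unknown field code fails early instead of producing a query the search form would misread.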


Refine search results

Document type

2,087 conference papers
31 journal articles
8 books

Holdings

2,126 electronic documents
0 print holdings

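Discipline classification

1,363 records: Engineering
- 1,143 records: Computer Science and Technology...
- 757 records: Software Engineering
- 452 records: Electrical Engineering
- 226 records: Electronic Science and Technology (may...
- 90 records: Information and Communication Engineering
- 79 records: Control Science and Engineering
- 41 records: Mechanical Engineering
- 38 records: Instrument Science and Technology
- 32 records: Power Engineering and Engineering Thermo...
- 29 records: Architecture
- 19 records: Civil Engineering
- 19 records: Bioengineering
- 17 records: Nuclear Science and Technology
- 15 records: Optical Engineering
- 13 records: Biomedical Engineering (may confer...
- 9 records: Materials Science and Engineering (may...
- 9 records: Safety Science and Engineering
- 8 records: Transportation Engineering
- 6 records: Petroleum and Natural Gas Engineering
181 records: Science
- 104 records: Mathematics
- 43 records: Physics
- 22 records: Biology
- 20 records: Systems Science
- 13 records: Statistics (may confer science, ...
- 8 records: Chemistry
53 records: Management
- 41 records: Management Science and Engineering (may...
- 21 records: Business Administration
- 15 records: Library, Information and Archives Manage...
17 records: Law
- 15 records: Sociology
13 records: Medicine
11 records: Economics
- 11 records: Applied Economics
6 records: Military Science
5 records: Agronomy
3 records: Education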

Topic

947 records: field programmab...
735 records: field programmab...
322 records: hardware
172 records: computer archite...
146 records: logic gates
130 records: table lookup
115 records: clocks
96 records: throughput
94 records: random access me...
85 records: routing
82 records: software
80 records: acceleration
75 records: delays
74 records: optimization
67 records: kernel
62 records: logic
62 records: switches
61 records: registers
56 records: algorithm design...
52 records: parallel process...

Institution

17 records: univ toronto dep...
9 records: imperial coll lo...
9 records: ecole polytech f...
8 records: department of co...
8 records: tokyo inst techn...
7 records: univ british col...
7 records: school of comput...
7 records: brigham young un...
7 records: imperial coll lo...
6 records: univ tsukuba tsu...
6 records: xilinx inc 2100 ...
6 records: university of ch...
6 records: department of el...
6 records: univ warwick sch...
6 records: xilinx inc san j...
5 records: univ manchester ...
5 records: department of el...
5 records: univ toronto dep...
5 records: department of el...
5 records: fudan univ state...

Author

31 records: luk wayne
21 records: maruyama tsutomu
16 records: koch dirk
13 records: ienne paolo
12 records: wayne luk
12 records: wilton steven j....
12 records: betz vaughn
11 records: fahmy suhaib a.
11 records: cheung peter y. ...
11 records: chow paul
10 records: constantinides g...
9 records: vaughn betz
9 records: prasanna viktor ...
8 records: kapre nachiket
8 records: suhaib a. fahmy
8 records: alonso gustavo
8 records: hutchings brad
8 records: paolo ienne
8 records: habib mehrez
8 records: amano hideharu

Language

2,111 records: English
9 records: Other
5 records: Chinese
1 record: Russian

Search query: "Any field = 16th International Conference on Field Programmable Logic and Applications"

2,126 records in total; showing records 121-130


A Scalable and Cost-Efficient Antenna Testbed Using FPGA-Server Compound Structures for Prototyping 6G Applications

19th Annual International Conference on Distributed Computing in Smart Systems and the Internet of Things, DCOSS-IoT 2023

Authors: Neu, Marc; Karle, Christian; Nuss, Benjamin; Groeschel, Patrick; Becker, Juergen

Affiliations: ITIV, Karlsruhe Institute of Technology, Karlsruhe, Germany; LHFT, Nuremberg, University of Erlangen Nuremberg, Germany

ISBN: (print) 9798350346497

Developing the sixth-generation mobile network infrastructure is expected to utilize mm-Wave technology and multiple-input multiple-output (MIMO) arrays to enable high-speed, low-latency communication. Designing suitable MIMO arrays that support multi-GHz bandwidths per channel imposes enormous requirements on all sub-components performing analog-digital conversion, data acquisition, and digital signal processing. Current MIMO testbeds lack the bandwidth and radio-frequency sampling rates of future mobile network applications. Therefore, we propose a cost-efficient architecture based on off-the-shelf components that is scalable in the number of antennas per testbed and the attainable bandwidth per channel. We demonstrate the feasibility of our approach by implementing two testbeds and transmitting a broadband orthogonal frequency-division multiplexing signal over concurrent channels. In comparison to current MIMO testbeds, we support 2.46 times higher bandwidths per channel and 9.84 times higher radio-frequency sampling rates, which we demonstrate by performing in-field measurements. To conclude, our testbed allows for the prototyping of broadband MIMO antennas, forming the foundation for cost-efficient evaluation of novel communication protocols in large test fields. © 2023 IEEE.

Keywords: Field programmable gate arrays (FPGA)

Tetracycline Intelligent Target-Inducing Logic Gate Based on Triple-Stranded DNA Nanoswitch

16th International Conference on Bio-Inspired Computing: Theories and Applications, BIC-TA 2021

Authors: Xi, Sunfan; Wang, Yue; Hu, Mengyang; Wang, Luhui; Cheng, Meng; Dong, Yafei

Affiliations: College of Life Science, Shaanxi Normal University, Shaanxi, Xi’an 710119, China; School of Computer Science, Shaanxi Normal University, Shaanxi, Xi’an 710119, China

ISBN: (print) 9789811912559

A previously unreported three-strand DNA (ts-DNA) non-metal structure with hairpin structure was designed based on a strategy of logic switching and tetracycline (TC) signaling molecule switching. Will target and TC-binding aptamer (TBA) is the combination of body, and its relation with three functional chain DNA conformation, with programmable DNA sensor signal by FAM input molecule fluorescence emission monitoring. After the combination, the input signal is converted to a fluorescent signal to perform a logical operation. By introducing a novel nanomaterial, graphene oxide (GO), as a signal regulator, the fluorescence response changed by intermolecular interactions is the key to achieving logical calculation. Tetracycline can cause kidney damage, ear damage, gastrointestinal adverse reactions, skin allergies, *** tetracycline as a molecular trigger to induce conformational changes in DNA, a logical operation target detection method was constructed. In addition, nanoscale devices can be reconfigured and optimized to accommodate additional logical calculations and target detection. © 2022, Springer Nature Singapore Pte Ltd.

Keywords: DNA

Fitop-Trans: Maximizing Transformer Pipeline Efficiency through Fixed-Length Token Pruning on FPGA

International Conference on Field Programmable Logic and Applications

Authors: Kejia Shi; Manting Zhang; Keqing Zhao; Xiaoxing Wu; Yang Liu; Jun Yu; Kun Wang

Affiliations: School of Microelectronics, Fudan University, Shanghai, China

ISBN: (digital) 9798331530075

ISBN: (print) 9798331530082

Recent years have witnessed Transformers emerge as a groundbreaking innovation in the Natural Language Processing (NLP) field. Unlike Recurrent Neural Network (RNN) models, Transformers process sequences in parallel, boosting accuracy for longer sequences. However, Transformers face challenges with extended processing time. This is particularly due to the requirement of padding inputs to match the longest sentence in a batch, thereby increasing computational demands. In this paper, we present Fitop-Trans, the first algorithm-hardware co-optimized framework using a Fixed-Length Token Pruning strategy while deploying Transformers on FPGA. At the algorithmic level, we propose Fixed-Length Token Pruning. It is a novel pruning method which can maximize hardware efficiency in attention computation, aimed at eliminating unimportant tokens before the first layer. On the hardware side, a token selector is designed for Fixed-Length Token Pruning, which minimizes off-chip memory traffic. In addition, a partitionable Systolic Array (SA) is adopted, which is capable of handling varying input lengths and maximizing Digital Signal Processor (DSP) resource utilization. Furthermore, a scheduling module is designed to optimize hardware resource allocation and enhance pipeline attention throughput. Experimental results reveal that our hardware design on FPGA achieves a speedup of $580 \times$ and $6.39 \times$ in latency compared to Intel Xeon Gold CPU and NVIDIA GeForce RTX 3090.

Keywords: Gold; Recurrent neural networks; Processor scheduling; Pipelines; Signal processing algorithms; Transformers; Hardware; Natural language processing; Resource management; Field programmable gate arrays

Leveraging Dual Output LUTs with Pipelining for Efficient BCD to Binary Converter on FPGA

International Conference on VLSI Design

Authors: Santosh Kumar; Ayan Palchaudhuri

Affiliations: Department of Electronics & Communication Engineering, School of Electrical and Computer Sciences, Indian Institute of Technology Bhubaneswar, Argul, Khordha, Odisha, India

ISBN: (digital) 9798331522445

ISBN: (print) 9798331522452

Decimal operands expressed in BCD are often a convenient data format in embedded systems and human-centric applications. For optimized arithmetic computation on hardware where binary arithmetic can be conveniently processed, input BCD operands should first be converted to binary. This paper explores the versatility of FPGA-specific logic primitives, namely the dual-output Look-Up Tables (LUTs) and carry chain, for efficient realization of parallel and pipelined architectures to optimize throughput and resource utilization for BCD to binary conversion architectures. Primitive instantiation was adopted to ensure design optimization, which involved direct configuration of FPGA logic primitives. The utility ratio of the configured FPGA primitives was increased to extract the best performance. Experimental results demonstrate a good trade-off in speed and area for our proposed architectures when compared with existing designs. We have investigated multiple implementation variants for the addition tree, which serves as a crucial functional logic block in the converter design.

Keywords: Embedded systems; Computer architecture; Very large scale integration; Throughput; Table lookup; Logic; Field programmable gate arrays; Adders; Optimization; Arithmetic
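
The BCD to binary conversion that this abstract refers to can be illustrated in software. The sketch below is not the paper's LUT and carry-chain architecture; it only shows the arithmetic being implemented, assuming packed BCD (one decimal digit per 4-bit nibble). The function names `bcd_digits` and `bcd_to_binary` are hypothetical.

```python
def bcd_digits(bcd: int, num_digits: int) -> list[int]:
    """Split a packed-BCD word into nibbles, least significant digit first."""
    return [(bcd >> (4 * i)) & 0xF for i in range(num_digits)]


def bcd_to_binary(bcd: int, num_digits: int) -> int:
    """Convert packed BCD to a plain binary integer.

    Each digit is weighted by its power of ten and the partial products
    are summed, which is the role an addition tree plays in hardware.
    """
    digits = bcd_digits(bcd, num_digits)
    for d in digits:
        if d > 9:
            raise ValueError(f"invalid BCD digit: {d:#x}")
    return sum(d * 10 ** i for i, d in enumerate(digits))


# 0x1234 in packed BCD encodes the decimal number 1234.
print(bcd_to_binary(0x1234, 4))
```

The weighted-sum form makes the partial products independent of one another, which is why a hardware version can compute them in parallel and pipeline the additions.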


Small Area Footprint FPGA Architecture for Approximate atan2(a, b) Algorithm

9th IEEE Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering, UPCON 2022

Authors: Kumar, Bharat; Sarawadekar, Kishor

Affiliations: Dept. of Electronics Engineering, Varanasi, India

ISBN: (digital) 9798350332506

ISBN: (print) 9798350332506

The arctangent, or inverse tangent, function has numerous applications, such as gradient-based feature extraction, phase noise determination, and range rate measurement. This paper presents a small area footprint hardware architecture for computing the arctangent of a complex number. The proposed method uses numerical approximation, with LUTs used to improve the accuracy of the results obtained. Single-precision floating-point representation is used to implement the proposed design, and the results demonstrate very good accuracy, with an error of about ±0.0004 radian with a memory size of 256×32 bits. The proposed architecture is implemented on a Nexys4 DDR FPGA board using Verilog, and it operates at 19.8 MHz. The Integrated Logic Analyzer (ILA) is used to debug and validate the proposed design. Further, it is observed that results obtained with the proposed design are in agreement with the MATLAB simulation results. © 2022 IEEE.

Keywords: Field programmable gate arrays (FPGA)

A Software-Programmable Neural Processing Unit for Graph Neural Network Inference on FPGAs

International Conference on Field Programmable Logic and Applications

Authors: Taikun Zhang; Andrew Boutros; Sergey Gribok; Kwadwo Boateng; Vaughn Betz

Affiliations: Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada; Programmable Solutions Group, Intel Corporation

ISBN: (digital) 9798331530075

ISBN: (print) 9798331530082

Graph neural networks (GNNs) are a widely used class of deep learning (DL) models for learning latent representations of graph-structured data for a variety of node/graph-level prediction tasks, some of which require real-time low-latency inference. Most existing GNN accelerators rely on preprocessing input graphs on a host/embedded CPU to parallelize computations on different sub-graphs, making them unsuitable for real-time use cases. Others are extremely specialized streaming pipelines for only a specific type of GNN and therefore suffer from long FPGA bitstream compile times when the model is updated and cannot be used in applications that combine GNNs with other classes of DL models. In this work, we enhance the neural processing unit (NPU) FPGA overlay architecture, instruction set, and software stack to support a variety of GNN models. We achieve this without sacrificing the NPU flexibility; our enhanced NPU can be programmed purely through software to accelerate different GNNs or any of its originally supported DL workloads (e.g. MLPs, RNNs, GRUs, LSTMs). In addition, this flexibility enables our NPU software compiler to generate GNN kernels with different performance targets (throughput-optimized vs. latency-optimized) by exploiting different dimensions of compute parallelism on the same overlay architecture. Besides the flexibility benefits, our NPU implemented on an Intel Stratix 10 NX (14 nm) FPGA can process $7.8 \times$ more graphs per second at a similar latency on average compared to a state-of-the-art model-specific FPGA accelerator targeting real-time applications on an AMD Ultrascale+ same-generation FPGA. It also achieves $5.8 \times$ higher throughput compared to an Nvidia RTX A6000 GPU (8 nm) and $2.6 \times$ lower latency than a state-of-the-art accelerator that combines CPU-based graph preprocessing with AMD Versal (7 nm) fabric and AI engine compute. Finally, we present a case study for using our enhanced NPU in real-time GNN-based multi-input

Keywords: Computational modeling; Data preprocessing; Full stack; Computer architecture; Throughput; Graph neural networks; Real-time systems; Software; Vectors; Field programmable gate arrays

LORA: A Latency-Oriented Recurrent Architecture for GPT Model on Multi-FPGA Platform with Communication Optimization

International Conference on Field Programmable Logic and Applications

Authors: ZhenDong Zheng; Qianyu Cheng; Teng Wang; Lei Gong; Xianglan Chen; Cheng Tang; Chao Wang; Xuehai Zhou

Affiliations: School of Computer Science and Technology, Suzhou Institute for Advanced Research, University of Science and Technology of China

ISBN: (digital) 9798331530075

ISBN: (print) 9798331530082

Large Language Models (LLMs) have been widely deployed in data centers to provide various services, among which the most representative is the Generative Pre-trained Transformer (GPT). The GPT model has heavy memory and computing overhead, and its inference process has two stages with distinct computing characteristics: Prefill and Decode. Utilizing existing GPUs and FPGA accelerators to construct a platform for deploying GPT in data centers faces the challenges of needing more effective synchronization schemes or structures with higher computational intensity. This paper proposes LORA, a low-latency end-to-end GPT acceleration platform utilizing multiple FPGAs. Firstly, we optimize the synchronization timing of the GPT model to reduce the computation and communication overhead. Secondly, we devise some efficient synchronization steps for specific layers of the GPT model that overlap part of the computation and communication delay to improve the latency of our platform. Finally, we deploy recurrent structures on each FPGA to accelerate the different stages of the GPT model. Implemented on the Xilinx Alveo U280 FPGAs, LORA achieves an average $11.1 \times$ speedup over NVIDIA V100 GPUs on the modern GPT-2 model. Compared to the existing multi-FPGA accelerator appliance, LORA shows performance improvements of up to $4 \times$ and $2.7 \times$ in the Prefill and Decode stages.

Keywords: Data centers; Computational modeling; LoRa; Prototypes; Transformers; Data models; Synchronization; Low latency communication; Field programmable gate arrays; Optimization

NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions

International Conference on Field Programmable Logic and Applications

Authors: Marta Andronic; George A. Constantinides

Affiliations: Department of Electrical and Electronic Engineering, Imperial College London, UK

ISBN: (digital) 9798331530075

ISBN: (print) 9798331530082

Field-Programmable Gate Array (FPGA) accelerators have proven successful in handling latency- and resource-critical deep neural network (DNN) inference tasks. Among the most computationally intensive operations in a neural network (NN) is the dot product between the feature and weight vectors. Thus, some previous FPGA acceleration works have proposed mapping neurons with quantized inputs and outputs directly to lookup tables (LUTs) for hardware implementation. In these works, the boundaries of the neurons coincide with the boundaries of the LUTs. We propose relaxing these boundaries and mapping entire sub-networks to a single LUT. As the sub-networks are absorbed within the LUT, the NN topology and precision within a partition do not affect the size of the lookup tables generated. Therefore, we utilize fully connected layers with floating-point precision inside each partition, which benefit from being universal function approximators, but with rigid sparsity and quantization enforced between partitions, where the NN topology becomes exposed to the circuit topology. Although cheap to implement, this approach can lead to very deep NNs, and so to tackle challenges like vanishing gradients, we also introduce skip connections inside the partitions. The resulting methodology can be seen as training DNNs with a specific FPGA hardware-inspired sparsity pattern that allows them to be mapped to much shallower circuit-level networks, thereby significantly improving latency. We validate our proposed method on a known latency-critical task, jet substructure tagging, and on the classical computer vision task, digit classification using MNIST. Our approach allows for greater function expressivity within the LUTs compared to existing work, leading to up to $4.3 \times$ lower latency NNs for the same accuracy.

Keywords: Training; Quantization (signal); Neurons; Artificial neural networks; Tagging; Vectors; Table lookup; Topology; Field programmable gate arrays; Biological neural networks

New FPGA architecture for 8, 16 and 24 bit chaotic interleaver

5th International Conference on Engineering Technology and its Applications, IICETA 2022

Authors: Humadi, Zeina Abdullah; Al-Doori, Qusay F.

Affiliations: University of Technology, Control and Systems Engineering Department, Baghdad, Iraq

ISBN: (print) 9781665472159

In a digital system, the interleaver is essential. It is widely used to enhance the effectiveness of forward error correction codes. A lot of research has shown the efficiency of the chaotic interleaver in correcting 2D burst errors. We used this type of interleaver, based on the Baker map equation. The problem with this encoding technique arises in adaptive multi-standard systems, where the bit stream can be 8, 16, or 24 bits; hence, three types of interleavers are needed, which increases the complexity of the system. In this paper, an efficient chaotic interleaver model is presented with a new FPGA architecture that includes 8-, 16- and 24-bit square matrices. This new design lets the system handle more than one input matrix size, which is called a multi-standard system, instead of requiring a separate design for each size. The results were obtained using the Xilinx ISE 14.7 design suite. In the system generator, the generated waveforms are viewed using the scope block of Simulink. © 2022 IEEE.

Keywords: Field programmable gate arrays (FPGA)

HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator

International Conference on Field Programmable Logic and Applications

Authors: Zhewen Yu; Sudarshan Sreeram; Krish Agrawal; Junyi Wu; Alexander Montgomerie-Corcoran; Cheng Zhang; Jianyi Cheng; Christos-Savvas Bouganis; Yiren Zhao

Affiliations: Imperial College London, UK; University of Cambridge, UK

ISBN: (digital) 9798331530075

ISBN: (print) 9798331530082

Deep Neural Networks (DNNs) excel in learning hierarchical representations from raw data, such as images, audio, and text. To compute these DNN models with high performance and energy efficiency, these models are usually deployed onto customized hardware accelerators. Among various accelerator designs, dataflow architecture has shown promising performance due to its layer-pipelined structure and its scalability in data *** weights and activations sparsity can further enhance memory storage and computation efficiency. However, existing approaches focus on exploiting sparsity in non-dataflow accelerators, which cannot be applied onto dataflow accelerators because of the large hardware design space introduced. As such, this could miss opportunities to find an optimal combination of sparsity features and hardware *** this paper, we propose a novel approach to exploit unstructured weights and activations sparsity for dataflow accelerators, using software and hardware co-optimization. We propose a Hardware-Aware Sparsity Search (HASS) to systematically determine an efficient sparsity solution for dataflow accelerators. Over a set of models, we achieve an efficiency improvement ranging from $1.3 \times$ to $4.2 \times$ compared to existing sparse designs, which are either non-dataflow or non-hardware-aware. Particularly, the throughput of MobileNetV3 can be optimized to 4895 images per second. HASS is open-source: https://***/Yu-Zhewen/HASS

Keywords: Systematics; Computational modeling; Scalability; Heuristic algorithms; Memory management; Artificial neural networks; Throughput; Software; Logic; Optimization

Address: No. 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
