Search results - Inner Mongolia University Library

IEEE Pacific Rim Conference on Communications, Computers and Signal Processing

Authors: Duanyi Wang, H. Kobayashi (Department of Electrical Engineering, Princeton University, Princeton, NJ, USA)

ISBN: (print) 0780370805

This paper describes two new matrix transform algorithms for the max-log-MAP decoding of turbo codes. In the proposed algorithms, the successive decoding procedures of the conventional max-log-MAP algorithm are performed in parallel and formulated as a set of simple, regular matrix operations, which can considerably speed up decoding and reduce computational complexity. The matrix max-log-MAP algorithms also retain the advantage of logarithmic MAP-like algorithms in general: they avoid complex numerical representation problems. They particularly facilitate the implementation of logarithmic MAP-like algorithms in special-purpose parallel-processing VLSI hardware architectures, and they allow simple implementations using shift registers. The proposed implementation architectures for matrix max-log-MAP decoding can effectively reduce the memory capacity and simplify the data accesses and transfers required by the conventional max-log-MAP and MAP algorithms.

Keywords: Turbo codes; Very large scale integration; Iterative algorithms; Computational complexity; Parallel processing; Hardware; Iterative decoding; Computer architecture; Shift registers; Circuits
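
As an illustration of the max-log idea named in this abstract, the sketch below compares the exact Jacobian logarithm with its max-log approximation and writes one forward state-metric update as a max-plus matrix-vector product. It is a minimal sketch and not the authors' matrix transform algorithms: the function names, the 2-state trellis, and the branch metrics are hypothetical, chosen only for illustration.

```python
import math

NEG_INF = float("-inf")


def log_sum_exp(a, b):
    """Exact Jacobian logarithm log(exp(a) + exp(b)), as used by log-MAP."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    return max(a, b) + math.log1p(math.exp(-abs(a - b)))


def max_plus_matvec(gamma, alpha):
    """Max-plus product: out[j] = max_i (alpha[i] + gamma[i][j]).

    gamma[i][j] is a (hypothetical) branch metric from state i to state j.
    Max-log-MAP replaces log_sum_exp by max, which gives this form.
    """
    n_next = len(gamma[0])
    return [
        max(alpha[i] + gamma[i][j] for i in range(len(alpha)))
        for j in range(n_next)
    ]


def forward_metrics(gammas, alpha0):
    """Forward recursion over a list of branch-metric matrices."""
    alphas = [alpha0]
    for gamma in gammas:
        alphas.append(max_plus_matvec(gamma, alphas[-1]))
    return alphas


if __name__ == "__main__":
    # Illustrative 2-state trellis; the metric values are made up.
    gamma = [[0.0, -1.0], [-2.0, 0.5]]
    print(forward_metrics([gamma, gamma], [0.0, NEG_INF])[-1])  # [0.0, -0.5]
```

The max-log value never exceeds the exact one, since the correction term `log1p(exp(-|a - b|))` is non-negative; dropping it is what removes the exponentials and logarithms from the recursion.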

VHDL-based implementations of area and power efficient filter architectures

European Signal Processing Conference (EUSIPCO)

Authors: Ilkka Saastamoinen, Tapio Saramäki, Olli Vainio (Dept. of Information Technology, Tampere University of Technology, P.O. Box 553, Tampere, Finland)

Digital signal processing operations such as digital filtering are an important class of applications in communication devices. A digital filtering algorithm can be implemented in various ways by selecting one architecture from the set of possible realizations. Choosing an advanced architecture can yield notable savings in both silicon area and power dissipation compared with the conventional direct-form realization. This paper focuses on architectures based on interpolated finite impulse response (interpolated FIR) filters and recursive running-sum (RRS) filters. The VHDL-based implementations show that these advanced architectures are efficient when low-power or low-area characteristics are desired. Savings of over 55 percent in area and in power dissipation were achieved when an FIR filter with a narrow transition band was implemented using these architectures.
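
To make the recursive running-sum idea concrete, the sketch below computes a length-N running sum twice: once in direct form, summing N samples per output, and once recursively via y[n] = y[n-1] + x[n] - x[n-N], which needs one addition and one subtraction per output regardless of N. This is a minimal software sketch under assumed names and integer test data; it does not reproduce the paper's VHDL architectures or its area and power figures.

```python
def running_sum_direct(x, length):
    """Direct form: each output sums the most recent `length` samples."""
    return [sum(x[max(0, n - length + 1): n + 1]) for n in range(len(x))]


def running_sum_recursive(x, length):
    """Recursive form: y[n] = y[n-1] + x[n] - x[n-length].

    Samples before the start of the signal are treated as zero.
    """
    out = []
    acc = 0
    for n, sample in enumerate(x):
        acc += sample              # add the newest sample
        if n >= length:
            acc -= x[n - length]   # drop the sample leaving the window
        out.append(acc)
    return out


if __name__ == "__main__":
    signal = [1, 2, 3, 4, 5]
    print(running_sum_direct(signal, 3))     # [1, 3, 6, 9, 12]
    print(running_sum_recursive(signal, 3))  # [1, 3, 6, 9, 12]
```

Integer samples are used so that the two forms agree exactly; with floating-point data the recursive accumulator can drift, which is one reason hardware RRS filters are usually built with fixed-point arithmetic.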

Fast MPEG-4 Motion Estimation: Processor Based and Flexible VLSI Implementations

Journal of VLSI Signal Processing Systems for Signal, Image and Video Technology, 1999, vol. 23, no. 1, pp. 67-92

Authors: Kuhn, Peter M. (Sony Corporation, Tokyo, Japan)

MPEG-4 is a new multimedia standard combining interactivity, object-based natural and synthetic digital video, audio, and computer graphics. Implementing the video part of the MPEG-4 standard requires a high degree of flexibility, and motion estimation accounts for the largest share of the computational power. Therefore, this paper evaluates fast algorithms for MPEG-4 motion estimation in terms of visual quality and computational power requirements for processor-based implementations. Because MPEG-4 is object-based, new VLSI architectures for MPEG-4 motion estimation are also required, so known motion estimation architectures are evaluated on their capability of being modified for MPEG-4 support. Based on this evaluation, a new dedicated but flexible MPEG-4 motion estimation architecture targeted at low-power handheld applications is presented, which proved advantageous over processor-based implementations by orders of magnitude.

Adaptive lattice filter implementations on pipelined multiprocessor architectures

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: M.D. Meyer, D.P. Agrawal (Department of Electrical and Computer Engineering and Computer Systems Laboratory, North Carolina State University, Raleigh, NC, USA)

Three pipelined multiprocessor implementations of adaptive lattice filters are examined. The three multiprocessor architectures, which can be characterized as a serial pipeline, a ladder-connected dual pipeline, and a ring pipeline, are derived directly from the computational and data-transfer requirements of adaptive lattice algorithms. The order-recursive nature of the adaptive lattice structure results in architectures which use pipelining extensively. A performance analysis is done for each multiprocessor system, with respect to two different adaptive lattice algorithms. Expressions for approximate computation time and speedup are derived for each combination of architecture and algorithm.

Keywords: Lattices; Adaptive filters; Computer architecture; Pipeline processing; Reflection; Transversal filters; Feedback; Least squares approximation; Hardware; Filtering

Analysis and design of parallel algorithms and implementations of matrix multiplications for image and signal processing

IEEE Pacific Rim Conference on Communications, Computers and Signal Processing

Authors: M. Yasrebi, J.C. Browne (IBM Corporation, Austin, TX, USA; Computer Science Department, University of Technology, Austin, TX, USA)

Parallel matrix multiplication algorithms (based on the common data distribution formats) used in pattern recognition, image processing, and signal processing applications are discussed. A novel algorithm is introduced and is shown to be the fastest one for a determined class of applications. The algorithms are analyzed for performance as a function of array dimension, data distribution formats, and the architecture of the computer upon which the algorithms are executed. Performance bounds and speedups (linear in the number of processors) are established. The results of the analysis are given both as characterizations of executions on selected classes of architectures and also in the form of theorems which establish the relative performance of the algorithms across classes of data distributions and architectures.

Keywords: Algorithm design and analysis; Parallel algorithms; Signal processing algorithms; Application software; Performance analysis; Computer architecture; Pattern recognition; Image processing; Array signal processing; Distributed computing

CONCURRENT IMPLEMENTATIONS OF MATRIX EIGENVALUE DECOMPOSITION BASED ON ISOSPECTRAL FLOWS.

Real Time Signal Processing VI.

Authors: Ang, Peng-Huat; Delosme, Jean-Marc; Morf, Martin (Stanford Univ, Information Systems Lab, Stanford, CA, USA)

The authors evaluate several techniques for solving the symmetric tridiagonal problem based on the method of isospectral flow. Architectures which result from these considerations are discussed. Their advantages and d...

ISBN: (print) 0892524669

Keywords: COMPUTER ARCHITECTURE

Session 6: Parallel implementations of HEVC and FFT for embedded multi-/manycore systems

Conference on Design and Architectures for Signal and Image Processing

Authors: Karol Desnos (INSA Rennes, FR)

For many years, following the ever-increasing number of transistors per chip, advances in computer architecture mostly consisted of adding complex mechanisms to single-core processors to improve their computing performance. In the last decade, the continuous growth of computing performance has been supported by the introduction of multicore architectures, first for high-performance computing, then in mainstream desktop CPUs, and now in smartphones and embedded systems. Today, one of the main challenges researchers must overcome is finding how to implement applications that fully exploit the computing performance offered by these multicore architectures with tens, hundreds, and soon thousands of cores. In this session, parallel implementations of state-of-the-art signal and video processing applications on multicore and manycore architectures are presented. The first two talks focus on implementing an HEVC video encoder on modern architectures: the implementation of HEVC intra encoding algorithms on heterogeneous multicore architectures will be presented by Fraunhofer HHI, and the optimization of the complexity-quality tradeoff of hardware-accelerated HEVC coding will be presented by the Politecnico di Torino. Finally, an implementation of the Fast Fourier Transform on a manycore embedded system will be presented as the result of a collaboration between Kalray, INSA Rennes, and the Auckland University of Technology.

Keywords: Embedded systems; Multicore processing; Encoding; Fast Fourier transforms; Transistors; Program processors

FPGA implementations of HEVC Inverse DCT using high-level synthesis

Conference on Design and Architectures for Signal and Image Processing

Authors: Ercan Kalali, Ilker Hamzaoglu (Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul, Tuzla, Turkey)

High Efficiency Video Coding (HEVC), the recently developed international video compression standard, achieves 50% better video compression efficiency than the H.264 video compression standard at the expense of significantly increased computational complexity. The HEVC Inverse Discrete Cosine Transform (IDCT) algorithm accounts for 11% of the computational complexity of an HEVC video encoder. Recently, commercial and academic high-level synthesis (HLS) tools have started to be used successfully for FPGA implementations of digital signal processing algorithms. Therefore, this paper proposes the first FPGA implementations in the literature of the HEVC 2D IDCT algorithm using HLS tools. The proposed HEVC IDCT hardware is implemented on Xilinx FPGAs using three HLS tools: Xilinx Vivado HLS, LegUp, and MATLAB Simulink HDL Coder. Using HLS tools significantly reduced the FPGA development time, and the resulting FPGA implementations achieved real-time performance. Therefore, HLS tools can be used for the FPGA implementation of an HEVC video encoder.

Keywords: Hardware design languages; Field programmable gate arrays; MATLAB; Standards; Video compression; Discrete cosine transforms

Analog VLSI architectures for motion processing: From fundamental limits to system applications

PROCEEDINGS OF THE IEEE, 1996, vol. 84, no. 7, pp. 969-987

Authors: Sarpeshkar, R; Kramer, J; Indiveri, G; Koch, C (CALTECH, DIV BIOL, PASADENA, CA 91125, USA)

This paper discusses some of the fundamental issues in the design of highly parallel, dense, low-power motion sensors in analog VLSI. Since photoreceptor circuits are an integral part of all visual motion sensors, we discuss how the sizing of photosensitive areas can affect the performance of such systems. We review the classic gradient and correlation algorithms and give a survey of analog motion-sensing architectures inspired by them. We calculate how the measurable speed range scales with signal-to-noise ratio (SNR) for a classic Reichardt sensor with a fixed time constant. We show how this speed range may be improved using a nonlinear filter with an adaptive time constant, constructed out of a diode and a capacitor, and present data from a velocity sensor based on such a filter. Finally, we describe how arrays of such velocity sensors can be employed to compute the heading direction of a moving subject and to estimate the time-to-contact between the sensor and a moving object.

Keywords: limits various SNR implementations time constant discuss estimate delays classic analog VLSI architectures for Motion motion sensors visual motion motion-sensing Reichardt sensor nonlinear filter velocity sensors

Extracting side-channel leakage from round unrolled implementations of lightweight ciphers

2019 IEEE International Symposium on Hardware Oriented Security and Trust, HOST 2019

Authors: Chawla, Nikhil; Singh, Arvind; Rahman, Nael Mizanur; Kar, Monodeep; Mukhopadhyay, Saibal (Georgia Institute of Technology, Atlanta, GA, United States; Intel Corporation, Hillsboro, OR, United States)

ISBN: (print) 9781538680643

Energy efficiency and security are critical requirements for computing at edge nodes. Unrolled architectures for lightweight cryptographic algorithms have been shown to be energy-efficient, providing higher performance while meeting resource constraints. Hardware implementations of unrolled datapaths have also been shown to be resistant to side-channel analysis (SCA) attacks due to a reduction in signal-to-noise ratio (SNR) and an increased complexity in the leakage model. This paper demonstrates optimal leakage models and an improved correlation frequency analysis (CFA) attack which makes it feasible to extract first-order side-channel leakage from combinational logic in the initial rounds of unrolled datapaths. Several leakage models targeting the initial rounds are explored, and a 1-bit Hamming weight (HW) based leakage model is shown to be an optimal choice. Additionally, multi-band narrow bandpass filtering techniques in conjunction with CFA are demonstrated to improve SNR by up to 4×, attributed to the removal of the misalignment effect in combinational logic and to signal isolation. The improved CFA attack is performed on side-channel signatures acquired for 7-round unrolled SIMON datapaths implemented on a Sakura-G (Xilinx Spartan-6, 45 nm) FPGA platform, and a 24× reduction in minimum-traces-to-disclose (MTD) for revealing 80% of the key bits is demonstrated with respect to conventional time-domain correlation power analysis (CPA). Finally, the proposed method is successfully applied to a fully unrolled datapath for PRINCE and a parallel round-based datapath for the Advanced Encryption Standard (AES) algorithm to demonstrate its general applicability. © 2019 IEEE.

Keywords: Field programmable gate arrays (FPGA)
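
For orientation, the sketch below shows the conventional time-domain CPA baseline that this abstract compares against, in its simplest form: predict a Hamming-weight leakage for every key guess and pick the guess whose prediction correlates best with the measured traces. It is a toy under stated assumptions, not the paper's improved CFA attack or its 1-bit leakage model: the target is a single 4-bit S-box lookup (the PRESENT S-box is used purely as a convenient example), each "trace" is one simulated sample, and all function names and the noise model are hypothetical.

```python
import math
import random

# 4-bit S-box (PRESENT), used here only as a small illustrative target.
SBOX = [0xC, 0x5, 0x6, 0xB, 0x9, 0x0, 0xA, 0xD,
        0x3, 0xE, 0xF, 0x8, 0x4, 0x7, 0x1, 0x2]


def hamming_weight(value):
    """Number of set bits in a non-negative integer."""
    return bin(value).count("1")


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def simulate_traces(plaintexts, key, noise_sigma, seed=0):
    """Simulated single-sample traces: HW(SBOX[p ^ key]) plus Gaussian noise."""
    rng = random.Random(seed)
    return [hamming_weight(SBOX[p ^ key]) + rng.gauss(0.0, noise_sigma)
            for p in plaintexts]


def recover_key(plaintexts, traces):
    """Return the 4-bit key guess with the highest correlation to the traces."""
    scores = []
    for guess in range(16):
        predicted = [hamming_weight(SBOX[p ^ guess]) for p in plaintexts]
        scores.append(pearson(predicted, traces))
    return max(range(16), key=lambda g: scores[g])


if __name__ == "__main__":
    plaintexts = list(range(16)) * 8
    traces = simulate_traces(plaintexts, key=0xB, noise_sigma=0.5)
    print(hex(recover_key(plaintexts, traces)))
```

With real measurements each trace has many time samples and the correlation is computed per sample; the frequency-domain variant described in the abstract applies the same correlation step to spectra rather than to time samples.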
