An efficient parallel processing method for deblocking filter design in the H.264 video coding standard is presented in this paper. In order to reduce memory references and allow intermediate data to be reused as soon as possible, an advanced filtering order is adopted, and read/write operations on external memory are executed in parallel with the filtering computation. Furthermore, a preloading operation is used to reduce the complexity of the memory structure, and a vertical MB processing order is used to improve the efficiency of intermediate data reuse. As a result, the processing cycles of the proposed architecture with a single-port memory are reduced by 80.5% compared with the most advanced architecture among previous proposals.
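The key idea of overlapping external-memory access with filtering can be illustrated by a minimal software sketch using double buffering; the block names and sizes below are illustrative only and do not reproduce the paper's filtering order or hardware scheduling.

    from concurrent.futures import ThreadPoolExecutor

    NUM_BLOCKS = 8  # number of macroblocks to process (illustrative)

    def fetch_block(i):
        # stand-in for a DMA read of macroblock i from external memory
        return [i] * 16

    def filter_block(block):
        # stand-in for the deblocking-filter computation on one macroblock
        return [p + 1 for p in block]

    with ThreadPoolExecutor(max_workers=1) as prefetcher:
        pending = prefetcher.submit(fetch_block, 0)   # preload the first block
        for i in range(NUM_BLOCKS):
            current = pending.result()                # block i is now on chip
            if i + 1 < NUM_BLOCKS:
                # overlap the next external-memory read with the computation
                pending = prefetcher.submit(fetch_block, i + 1)
            filter_block(current)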
This work presents a SystemC-based design of custom SIMD instructions for accelerating media and telecom codes on a next-generation configurable, extensible processor. The SS_SPARC processing platform incorporates a generic vector unit which can be extended with pipelined SIMD computation units (datapaths) designed either with established (RTL-based) or, in this case, hybrid (SystemC-RTL) methodologies. This work elaborates on a custom methodology for automatically encapsulating the data-parallel sections of the MPEG-4 XviD, G723.1 and G729A reference codes into a SystemC wrapper, which is subsequently synthesized to RTL with a commercial SystemC-synthesis tool. The resulting RTL is then attached to the exposed vector unit of the SS_SPARC engine. We present results from a standard-cell RTL synthesis campaign and the VLSI implementation of a high-end (8-context, 256-bit) and a low-end (2-context, 128-bit) configuration of the vector engine for the workloads of interest.
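As a rough illustration of the kind of data-parallel section such a flow targets, the sketch below computes a 16x16 sum of absolute differences, the dominant kernel in XviD-style motion estimation; the function name and array shapes are assumptions, and the paper's actual wrapper generation is not shown.

    import numpy as np

    def sad_16x16(cur, ref):
        # 16x16 sum of absolute differences; a SIMD datapath would process
        # the 16 pixels of each row in a single vector operation.
        return int(np.abs(cur.astype(np.int16) - ref.astype(np.int16)).sum())

    cur = np.random.randint(0, 256, (16, 16), dtype=np.uint8)
    ref = np.random.randint(0, 256, (16, 16), dtype=np.uint8)
    print(sad_16x16(cur, ref))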
Combining the presented MIMO scheme with multiuser detectors for the uplink suffers from the problems of high computational complexity and channel estimation. Therefore, in this paper we propose a MIMO multiuser detection (MUD) scheme that considerably reduces the system's computational complexity. The proposed algorithm adopts the inverse channel matrix for MIMO decoding, which is not sensitive to the coherency of the channels. Because of the scattering characteristic of the MIMO channel, the inverse channel matrices are always nonsingular, which ensures that the receivers obtain a stable spatial diversity gain. The MUD algorithm can be realized using a parallel modular structure. It is based on a Minimum Mean Square Error (MMSE) criterion. Simulation results show that our MIMO-MUD performs much better than the previously presented MIMO-MUD for the same order of complexity, even though the MIMO CDMA system has only two antennas at each BS and two antennas at each mobile station.
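For reference, a generic linear MMSE detector for a 2x2 channel is sketched below; the variable names are illustrative and the paper's joint MIMO-MUD structure and inverse-channel-matrix decoding are not reproduced.

    import numpy as np

    def mmse_detect(H, y, noise_var):
        # Linear MMSE estimate: x_hat = (H^H H + sigma^2 I)^(-1) H^H y
        HH = H.conj().T
        W = np.linalg.inv(HH @ H + noise_var * np.eye(H.shape[1])) @ HH
        return W @ y

    # 2x2 MIMO example: two transmit and two receive antennas
    H = (np.random.randn(2, 2) + 1j * np.random.randn(2, 2)) / np.sqrt(2)
    x = np.array([1 + 1j, -1 - 1j]) / np.sqrt(2)       # transmitted symbols
    n = 0.05 * (np.random.randn(2) + 1j * np.random.randn(2))
    y = H @ x + n
    print(mmse_detect(H, y, noise_var=0.005))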
This paper presents an improved word-level sequential scheme and parallel architecture for the bit-plane coding of EBCOT used in JPEG 2000. The bit-plane coding adopted by EBCOT is divided into two stages, coding pass prediction and context formation, which work in parallel and are pipelined. Word-level, sequential bit-plane coding is achieved in which the coefficient bits in different bit planes are modelled concurrently and all three coding passes included in each bit plane are completed in a single scan. The results demonstrate that the proposed architecture can efficiently reduce hardware complexity compared with the most up-to-date designs.
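The bit-plane decomposition that the coding passes operate on can be sketched as follows; the coefficient values are arbitrary, and the three EBCOT passes themselves (significance propagation, magnitude refinement, cleanup) are only referenced in the comment, not implemented.

    import numpy as np

    coeffs = np.array([[13, -5, 0, 7],
                       [ 2, -9, 4, 0]])          # arbitrary wavelet coefficients
    mags = np.abs(coeffs)
    msb = int(mags.max()).bit_length() - 1

    # Walk the bit planes from most to least significant; in EBCOT each plane
    # is coded by three passes (significance propagation, magnitude refinement,
    # cleanup), which the proposed architecture completes in a single scan.
    for p in range(msb, -1, -1):
        print("bit plane", p)
        print((mags >> p) & 1)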
State space explosion is the main obstacle for model checking concurrent programs. Among the solutions, partial-order reduction (POR), especially dynamic partial-order reduction (DPOR) [1], is one of the promising app...
ISBN: 0769525091 (print)
A semi-dynamic system is presented that is capable of predicting the performance of parallel programs at runtime. The functionality provided by the system allows for efficient handling of the portability and irregularity of parallel programs. Two forms of parallelism are addressed: loop-level parallelism and task-level parallelism.
ISBN: 0769525091 (print)
A novel extension to external double hashing providing a significant reduction in both successful and unsuccessful search lengths is presented. The experimental and analytical results demonstrate the reductions possible. This method does not restrict the hash table configuration parameters and uses very little additional storage space per bucket. The runtime cost of insertion is only slightly higher than for ordinary external double hashing.
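For context, ordinary external double hashing probes bucket (h1(k) + i*h2(k)) mod m on the i-th attempt; the sketch below uses stand-in hash functions and does not implement the paper's extension.

    import hashlib

    def _h(key, salt):
        # stand-in hash; real implementations would use tuned functions
        digest = hashlib.sha256((salt + key).encode()).digest()
        return int.from_bytes(digest[:8], "big")

    def probe_sequence(key, table_size, max_probes):
        # the i-th probe visits bucket (h1 + i * h2) mod table_size; h2 != 0
        h1 = _h(key, "a") % table_size
        h2 = 1 + _h(key, "b") % (table_size - 1)
        return [(h1 + i * h2) % table_size for i in range(max_probes)]

    print(probe_sequence("record-42", table_size=11, max_probes=5))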
ISBN: 0769525091 (print)
Within the parallel computing domain, field programmable gate arrays (FPGAs) are no longer restricted to their traditional role as substitutes for application-specific integrated circuits, that is, as hardware "hidden" from the end user. Several high-performance computing vendors offer parallel reconfigurable computers employing user-programmable FPGAs. These exciting new architectures allow end users to, in effect, create reconfigurable coprocessors targeting the computationally intensive parts of each problem. The increased capability of contemporary FPGAs, coupled with the embarrassingly parallel nature of the Jacobi iterative method, makes the Jacobi method an ideal candidate for hardware acceleration. This paper introduces a parameterized design for a deeply pipelined, highly parallelized IEEE 64-bit floating-point version of the Jacobi method. A Jacobi circuit is implemented using a Xilinx Virtex-II Pro as the target FPGA device. Implementation statistics and performance estimates are presented.
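The Jacobi iteration itself is simple and, because every component update is independent, maps naturally to a parallel pipeline; a minimal software version is sketched below (the matrix is an arbitrary diagonally dominant example, not from the paper).

    import numpy as np

    def jacobi(A, b, iters=50):
        # x_i(k+1) = (b_i - sum_{j != i} A_ij * x_j(k)) / A_ii
        # each component update depends only on the previous iterate,
        # so all rows can be evaluated in parallel
        D = np.diag(A)
        R = A - np.diag(D)
        x = np.zeros_like(b, dtype=float)
        for _ in range(iters):
            x = (b - R @ x) / D
        return x

    A = np.array([[4.0, 1.0], [2.0, 5.0]])   # diagonally dominant example
    b = np.array([9.0, 12.0])
    print(jacobi(A, b))                      # approaches the solution of Ax = b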
ISBN: 0769525091 (print)
A chip multiprocessor is one of the promising architectures that can overcome the ILP limitation, high power consumption and high heat dissipation that current processors face. On a shared-memory multiprocessor, a performance improvement relies on an efficient method of communication and synchronization via shared variables. The TSVM cache combines communication and synchronization with coherence maintenance on a chip multiprocessor; that is, communication and synchronization via shared variables are realized by one coherence transaction through a high-speed on-chip interconnection. The TSVM cache provides several instructions, each of which has its own coherence maintenance scheme. Combinations of these instructions can realize producer-consumer synchronization, mutual exclusion and barrier synchronization with communication easily and systematically. This paper describes how those instructions construct three primitives and shows the effect of these primitives using a clock-cycle-accurate simulator written in VHDL. The results show that the TSVM cache improves performance by a factor of 9.8 compared with a traditional cache memory, and by a factor of 2 compared with a conventional cache memory with a synchronization mechanism.
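As a software analogy of producer-consumer synchronization through a shared variable, consider the sketch below; in the TSVM cache the flag update and data transfer would be carried by a single coherence transaction rather than by OS-level threading primitives, which are used here purely for illustration.

    import threading

    data = None
    ready = threading.Event()

    def producer():
        global data
        data = 42          # write the shared value
        ready.set()        # signal the consumers

    def consumer(name):
        ready.wait()       # block until the producer has written
        print(name, "read", data)

    consumers = [threading.Thread(target=consumer, args=(f"c{i}",)) for i in range(2)]
    for t in consumers:
        t.start()
    threading.Thread(target=producer).start()
    for t in consumers:
        t.join()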
ISBN: 0769525091 (print)
The amount of Task-Level Parallelism (TLP) in a runtime workload is useful information for determining the efficient usage of multiprocessors. This paper presents mechanisms to dynamically estimate the amount of TLP in runtime workloads. Modifications are added to the operating system (OS) to collect information about processor utilization and task activities, from which TLP can be calculated. By effectively utilizing the Time Stamp Counter (TSC) hardware, task activities can be monitored at fine time resolution, resulting in the capability to estimate TLP at fine granularity. We implement the mechanisms in a recent version of the Linux OS. Evaluation results indicate that the mechanisms can estimate TLP accurately for various kinds of workloads with small overheads.
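One commonly used definition of TLP averages the number of simultaneously busy processors over the intervals in which at least one processor is busy; the paper's exact formula may differ, so the sketch below is only illustrative.

    def tlp(samples):
        # samples: number of busy processors (or runnable tasks) observed at
        # equally spaced, TSC-timed sampling points; intervals where the
        # machine is completely idle are excluded from the average
        busy = [s for s in samples if s > 0]
        return sum(busy) / len(busy) if busy else 0.0

    print(tlp([0, 1, 2, 2, 0, 3, 1, 0]))   # -> 1.8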