检索结果-内蒙古大学图书馆

陀螺特征值问题的并行子空间迭代法

引用

高等学校计算数学学报 2010年第2期32卷 179-184页

作者：王顺绪淮海工学院理学院

1引言陀螺系统特征值问题是转子动力学中的基本问题,是一类特殊的二次特征值问题.假设M和K是n阶对称矩阵,C是n阶反对称矩阵,则二次特征值问题（λ;M+λC+K）x=0（1）

关键词： quadratic eigenvalue problem gyroscopic system subspace iterative method parallel computing multi-core computing

来源：评论

学校读者我要写书评

暂无评论

Exploiting ILP, TLP, and DLP to Improve multi-core Performance of One-Sided Jacobi SVD

引用

PARALLEL PROCESSING LETTERS 2009年第2期19卷 355-375页

作者： Soliman, Mostafa I. South Valley Univ Aswan Fac Engn Elect Engn Dept Comp & Syst Sect Aswan 81542 Egypt

This paper shows how the performance of singular value decomposition (SVD) is enhanced through the exploitation of ILP, TLP, and DLP on Intel multi-core processors using superscalar execution, multi-threading computation, and streaming SIMD extensions, respectively. To facilitate the exploitation of TLP on multiple execution cores, the well-known cyclic one-sided Jacobi algorithm is restructured to work in parallel. On two dual-core Intel Xeon processors with hyper-threading technology running at 3.0 GHz, our results show that the multi-threaded implementation of one-sided Jacobi SVD gives about four times faster than the single-threaded superscalar implementation. Furthermore, the multi-threaded SIMD implementation speeds up the execution of single threaded one-sided Jacobi by a factor of 10, which is close to the ideal speedup. On a reasonable large matrix size fitted in the L2 cache, our results show a performance of 11 kA w GFLOPS (double-precision) is achieved on the target system through the exploitation cr, of ILP, TLP, and DLP as well as memory hierarchy.

关键词： multi-core computing multi-threading techniques ILP TLP DLP SVD one-sided Jacobi block algorithms high-performance computing performance evaluation

来源：评论

学校读者我要写书评

暂无评论

Parallel video surveillance on the multi-core cell broadband engine

Parallel video surveillance on the multi-core cell broadband...

引用

2nd International Joint Conference on Computational Sciences and Optimization (CSO)

作者： Rabie, Tamer Kidwai, Hashir Karim Sibai, Fadi N. United Arab Emirates Univ Coll Informat Technol Al Ain 17551 U Arab Emirates

ISBN: (纸本)9780769536057

The IBM Cell Broadband Engine (BE) is a multi-core processor with a PowerPC host processor (PPE) and 8 synergic processor engines (SPEs). The Cell BE architecture is designed to improve upon conventional processors in terms of memory latency bandwidth and power computation. In this paper, we discuss the parallelization, implementation and performance of a video surveillance application on the IBM Cell BE. We report the Video surveillance application's performance measured on a computer with one Cell processor and with varying numbers of synergic processor engines enabled. These results were compared to the results obtained on the Cell's single PPE with all 8 SPEs disabled The results indicate that our video surveillance application performs approximately 16 times faster on the Cell BE than modern RISC processors by processing input data from five separate surveillance video streams in parallel.

关键词： Cell Broadband Engine multi-core computing Video Surveillance

来源：评论

学校读者我要写书评

暂无评论

A high-performance face detection system using OpenMP

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2009年第15期21卷 1819-1837页

作者： Hadjidoukas, P. E. Dimakopoulos, V. V. Delakis, M. Garcia, C. Univ Ioannina Dept Comp Sci GR-45110 Ioannina Greece Orange Labs F-35512 Rennes France

We present the development of a novel high-performance face detection system using a neural network-based classification algorithm and an efficient parallelization with OpenMP. We discuss the design of the system in detail along with experimental assessment. Our parallelization strategy starts with one level of threads and moves to the exploitation of nested parallel regions in order to further improve, by up to 19%, the image-processing capability. The presented system is able to process images in real time (38 images/sec) by sustaining almost linear speedups on a system with a quad-core processor and a particular OpenMP runtime library. Copyright (C) 2009 John Wiley & Sons, Ltd.

关键词： face detection image processing nested parallelism OpenMP multi-core computing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：