Search Results - Inner Mongolia University Library

IEEE 11th International Conference on Communications (COMM)

Authors: Zoican, Sorin; Vochin, Marius. Affiliation: Univ Politehn Bucuresti, Bucharest, Romania

ISBN: (print) 9781467381970

In this paper, we investigate computing systems and network architectures dedicated to high-frequency trading applications and evaluate their performance. Both high processing speed and low network latency are important for high-frequency traders. The financial market literature suggests, however, that extremely high speeds discourage other traders from participating in the market, thereby harming the quality of financial markets. We find that existing medium-cost technology is enough to support an optimal trading speed, and we therefore postulate that further investment in low-latency technology is inefficient from both a technical and an economic point of view.

Keywords: computer unified device architecture high frequency trading algorithms network latency

Fast method of constructing image correlations to build a free network based on image multivocabulary trees

JOURNAL OF ELECTRONIC IMAGING, 2015, vol. 24, no. 3, pp. 1-12

Authors: Zhan, Zongqian; Wang, Xin; Wei, Minglu. Affiliation: Wuhan Univ, Sch Geodesy & Geomat, Wuhan 430072, Peoples R China

In image-based three-dimensional (3-D) reconstruction, one topic of growing importance is how to quickly obtain a 3-D model from a large number of images. The retrieval of the correct and relevant images for the model poses a considerable technological challenge. The "image vocabulary tree" has been proposed as a method to search for similar images. However, a significant drawback of this approach is identified in its low time efficiency and barely satisfactory classification result. The method proposed is inspired by, and improves upon, some recent methods. Specifically, vocabulary quality is considered and multivocabulary trees are designed to improve the classification result. A marked improvement was, indeed, observed in our evaluation of the proposed method. To improve time efficiency, graphics processing unit (GPU) computer unified device architecture parallel computation is applied in the multivocabulary trees. The results of the experiments showed that the GPU was three to four times more efficient than the enumeration matching and CPU methods when the number of images is large. This paper presents a reliable reference method for the rapid construction of a free network to be used for the computing of 3-D information. (C) 2015 SPIE and IS&T

Keywords: vocabulary tree vocabulary quality multivocabulary trees graphics processing unit computer unified device architecture free network

A Fast MHD Code for Gravitationally Stratified Media using Graphical Processing Units: SMAUG

JOURNAL OF ASTROPHYSICS AND ASTRONOMY, 2015, vol. 36, no. 1, pp. 197-223

Authors: Griffiths, M. K.; Fedun, V.; Erdelyi, R. Affiliations: Univ Sheffield, Corp Informat & Comp Serv, Sheffield S10 2FN, S Yorkshire, England; Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield S1 3JD, S Yorkshire, England; Univ Sheffield, Sch Math & Stat, SP2RC, Sheffield S7 3RH, S Yorkshire, England

Parallelization techniques have been exploited most successfully by the gaming/graphics industry with the adoption of graphical processing units (GPUs), possessing hundreds of processor cores. The opportunity has been recognized by the computational sciences and engineering communities, who have recently harnessed successfully the numerical performance of GPUs. For example, parallel magnetohydrodynamic (MHD) algorithms are important for numerical modelling of highly inhomogeneous solar, astrophysical and geophysical plasmas. Here, we describe the implementation of SMAUG, the Sheffield Magnetohydrodynamics Algorithm Using GPUs. SMAUG is a 1-3D MHD code capable of modelling magnetized and gravitationally stratified plasma. The objective of this paper is to present the numerical methods and techniques used for porting the code to this novel and highly parallel compute architecture. The methods employed are justified by the performance benchmarks and validation results demonstrating that the code successfully simulates the physics for a range of test scenarios including a full 3D realistic model of wave propagation in the solar atmosphere.

Keywords: Numerical simulations magnetohydrodynamics computer unified device architecture graphical processing units NVIDIA Sheffield advanced code the Sheffield magnetohydrodynamics algorithm using GPUs versatile advection code

GPU Implementation of a Deformable 3D Image Registration Algorithm

33rd Annual International Conference of the IEEE Engineering-in-Medicine-and-Biology-Society (EMBS)

Authors: Mousazadeh, Hamed; Marami, Bahram; Sirouspour, Shahin; Patriciu, Alexandru. Affiliations: McMaster Univ, Dept Elect & Comp Engn, Hamilton ON L8S 4L8, Canada; McMaster Univ, Sch Biomed Engn, Hamilton ON L8S 4L8, Canada

ISBN: (print) 9781424441228

We present a parallel implementation of a new deformable image registration algorithm using the computer unified device architecture (CUDA). The algorithm co-registers preoperative and intraoperative 3-dimensional magnetic resonance (MR) images of a deforming organ. It employs a linear elastic dynamic finite-element model of the deformation and distance measures such as mutual information and sum of squared differences to align volumetric image data sets. Computationally intensive elements of the method such as interpolation, displacement and force calculation are significantly accelerated using a Graphics Processing Unit (GPU). The result of experiments carried out with a realistic breast phantom tissue shows a 37-fold speedup for the GPU-based implementation compared with an optimized CPU-based implementation in high resolution MR image registration. The GPU implementation is capable of registering 512x512x136 image sets in just over 2 seconds, making it suitable for clinical applications requiring fast and accurate processing of medical images.

Keywords: 3D magnetic resonance images CUDA framework computer unified device architecture Equations Finite element methods GPU implementation Graphics Processing Unit Graphics processing unit Image registration Mathematical model Three dimensional displays Vectors

GPU Implementation of Spiking Neural Networks for Color Image Segmentation

2011 4th International Congress on Image and Signal Processing (CISP 2011)

Authors: Martin McGinnity; QingXiang Wu; Ermai Xie; Jianyong Cai; Rontai Cai. Affiliations: Intelligent Systems Research Center, University of Ulster at Magee, Londonderry BT48 7JL, Northern Ir; School of Physics and OptoElectronics Technology, Fujian Normal University, Fuzhou 350007, China

Spiking neural networks (SNN) are a powerful computational model, inspired by the human neural system, that engineers and neuroscientists use to simulate intelligent computation of the brain. Inspired by the visual system, various spiking neural network models have been used to process visual images. However, it is time-consuming to simulate a large number of spiking neurons in the networks using CPU programming. Spiking neural networks inherit an intrinsically parallel mechanism from biological systems, so a massively parallel implementation technology is required to simulate them. To address this issue, modern Graphics Processing Units (GPUs), which have a parallel array of streaming multiprocessors and allow many thousands of lightweight threads to run, are proposed and shown to be a pertinent solution. This paper presents an approach for implementing an SNN model that performs color image segmentation on a GPU. This approach is then compared with an equivalent implementation on an Intel Xeon CPU. The results show that the GPU approach is 31 times faster than the CPU implementation.

Keywords: graphic processing units spiking neural network computer unified device architecture color image segmentation

Massive Parallel LDPC Decoding on GPU

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 08)

Authors: Falcao, Gabriel; Sousa, Leonel; Silva, Vitor. Affiliation: Univ Coimbra, Inst Telecomunicacoes, Dep Elect & Comp Eng, P-3000 Coimbra, Portugal

ISBN: (print) 9781595939609

Low-Density Parity-Check (LDPC) codes are powerful error correcting codes (ECC). They have recently been adopted by several data communication standards such as DVB-S2 and WiMax. LDPCs are represented by bipartite graphs, also called Tanner graphs, and their decoding demands very intensive computation. For that reason, VLSI dedicated architectures have been investigated and developed over the last few years. This paper proposes a new approach for LDPC decoding on graphics processing units (GPUs). Efficient data structures and a new algorithm are proposed to represent the Tanner graph and to perform LDPC decoding according to the stream-based computing model. GPUs were programmed to efficiently implement the proposed algorithms by applying data-parallel intensive computing. Experimental results show that GPUs perform LDPC decoding nearly three orders of magnitude faster than modern CPUs. Moreover, they lead to the conclusion that GPUs with their tremendous processing power can be considered as a consistent alternative to state-of-the-art hardware LDPC decoders.

Keywords: Parallel processing Graphics Processing Unit Low-Density Parity-Check codes LDPC computer unified device architecture CUDA
