咨询与建议

限定检索结果

文献类型

  • 12 篇 期刊文献
  • 1 篇 会议

馆藏范围

  • 13 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 13 篇 工学
    • 13 篇 计算机科学与技术...
    • 8 篇 电气工程
    • 1 篇 软件工程
  • 1 篇 理学
    • 1 篇 数学

主题

  • 13 篇 parallel and vec...
  • 3 篇 simd processors
  • 3 篇 algorithm design...
  • 2 篇 parallel algorit...
  • 2 篇 optimization
  • 2 篇 fast fourier tra...
  • 2 篇 poisson equation...
  • 2 篇 gpu
  • 1 篇 matrix factoriza...
  • 1 篇 parallel process...
  • 1 篇 reliability and ...
  • 1 篇 mathematical sof...
  • 1 篇 elliptic equatio...
  • 1 篇 public key crypt...
  • 1 篇 performance meas...
  • 1 篇 applicative (fun...
  • 1 篇 computational ne...
  • 1 篇 efficiency
  • 1 篇 brain modeling
  • 1 篇 cuda

机构

  • 2 篇 univ maryland de...
  • 1 篇 univ complutense...
  • 1 篇 inria neuromathc...
  • 1 篇 natl taiwan univ...
  • 1 篇 so polytech stat...
  • 1 篇 univ sharjah dep...
  • 1 篇 univ texas san a...
  • 1 篇 arm110 cambridge...
  • 1 篇 mcmaster univ de...
  • 1 篇 univ bristol dep...
  • 1 篇 saudi aramco r&d...
  • 1 篇 george washingto...
  • 1 篇 univ maryland de...
  • 1 篇 univ maryland in...
  • 1 篇 univ karlsruhe r...
  • 1 篇 nara inst sci & ...
  • 1 篇 nipissing univ d...

作者

  • 3 篇 jaja joseph
  • 3 篇 wu jing
  • 1 篇 shibata naoki
  • 1 篇 smart np
  • 1 篇 lin chih-jen
  • 1 篇 kahl wolfram
  • 1 篇 lo chia-tien dan
  • 1 篇 duval david j.
  • 1 篇 setoain javier
  • 1 篇 chin wei-sheng
  • 1 篇 timson mitchell ...
  • 1 篇 petrogalli franc...
  • 1 篇 baladron javier
  • 1 篇 el-moursy ali
  • 1 篇 psarris kleanthi...
  • 1 篇 balaras elias
  • 1 篇 anand christophe...
  • 1 篇 tenllado christi...
  • 1 篇 schönauer w
  • 1 篇 zhuang yong

语言

  • 13 篇 英文
检索条件"主题词=parallel and vector implementations"
13 条 记 录,以下是1-10 订阅
排序:
Three Applications of GPU Computing in Neuroscience
收藏 引用
COMPUTING IN SCIENCE & ENGINEERING 2012年 第3期14卷 40-47页
作者: Baladron, Javier Fasoli, Diego Faugeras, Olivier INRIA NeuroMathComp Grp NeuroMathComp Project Team Sophia Antipolis France
Three scenarios outlined here show the benefits of using a computer system with multiple GPUs in theoretical neuroscience. In each instance, it's clear that the GPU speedup considerably helps answer a scientific o... 详细信息
来源: 评论
parallel cryptographic arithmetic using a redundant Montgomery representation
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 2004年 第11期53卷 1474-1482页
作者: Page, D Smart, NP Univ Bristol Dept Comp Sci Bristol BS8 1UB Avon England
We describe how using a redundant Montgomery representation allows for high-performance SIMD-based implementations of RSA and elliptic curve cryptography. This is in addition to the known benefits of immunity from tim... 详细信息
来源: 评论
An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2014年 第3期25卷 550-559页
作者: Wu, Jing JaJa, Joseph Balaras, Elias Univ Maryland Dept Elect & Comp Engn Inst Adv Comp Studies College Pk MD 20742 USA George Washington Univ Dept Mech & Aerosp Engn Acad Ctr 720F Washington DC 20052 USA
A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is presented. Our algorithms carefully manage the multiple lay... 详细信息
来源: 评论
SLEEF: A Portable vectorized Library of C Standard Mathematical Functions
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2020年 第6期31卷 1316-1327页
作者: Shibata, Naoki Petrogalli, Francesco Nara Inst Sci & Technol Grad Sch Informat Sci Nara 6300192 Japan ARM110 Cambridge CB1 9NJ England
In this article, we present techniques used to implement our portable vectorized library of C standard mathematical functions written entirely in C language. In order to make the library portable while maintaining goo... 详细信息
来源: 评论
An Optimized Cell BE Special Function Library Generated by Coconut
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 2009年 第8期58卷 1126-1138页
作者: Anand, Christopher Kumar Kahl, Wolfram McMaster Univ Dept Comp & Software ITB202 Hamilton ON L8S 4K1 Canada
Coconut, a tool for developing high-assurance, high-performance kernels for scientific computing, contains an extensible domain-specific language (DSL) embedded in Haskell. The DSL supports interactive prototyping and... 详细信息
来源: 评论
Adaptive Particle Swarm Optimization with Heterogeneous Multicore parallelism
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2017年 第10期28卷 2784-2793页
作者: Wachowiak, Mark P. Timson, Mitchell C. DuVal, David J. Nipissing Univ Dept Comp Sci & Math North Bay ON P1B 8L7 Canada
Much progress has recently been made in global optimization, with particular attention devoted to robust nature-inspired stochastic methods for difficult, high-dimensional problems. This paper presents a computational... 详细信息
来源: 评论
Numerical engineering:: design of PDE black-box solvers
收藏 引用
MATHEMATICS AND COMPUTERS IN SIMULATION 2000年 第4-5期54卷 269-277页
作者: Schönauer, W Univ Karlsruhe Rech Zentrum D-76128 Karlsruhe Germany
The design of PDE black-box solvers (for nonlinear systems of elliptic and parabolic PDEs) needs many compromises between efficiency and robustness which we call 'Numerical Engineering'. The requirements for a... 详细信息
来源: 评论
parallel implementation of the 2D discrete wavelet transform on Graphics Processing Units:: Filter Bank versus Lifting
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2008年 第3期19卷 299-310页
作者: Tenllado, Christian Setoain, Javier Prieto, Manuel Pinuel, Luis Tirado, Francisco Univ Complutense Madrid Fac Ciencias Fis Dept Comp Architecture ArTeCS Grp E-28040 Madrid Spain
The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer systems. Several studies have compared the performance of the... 详细信息
来源: 评论
Accelerating Matrix Operations with Improved Deeply Pipelined vector Reduction
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2012年 第2期23卷 202-210页
作者: Tai, Yi-Gang Lo, Chia-Tien Dan Psarris, Kleanthis Univ Texas San Antonio Dept Comp Sci San Antonio TX 78249 USA So Polytech State Univ Dept Comp Sci & Software Engn Marietta GA 30060 USA
Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation. If the core operator of the reduction is deeply pipelined, which is usually the case, depende... 详细信息
来源: 评论
Optimized FFT computations on heterogeneous platforms with application to the Poisson equation
收藏 引用
JOURNAL OF parallel AND DISTRIBUTED COMPUTING 2014年 第8期74卷 2745-2756页
作者: Wu, Jing Jaja, Joseph Univ Maryland Dept Elect & Comp Engn College Pk MD 20742 USA Univ Maryland Inst Adv Comp Studies College Pk MD 20742 USA
We develop optimized multi-dimensional FFT implementations on CPU-GPU heterogeneous platforms for the case when the input is too large to fit on the GPU global memory, and use the resulting techniques to develop a fas... 详细信息
来源: 评论