咨询与建议

限定检索结果

文献类型

  • 12 篇 期刊文献
  • 1 篇 会议

馆藏范围

  • 13 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 13 篇 工学
    • 13 篇 计算机科学与技术...
    • 8 篇 电气工程
    • 1 篇 软件工程
  • 1 篇 理学
    • 1 篇 数学

主题

  • 13 篇 parallel and vec...
  • 3 篇 simd processors
  • 3 篇 algorithm design...
  • 2 篇 parallel algorit...
  • 2 篇 optimization
  • 2 篇 fast fourier tra...
  • 2 篇 poisson equation...
  • 2 篇 gpu
  • 1 篇 matrix factoriza...
  • 1 篇 parallel process...
  • 1 篇 reliability and ...
  • 1 篇 mathematical sof...
  • 1 篇 elliptic equatio...
  • 1 篇 public key crypt...
  • 1 篇 performance meas...
  • 1 篇 applicative (fun...
  • 1 篇 computational ne...
  • 1 篇 efficiency
  • 1 篇 brain modeling
  • 1 篇 cuda

机构

  • 2 篇 univ maryland de...
  • 1 篇 univ complutense...
  • 1 篇 inria neuromathc...
  • 1 篇 natl taiwan univ...
  • 1 篇 so polytech stat...
  • 1 篇 univ sharjah dep...
  • 1 篇 univ texas san a...
  • 1 篇 arm110 cambridge...
  • 1 篇 mcmaster univ de...
  • 1 篇 univ bristol dep...
  • 1 篇 saudi aramco r&d...
  • 1 篇 george washingto...
  • 1 篇 univ maryland de...
  • 1 篇 univ maryland in...
  • 1 篇 univ karlsruhe r...
  • 1 篇 nara inst sci & ...
  • 1 篇 nipissing univ d...

作者

  • 3 篇 jaja joseph
  • 3 篇 wu jing
  • 1 篇 shibata naoki
  • 1 篇 smart np
  • 1 篇 lin chih-jen
  • 1 篇 kahl wolfram
  • 1 篇 lo chia-tien dan
  • 1 篇 duval david j.
  • 1 篇 setoain javier
  • 1 篇 chin wei-sheng
  • 1 篇 timson mitchell ...
  • 1 篇 petrogalli franc...
  • 1 篇 baladron javier
  • 1 篇 el-moursy ali
  • 1 篇 psarris kleanthi...
  • 1 篇 balaras elias
  • 1 篇 anand christophe...
  • 1 篇 tenllado christi...
  • 1 篇 schönauer w
  • 1 篇 zhuang yong

语言

  • 13 篇 英文
检索条件"主题词=Parallel and Vector Implementations"
13 条 记 录,以下是1-10 订阅
排序:
SLEEF: A Portable vectorized Library of C Standard Mathematical Functions
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2020年 第6期31卷 1316-1327页
作者: Shibata, Naoki Petrogalli, Francesco Nara Inst Sci & Technol Grad Sch Informat Sci Nara 6300192 Japan ARM110 Cambridge CB1 9NJ England
In this article, we present techniques used to implement our portable vectorized library of C standard mathematical functions written entirely in C language. In order to make the library portable while maintaining goo... 详细信息
来源: 评论
Adaptive Particle Swarm Optimization with Heterogeneous Multicore parallelism
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2017年 第10期28卷 2784-2793页
作者: Wachowiak, Mark P. Timson, Mitchell C. DuVal, David J. Nipissing Univ Dept Comp Sci & Math North Bay ON P1B 8L7 Canada
Much progress has recently been made in global optimization, with particular attention devoted to robust nature-inspired stochastic methods for difficult, high-dimensional problems. This paper presents a computational... 详细信息
来源: 评论
A Fast parallel Stochastic Gradient Method for Matrix Factorization in Shared Memory Systems
收藏 引用
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY 2015年 第1期6卷 2-2页
作者: Chin, Wei-Sheng Zhuang, Yong Juan, Yu-Chin Lin, Chih-Jen Natl Taiwan Univ Dept Comp Sci Taipei Taiwan
Matrix factorization is known to be an effective method for recommender systems that are given only the ratings from users to items. Currently, stochastic gradient (SG) method is one of the most popular algorithms for... 详细信息
来源: 评论
An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2014年 第3期25卷 550-559页
作者: Wu, Jing JaJa, Joseph Balaras, Elias Univ Maryland Dept Elect & Comp Engn Inst Adv Comp Studies College Pk MD 20742 USA George Washington Univ Dept Mech & Aerosp Engn Acad Ctr 720F Washington DC 20052 USA
A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is presented. Our algorithms carefully manage the multiple lay... 详细信息
来源: 评论
Optimized FFT computations on heterogeneous platforms with application to the Poisson equation
收藏 引用
JOURNAL OF parallel AND DISTRIBUTED COMPUTING 2014年 第8期74卷 2745-2756页
作者: Wu, Jing Jaja, Joseph Univ Maryland Dept Elect & Comp Engn College Pk MD 20742 USA Univ Maryland Inst Adv Comp Studies College Pk MD 20742 USA
We develop optimized multi-dimensional FFT implementations on CPU-GPU heterogeneous platforms for the case when the input is too large to fit on the GPU global memory, and use the resulting techniques to develop a fas... 详细信息
来源: 评论
Performance evaluation and comparison of parallel conjugate gradient on modern multi-core accelerator and massively parallel systems
收藏 引用
INTERNATIONAL JOURNAL OF parallel EMERGENT AND DISTRIBUTED SYSTEMS 2014年 第1期29卷 38-67页
作者: Sibai, Fadi N. El-Moursy, Ali Saudi Aramco R&D Ctr Dhahran 31311 Saudi Arabia Univ Sharjah Dept Elect & Comp Engn Sharjah U Arab Emirates
Two parallel computer paradigms available today are multi-core accelerators such as the Sony, Toshiba and IBM Cell or Graphics Processing Unit (GPUs), and massively parallel message-passing machines such as the IBM Bl... 详细信息
来源: 评论
High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform
High Performance FFT Based Poisson Solver on a CPU-GPU Heter...
收藏 引用
IEEE 27th International parallel and Distributed Processing Symposium (IPDPS)
作者: Wu, Jing JaJa, Joseph Univ Maryland Dept Elect & Comp Engn College Pk MD 20742 USA
We develop an optimized FFT based Poisson solver on a CPU-GPU heterogeneous platform for the case when the input is too large to fit on the GPU global memory. The solver involves memory bound computations such as 3D F... 详细信息
来源: 评论
Three Applications of GPU Computing in Neuroscience
收藏 引用
COMPUTING IN SCIENCE & ENGINEERING 2012年 第3期14卷 40-47页
作者: Baladron, Javier Fasoli, Diego Faugeras, Olivier INRIA NeuroMathComp Grp NeuroMathComp Project Team Sophia Antipolis France
Three scenarios outlined here show the benefits of using a computer system with multiple GPUs in theoretical neuroscience. In each instance, it's clear that the GPU speedup considerably helps answer a scientific o... 详细信息
来源: 评论
Accelerating Matrix Operations with Improved Deeply Pipelined vector Reduction
收藏 引用
IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2012年 第2期23卷 202-210页
作者: Tai, Yi-Gang Lo, Chia-Tien Dan Psarris, Kleanthis Univ Texas San Antonio Dept Comp Sci San Antonio TX 78249 USA So Polytech State Univ Dept Comp Sci & Software Engn Marietta GA 30060 USA
Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation. If the core operator of the reduction is deeply pipelined, which is usually the case, depende... 详细信息
来源: 评论
An Optimized Cell BE Special Function Library Generated by Coconut
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 2009年 第8期58卷 1126-1138页
作者: Anand, Christopher Kumar Kahl, Wolfram McMaster Univ Dept Comp & Software ITB202 Hamilton ON L8S 4K1 Canada
Coconut, a tool for developing high-assurance, high-performance kernels for scientific computing, contains an extensible domain-specific language (DSL) embedded in Haskell. The DSL supports interactive prototyping and... 详细信息
来源: 评论