Document type

  • 5 conference papers
  • 2 journal articles

Holdings

  • 7 electronic documents
  • 0 print holdings

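Subject classification

  • 7 papers: Engineering
    • 4 papers: Electrical Engineering
    • 4 papers: Computer Science and Technology...
    • 2 papers: Information and Communication Engineering
    • 2 papers: Software Engineering
    • 1 paper: Materials Science and Engineering (...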

Subject terms

  • 7 papers: cuda programming...
  • 2 papers: parallel archite...
  • 2 papers: kernel
  • 2 papers: graphics process...
  • 1 paper: unified memory
  • 1 paper: graphics program...
  • 1 paper: cache storage
  • 1 paper: gpu based simula...
  • 1 paper: routing
  • 1 paper: affine shape fun...
  • 1 paper: approximation al...
  • 1 paper: gpu friendly alg...
  • 1 paper: gpu kernels
  • 1 paper: isogeometric dis...
  • 1 paper: thread-block
  • 1 paper: threads
  • 1 paper: bufferless inter...
  • 1 paper: integration
  • 1 paper: parallel program...
  • 1 paper: distributed cach...

Institution

  • 1 paper: indian inst tech...
  • 1 paper: univ illinois de...
  • 1 paper: luoyang elect eq...
  • 1 paper: agh univ sci & t...
  • 1 paper: indian inst tech...
  • 1 paper: univ belgrade sc...
  • 1 paper: photogauge india...
  • 1 paper: indian inst tech...
  • 1 paper: nvidia
  • 1 paper: shanghai jiao to...
  • 1 paper: cracow univ tech...
  • 1 paper: tokyo inst techn...

Author

  • 1 paper: papakonstantinou...
  • 1 paper: see simon
  • 1 paper: nasre rupesh
  • 1 paper: hwu wen-mei w.
  • 1 paper: chen yancang
  • 1 paper: bielanski jan
  • 1 paper: sahu aryabarna
  • 1 paper: cui xuewen
  • 1 paper: du jing
  • 1 paper: subramanian sank...
  • 1 paper: gururaj karthik
  • 1 paper: kruzel filip
  • 1 paper: kumar navin
  • 1 paper: cong jason
  • 1 paper: sui sai
  • 1 paper: thiagu mullai
  • 1 paper: wei pei
  • 1 paper: chen deming
  • 1 paper: durdevic dorde m...
  • 1 paper: banas krzysztof

Language

  • 7 papers: English
Search query: "Subject term = CUDA programming model"
7 records found
High-speed, two-dimensional digital image correlation algorithm using heterogeneous (CPU-GPU) framework
STRAIN, 2020, vol. 56, no. 3, article e12342
Authors: Thiagu, Mullai; Subramanian, Sankara J.; Nasre, Rupesh
Affiliations: Indian Inst Technol Madras, Dept Engn Design, Chennai, Tamil Nadu, India; PhotoGAUGE India Private Ltd, Chennai, Tamil Nadu, India; Indian Inst Technol Madras, Dept Comp Sci & Engn, Chennai 600036, Tamil Nadu, India
Two-dimensional digital image correlation (2D-DIC) is an experimental technique used to measure in-plane displacement of a test specimen. Real-time measurement of full-field displacement data is challenging due to eno...
Optimal Kernel Design for Finite-Element Numerical Integration on GPUs
COMPUTING IN SCIENCE & ENGINEERING, 2020, vol. 22, no. 6, pp. 61-74
Authors: Banas, Krzysztof; Kruzel, Filip; Bielanski, Jan
Affiliations: AGH Univ Sci & Technol, Krakow, Poland; Cracow Univ Technol, Inst Comp Sci, Krakow, Poland
This article presents the design and optimization of the GPU kernels for numerical integration, as it is applied in the standard form in finite-element codes. The optimization process employs autotuning, with the main...
An Evaluation of Unified Memory Technology on NVIDIA GPUs
2015 15th IEEE ACM International Symposium on Cluster Cloud and Grid Computing (CCGrid 2015)
Authors: Li, Wenqiang; Jin, Guanghao; Cui, Xuewen; See, Simon
Affiliations: Shanghai Jiao Tong Univ, Ctr High Performance Comp, Shanghai 200030, Peoples R China; Tokyo Inst Technol, Tokyo, Japan; NVIDIA, Singapore, Singapore
Unified Memory is an emerging technology which is supported by CUDA 6.X. Before CUDA 6.X, the existing CUDA programming model relies on programmers to explicitly manage data between CPU and GPU and hence increases pro...
The Research on Parallel Optimization of SAR Imaging R-D Algorithm Based on CUDA
10th International Conference on Communication Software and Networks (ICCSN)
Authors: Wei, Pei; Du, Jing; Sui, Sai; Chen, Yancang
Affiliations: Luoyang Elect Equipment Test Ctr, Luoyang, Peoples R China
Synthetic Aperture Radar (SAR) imaging technology is widely used in fields such as remote sensing observation and navigation positioning. SAR imaging involves large data volumes and long run times. Based on th...
High-Performance CUDA Kernel Execution on FPGAs
ACM SIGARCH International Conference on Supercomputing
Authors: Papakonstantinou, Alexandros; Gururaj, Karthik; Stratton, John A.; Chen, Deming; Cong, Jason; Hwu, Wen-Mei W.
Affiliations: Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801, USA
In this work, we propose a new FPGA design flow that combines the CUDA programming model from Nvidia with the state-of-the-art high-level synthesis tool AutoPilot from AutoESL, to efficiently map the exposed paralleli...
Fast CUDA-based Codec for Height Fields
21st Telecommunications Forum (TELFOR)
Authors: Durdevic, Dorde M.; Tartalja, Igor I.
Affiliations: Univ Belgrade, Sch Elect Engn, Belgrade 11120, Serbia
Following the advances in remote sensing technology in the last decade, the horizontal and vertical scan resolutions for digital terrains have reached the order of a meter and decimeter, respectively. At these resolut...
DDGSim: GPU Based Simulator for Large Multicore with Bufferless NoC
11th Annual IEEE India Conference (INDICON)
Authors: Kumar, Navin; Sahu, Aryabarna
Affiliations: Indian Inst Technol Guwahati, Dept Comp Sci & Engn, Gauhati 781039, Assam, India
In large-scale chip multicores, last-level cache management and the core interconnection network play important roles in performance and power consumption. In large-scale chip multicores, the mesh interconnect is used wide...