咨询与建议

限定检索结果

文献类型

  • 6 篇 期刊文献
  • 4 篇 会议

馆藏范围

  • 10 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 10 篇 工学
    • 9 篇 计算机科学与技术...
    • 3 篇 软件工程
    • 2 篇 电气工程
  • 1 篇 理学
    • 1 篇 物理学

主题

  • 10 篇 simd extensions
  • 2 篇 octree
  • 2 篇 dynamic scenes
  • 2 篇 ray tracing
  • 2 篇 binary partition
  • 1 篇 mimetic finite d...
  • 1 篇 loop vectorizati...
  • 1 篇 data level paral...
  • 1 篇 nested if-statem...
  • 1 篇 fdtd
  • 1 篇 compiler optimiz...
  • 1 篇 masked instructi...
  • 1 篇 if-select transf...
  • 1 篇 vectorization me...
  • 1 篇 matrix blocking
  • 1 篇 neural networks
  • 1 篇 lifting scheme
  • 1 篇 acoustic waves
  • 1 篇 loop unswitching
  • 1 篇 multi-threaded t...

机构

  • 2 篇 univ politecn ca...
  • 2 篇 univ munster mun...
  • 2 篇 ipn ctr invest &...
  • 2 篇 univ nacl autono...
  • 2 篇 natl digital swi...
  • 2 篇 intel guadalajar...
  • 1 篇 cent univ venezu...
  • 1 篇 univ alicante in...
  • 1 篇 cent univ venezu...
  • 1 篇 delft univ techn...
  • 1 篇 univ ulm dept ne...
  • 1 篇 barcelona superc...
  • 1 篇 univ alicante dp...
  • 1 篇 univ politecn ca...
  • 1 篇 dept fis ingn si...
  • 1 篇 univ alicante in...
  • 1 篇 south valley uni...
  • 1 篇 pla informat eng...

作者

  • 3 篇 zhao rongcai
  • 2 篇 garcia arturo
  • 2 篇 sun huihui
  • 2 篇 gorlatch sergei
  • 2 篇 ramos felix f.
  • 2 篇 olivares ulises
  • 2 篇 rodriguez hector...
  • 1 篇 strey a
  • 1 篇 rojas otilio
  • 1 篇 gallego s.
  • 1 篇 wang dong
  • 1 篇 guevara-jordan j...
  • 1 篇 solano freysimar
  • 1 篇 shahbahrami a
  • 1 篇 belendez a.
  • 1 篇 rodriguez robert
  • 1 篇 otero b.
  • 1 篇 soliman mostafa ...
  • 1 篇 neipp c.
  • 1 篇 wang qi

语言

  • 10 篇 英文
检索条件"主题词=SIMD extensions"
10 条 记 录,以下是1-10 订阅
排序:
Vectorizing programs with IF-statements for processors with simd extensions
收藏 引用
JOURNAL OF SUPERCOMPUTING 2020年 第6期76卷 4731-4746页
作者: Sun, Huihui Gorlatch, Sergei Zhao, Rongcai Univ Munster Munster Germany Natl Digital Switching Syst Engn & Technol Res Ct Zhengzhou Henan Peoples R China
Vectorization of programs is crucial for achieving high performance on modern processors with simd (Single Instruction Multiple Data) extensions. Programs with IF-statements suffer from control flow divergence that se... 详细信息
来源: 评论
Refactoring Loops with Nested IFs for simd extensions Without Masked Instructions  24th
Refactoring Loops with Nested IFs for SIMD Extensions Withou...
收藏 引用
International European Conference on Parallel and Distributed Computing (Euro-Par)
作者: Sun, Huihui Gorlatch, Sergei Zhao, Rongcai Univ Munster Munster Germany Natl Digital Switching Syst Engn & Technol Res Ct Zhengzhou Peoples R China
Most CPUs in heterogeneous systems are now equipped with simd (Single Instruction Multiple Data) extensions that operate on short vectors in parallel to enable high performance. Refactoring programs for such systems r... 详细信息
来源: 评论
A performance analysis of a mimetic finite difference scheme for acoustic wave propagation on GPU platforms
收藏 引用
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2017年 第4期29卷
作者: Otero, Beatriz Frances, Jorge Rodriguez, Robert Rojas, Otilio Solano, Freysimar Guevara-Jordan, Juan Univ Politecn Cataluna Dept Arquitectura Comp Barcelona Spain Dept Fis Ingn Sistemes & Teoria Senal E-03080 Alicante Spain Univ Alicante Inst Univ Fis Aplicada Ciencias & Tecnol E-03080 Alicante Spain Univ Politecn Cataluna Escola Tecn Super Engn Telecomunicacio Barcelona Barcelona Spain Cent Univ Venezuela Fac Ciencias Escuela Computac Caracas Venezuela Barcelona Supercomp Ctr Dept Comp Applicat Sci & Engn Barcelona Spain Cent Univ Venezuela Fac Ciencias Escuela Matemat Caracas Venezuela
Realistic applications of numerical modeling of acoustic wave dynamics usually demand high-performance computing because of the large size of study domains and demanding accuracy requirements on simulation results. Fo... 详细信息
来源: 评论
Outer-Loop Auto-Vectorization for simd Architectures Based on Open64 Compiler  17
Outer-Loop Auto-Vectorization for SIMD Architectures Based o...
收藏 引用
17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)
作者: Wang Dong Zhao Rongcai Wang Qi Li Yingying PLA Informat Engn Univ Zhengzhou Peoples R China
simd (Single Instruction Multiple Data) extensions are acceleration components integrated in general processor, aiming at extracting instruction and data level parallelism of multimedia and scientific calculation prog... 详细信息
来源: 评论
Efficient construction of bounding volume hierarchies into a complete octree for ray tracing
收藏 引用
COMPUTER ANIMATION AND VIRTUAL WORLDS 2016年 第3-4期27卷 358-368页
作者: Olivares, Ulises Rodriguez, Hector G. Garcia, Arturo Ramos, Felix F. Univ Nacl Autonoma Mexico Dept Computat Biol Lab Nacl Anal & Sintesis Ecol Escuela Nacl Estudios Super Unidad Morelia Morelia Michoacan Mexico IPN Ctr Invest & Estudios Avanzados Dept Elect Engn & Comp Sci Unidad Guadalajara Guadalajara Jalisco Mexico Intel Guadalajara Design Ctr Visual Parallel Comp Grp Guadalajara Jalisco Mexico
This paper proposes an efficient construction scheme for bounding volume hierarchies based on a complete tree. This construction offers up to 4x faster construction times than binned-surface area heuristic and offers ... 详细信息
来源: 评论
Efficient construction of bounding volume hierarchies into a complete octree for ray tracing
Efficient construction of bounding volume hierarchies into a...
收藏 引用
Computer Animation and Social Agents (CASA) Conference
作者: Olivares, Ulises Rodriguez, Hector G. Garcia, Arturo Ramos, Felix F. Univ Nacl Autonoma Mexico Dept Computat Biol Lab Nacl Anal & Sintesis Ecol Escuela Nacl Estudios Super Unidad Morelia Morelia Michoacan Mexico IPN Ctr Invest & Estudios Avanzados Dept Elect Engn & Comp Sci Unidad Guadalajara Guadalajara Jalisco Mexico Intel Guadalajara Design Ctr Visual Parallel Comp Grp Guadalajara Jalisco Mexico
This paper proposes an efficient construction scheme for bounding volume hierarchies based on a complete tree. This construction offers up to 4x faster construction times than binned-surface area heuristic and offers ... 详细信息
来源: 评论
Multi-GPU and multi-CPU accelerated FDTD scheme for vibroacoustic applications
收藏 引用
COMPUTER PHYSICS COMMUNICATIONS 2015年 第1期191卷 43-51页
作者: Frances, J. Otero, B. Bleda, S. Gallego, S. Neipp, C. Marquez, A. Belendez, A. Univ Alicante Dpto Fis Ingn Sistemas & Teoria Senal E-03080 Alicante Spain Univ Alicante Inst Univ Fis Aplicada Las Ciencias & Tecnol E-03080 Alicante Spain Univ Politecn Cataluna Dept Arquitectura Comp Barcelona Spain
The Finite-Difference Time-Domain (FDTD) method is applied to the analysis of vibroacoustic problems and to study the propagation of longitudinal and transversal waves in a stratified media. The potential of the schem... 详细信息
来源: 评论
Performance Evaluation of Multi-Core Intel Xeon Processors on Basic Linear Algebra Subprograms
收藏 引用
PARALLEL PROCESSING LETTERS 2009年 第1期19卷 159-174页
作者: Soliman, Mostafa I. South Valley Univ Aswan Fac Engn Elect Engn Dept Comp & Syst Sect Aswan 81542 Egypt
Multi-core technology is a natural next step in delivering the benefits of Moore's law to computing platforms. On multi-core processors, the performance of many applications would be improved by parallel processin... 详细信息
来源: 评论
On the suitability of simd extensions for neural network simulation
收藏 引用
MICROPROCESSORS AND MICROSYSTEMS 2003年 第7期27卷 341-351页
作者: Strey, A Univ Ulm Dept Neural Informat Proc D-89069 Ulm Germany
Current microprocessors contain simd execution units (also called multimedia or vector extensions) that allow the data-parallel execution of operations on several subwords packed in 64-bit or 128-bit registers. They c... 详细信息
来源: 评论
Performance comparison of simd implementations of the discrete wavelet transform
Performance comparison of SIMD implementations of the discre...
收藏 引用
16th IEEE International Conference on Application-Specific Systems, Architecture and Processors
作者: Shahbahrami, A Juurlink, B Vassiliadis, S Delft Univ Technol Fac Elect Engn Math & Comp Sci Comp Engn Lab NL-2600 AA Delft Netherlands
This paper focuses on simd implementations of the 2D discrete wavelet transform (DWT). The transforms considered are Daubechies' real-to-real method of four coefficients (Daub-4) and the integer-to-integer (5, 3) ... 详细信息
来源: 评论