Search query: subject term = "SIMD vectorization"
33 records; showing 21-30
An efficient and portable SIMD algorithm for charge/current deposition in Particle-In-Cell codes
COMPUTER PHYSICS COMMUNICATIONS, 2017, vol. 210, pp. 145-154
Authors: Vincenti, H.; Lobet, M.; Lehe, R.; Sasanka, R.; Vay, J.-L.
Affiliations: Lawrence Berkeley Natl Lab 1 Cyclotron Rd Berkeley CA 94720 USA; CEA Lasers Interact & Dynam Lab LIDyL Gif Sur Yvette France; Intel Corp Hillsboro OR 97124 USA
In current computer architectures, data movement (from die to network) is by far the most energy consuming part of an algorithm (≈20 pJ/word on-die to 10,000 pJ/word on the network). To increase memory locality ...
Extending OpenMP SIMD Support for Target Specific Code and Application to ARM SVE
13th International Workshop on OpenMP (IWOMP)
Authors: Lee, Jinpil; Petrogalli, Francesco; Hunter, Graham; Sato, Mitsuhisa
Affiliations: RIKEN Adv Inst Computat Sci Kobe Hyogo Japan; ARM Ltd Cambridge England
Recent trends in processor design accommodate wide vector extensions. SIMD vectorization is more important than before to exploit the potential performance of the target architecture. The latest OpenMP specification p...
Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization
46th International Conference on Parallel Processing Workshops (ICPPW)
Authors: Stylianopoulos, Charalampos; Almgren, Magnus; Landsiedel, Olaf; Papatriantafilou, Marina
Affiliations: Chalmers Univ Technol Gothenburg Sweden
Pattern matching is a key building block of Intrusion Detection Systems and firewalls, which are deployed nowadays on commodity systems from laptops to massive web servers in the cloud. In fact, pattern matching is on...
Dynamic SIMD Vector Lane Scheduling
International Supercomputing Conference (ISC High Performance)
Authors: Krzikalla, Olaf; Wende, Florian; Hoehnerbach, Markus
Affiliations: Tech Univ Dresden Dresden Germany; Zuse Inst Berlin Germany; RWTH Univ Aachen Germany
A classical technique to vectorize code that contains control flow is a control-flow to data-flow conversion. In that approach statements are augmented with masks that denote whether a given vector lane participates i...
A Basic Linear Algebra Compiler for Structured Matrices
14th International Symposium on Code Generation and Optimization (CGO)
Authors: Spampinato, Daniele G.; Pueschel, Markus
Affiliations: Swiss Fed Inst Technol Dept Comp Sci Zurich Switzerland
Many problems in science and engineering are in practice modeled and solved through matrix computations. Often, the matrices involved have structure such as symmetric or triangular, which reduces the operations count ...
An Empirical Study of Performance, Power Consumption, and Energy Cost of Erasure Code Computing for HPC Cloud Storage Systems
IEEE International Conference on Networking, Architecture and Storage (NAS 2015)
Authors: Chen, Hsing-bung; Grider, Gary; Inman, Jeff; Fields, Parks; Kuehn, Jeff Alan
Affiliations: Los Alamos Natl Lab Los Alamos NM 87545 USA
Erasure code storage systems are becoming popular choices for cloud storage systems due to cost-effective storage space saving schemes and higher fault-resilience capabilities. Both erasure code encoding and decoding...
Soft-Output Demapper and Viterbi Decoder for Software-Defined Radio
Conference on Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments
Authors: Marcin, Darmetko
Affiliations: Warsaw Univ Technol Inst Radioelect Warsaw Poland
Viterbi algorithm is commonly used in communication systems to decode convolutional codes. Soft decision demapping can be used to further improve Viterbi decoder performance. This paper presents implementation of soft...
Cross-Loop Optimization of Arithmetic Intensity for Finite Element Local Assembly
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, vol. 11, no. 4, pp. 57-57
Authors: Luporini, Fabio; Varbanescu, Ana Lucia; Rathgeber, Florian; Bercea, Gheorghe-Teodor; Ramanujam, J.; Ham, David A.; Kelly, Paul H. J.
Affiliations: Imperial Coll London Dept Comp London England; Univ Amsterdam Inst Informat NL-1012 WX Amsterdam Netherlands; Imperial Coll London Dept Math London England; Louisiana State Univ Ctr Computat & Technol Baton Rouge LA 70803 USA; Louisiana State Univ Sch Elect Engn & Comp Sci Baton Rouge LA 70803 USA
We study and systematically evaluate a class of composable code transformations that improve arithmetic intensity in local assembly operations, which represent a significant fraction of the execution time in finite el...
A Basic Linear Algebra Compiler
Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization
Authors: Daniele G. Spampinato; Markus Püschel
Affiliations: Dept. of Computer Science ETH Zurich
Many applications in media processing, control, graphics, and other domains require efficient small-scale linear algebra computations. However, most existing high performance libraries for linear algebra, such as ATLA...
Vectorization Past Dependent Branches Through Speculation
22nd International Conference on Parallel Architectures and Compilation Techniques (PACT)
Authors: Sujon, Majedul Haque; Whaley, R. Clint; Yi, Qing
Affiliations: Univ TX San Antonio Dept Comp Sci San Antonio TX 78249 USA
Modern architectures increasingly rely on SIMD vectorization to improve performance for floating point intensive scientific applications. However, existing compiler optimization techniques for automatic vectorization ...