Refine search results

Document type

  • 16 conference papers
  • 15 journal articles
  • 1 thesis

Holdings

  • 32 electronic documents
  • 0 print holdings

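Subject classification

  • 25 papers: Engineering
    • 23 papers: Computer Science and Technology...
    • 7 papers: Software Engineering
    • 3 papers: Electrical Engineering
    • 1 paper: Information and Communication Engineering
    • 1 paper: Control Science and Engineering
    • 1 paper: Bioengineering
  • 10 papers: Science
    • 4 papers: Mathematics
    • 4 papers: Physics
    • 1 paper: Chemistry
    • 1 paper: Biology
  • 2 papers: Management
    • 2 papers: Management Science and Engineering (...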

Topics

  • 32 papers: simd vectorizati...
  • 4 papers: openmp
  • 3 papers: tiling
  • 2 papers: performance
  • 2 papers: program synthesi...
  • 2 papers: mpi
  • 2 papers: compiler optimiz...
  • 2 papers: basic linear alg...
  • 2 papers: speculation
  • 2 papers: csr2
  • 2 papers: avx2
  • 2 papers: spmv
  • 2 papers: atlas
  • 2 papers: ifko
  • 2 papers: iterative compil...
  • 2 papers: dsl
  • 2 papers: matrix multiplic...
  • 2 papers: csr5
  • 1 paper: parallel algorit...
  • 1 paper: video retrieval

Institutions

  • 2 papers: tsinghua univ de...
  • 2 papers: qinghai univ dep...
  • 2 papers: swiss fed inst t...
  • 1 paper: louisiana state ...
  • 1 paper: stfc daresbury l...
  • 1 paper: univ paris sacla...
  • 1 paper: warsaw univ tech...
  • 1 paper: tech univ berlin...
  • 1 paper: univ utah sci co...
  • 1 paper: hokkaido univ gr...
  • 1 paper: auis engn dept s...
  • 1 paper: high performance...
  • 1 paper: uppsala univ upp...
  • 1 paper: univ tx san anto...
  • 1 paper: university of te...
  • 1 paper: univ exeter coll...
  • 1 paper: edinburgh resear...
  • 1 paper: lawrence berkele...
  • 1 paper: dept. of compute...
  • 1 paper: univ bologna bol...

Authors

  • 2 papers: spampinato danie...
  • 2 papers: bian haodong
  • 2 papers: liu lingbin
  • 2 papers: wang xiaoying
  • 2 papers: huang jianqiang
  • 2 papers: dong runting
  • 2 papers: lobet m.
  • 1 paper: inman jeff
  • 1 paper: alonso-jorda ped...
  • 1 paper: mirsalari seyed ...
  • 1 paper: nikos ntarmos
  • 1 paper: wang patricia p.
  • 1 paper: vay j. -l.
  • 1 paper: hoehnerbach mark...
  • 1 paper: yi qing
  • 1 paper: massimo f.
  • 1 paper: martinez hector
  • 1 paper: kidwai hashir k.
  • 1 paper: perez f.
  • 1 paper: guo yuluo

Language

  • 32 papers: English

Search criteria: "Subject term = SIMD vectorization"
32 records; showing 11-20
Structure optimization method based on automatic vectorization
EVOLUTIONARY INTELLIGENCE, 2020, Vol. 13, No. 1, pp. 51-58
Authors: Li, Yu-ping; Guo, Zhan-jie; Liu, Hui
Affiliations: Shangqiu Normal Univ Sch Informat Technol Shangqiu 476000 Peoples R China; Zhengzhou Tech Coll Dept Elect & Elect Engn Zhengzhou 450121 Peoples R China; Informat Engn Univ Collge Informat Syst Engn Zhengzhou 450001 Peoples R China
Structures are used extensively in programs such as scientific computing. But the non-continuity and the non-alignment of vectorization structure array have a dramatic influence on the efficiency of program'...

HACLxN: Verified Generic SIMD Crypto (for all your favorite platforms)
ACM SIGSAC Conference on Computer and Communications Security (ACM CCS)
Authors: Polubelova, Marina; Bhargavan, Karthikeyan; Protzenko, Jonathan; Beurdouche, Benjamin; Fromherz, Aymeric; Kulatova, Natalia; Zanella-Beguelin, Santiago
Affiliations: Inria Paris Paris France; Microsoft Res Redmond WA USA; Mozilla San Francisco CA USA; Carnegie Mellon Univ Pittsburgh PA 15213 USA
We present a new methodology for building formally verified cryptographic libraries that are optimized for multiple architectures. In particular, we show how to write and verify generic crypto code in the F* programmi...

CSR2: A New Format for SIMD-accelerated SpMV
20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid)
Authors: Bian, Haodong; Huang, Jianqiang; Dong, Runting; Liu, Lingbin; Wang, Xiaoying
Affiliations: Qinghai Univ Dept Comp Technol & Applicat Xining Peoples R China; Tsinghua Univ Dept Comp Sci & Technol Beijing Peoples R China
SpMV (sparse matrix-vector multiplication) has attracted the attention of researchers in related fields at home and abroad. Of course, improving SpMV performance has also been a research hot spot for researchers in re...

An optimized FM-index library for nucleotide and amino acid search
ALGORITHMS FOR MOLECULAR BIOLOGY, 2021, Vol. 16, No. 1, p. 25
Authors: Anderson, Tim; Wheeler, Travis J.
Affiliations: Univ Montana Dept Comp Sci Missoula MT 59812 USA
Background: Pattern matching is a key step in a variety of biological sequence analysis pipelines. The FM-index is a compressed data structure for pattern matching, with search run time that is independent of the leng...

TweTriS: Twenty trillion-atom simulation
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, Vol. 33, No. 5, pp. 838-854
Authors: Tchipev, Nikola; Seckler, Steffen; Heinen, Matthias; Vrabec, Jadran; Gratl, Fabio; Horsch, Martin; Bernreuther, Martin; Glass, Colin W.; Niethammer, Christoph; Hammer, Nicolay; Krischok, Bernd; Resch, Michael; Kranzlmueller, Dieter; Hasse, Hans; Bungartz, Hans-Joachim; Neumann, Philipp
Affiliations: Tech Univ Munich Dept Informat Garching Germany; Tech Univ Berlin Thermodynam & Proc Engn Berlin Germany; AUIS Engn Dept Sulaimani Iraq; STFC Daresbury Lab Sci Comp Dept Warrington Cheshire England; High Performance Comp Ctr Stuttgart HLRS Stuttgart Germany; Helmut Schmidt Univ Mech Engn Hamburg Germany; Leibniz Supercomp Ctr Garching Germany; Univ Stuttgart Inst High Performance Comp Stuttgart Germany; TU Kaiserslautern Lab Engn Thermodynam LTD Kaiserslautern Germany; Univ Hamburg Dept Informat Bundesstr 45a D-20146 Hamburg Germany
Significant improvements are presented for the molecular dynamics code ls1 mardyn - a linked cell-based code for simulating a large number of small, rigid molecules with application areas in chemical engineering. The ...

Adaptive SIMD optimizations in particle-in-cell codes with fine-grain particle sorting
COMPUTER PHYSICS COMMUNICATIONS, 2019, Vol. 244, pp. 246-263
Authors: Beck, A.; Derouillat, J.; Lobet, M.; Farjallah, A.; Massimo, F.; Zemzemi, I; Perez, F.; Vinci, T.; Grech, M.
Affiliations: Ecole Polytech Lab Leprince Ringuet CNRS IN2P3 F-91128 Palaiseau France; Ecole Polytech LULI CNRS CEA F-91128 Palaiseau France; UPMC Univ Paris 06 Sorbonne Univ CNRS LULI Pl Jussieu F-75252 Paris 05 France; Univ Paris Saclay Univ Paris Sud UVSQ Maison Simulat CEA CNRS F-91191 Gif Sur Yvette France; Intel Corp Meudon France
Particle-In-Cell (PIC) codes are broadly applied to the kinetic simulation of plasmas, from laser matter interaction to astrophysics. Their heavy simulation cost can be mitigated by using the Single Instruction Multip...

Program Generation for Small-Scale Linear Algebra Applications
16th International Symposium on Code Generation and Optimization (CGO), 2018
Authors: Spampinato, Daniele G.; Fabregat-Traver, Diego; Bientinesi, Paolo; Puschel, Markus
Affiliations: Swiss Fed Inst Technol Dept Comp Sci Zurich Switzerland; Rhein Westfal TH Aachen Aachen Inst Adv Study Computat Engn Sci Aachen Germany
We present SLINGEN, a program generation system for linear algebra. The input to SLINGEN is an application expressed mathematically in a linear-algebra-inspired language (LA) that we define. LA provides basic scalar/v...

High-Performance Implementation of Matrix-Free Runge-Kutta Discontinuous Galerkin Method for Euler Equations
20th IEEE International Conference on High Performance Computing and Communications (HPCC) / 16th IEEE International Conference on Smart City (SmartCity) / 4th IEEE International Conference on Data Science and Systems (DSS)
Authors: Feng, Yongquan; Yang, Wenjing; Sun, Liaoyuan; Lin, Zhipeng; Zhang, Yongjun
Affiliations: Natl Univ Def Technol Coll Comp State Key Lab High Performance Comp Changsha Hunan Peoples R China; Natl Innovat Inst Def Technol Beijing Peoples R China
The DG method is one of the mainstream high-order numerical discretization methods in CFD, and RKDG, which combines Runge-Kutta explicit time stepping with DG space discretization, plays an important role in the DG ...

Distributed memory building blocks for massive biological sequence analysis
Author: Pan, Tony C.
Affiliation: Georgia Institute of Technology
Degree: Doctoral
K-mer indices and de Bruijn graphs are important data structures in bioinformatics with multiple applications ranging from foundational tasks such as error correction, alignment, and genome assembly, to knowledge disc...

An efficient and portable SIMD algorithm for charge/current deposition in Particle-In-Cell codes
COMPUTER PHYSICS COMMUNICATIONS, 2017, Vol. 210, pp. 145-154
Authors: Vincenti, H.; Lobet, M.; Lehe, R.; Sasanka, R.; Vay, J.-L.
Affiliations: Lawrence Berkeley Natl Lab 1 Cyclotron Rd Berkeley CA 94720 USA; CEA Lasers Interact & Dynam Lab LIDyL Gif Sur Yvette France; Intel Corp Hillsboro OR 97124 USA
In current computer architectures, data movement (from die to network) is by far the most energy consuming part of an algorithm (≈20 pJ/word on-die to 10,000 pJ/word on the network). To increase memory locality ...