Search query: subject term = "Compiler Optimization"
368 records; showing 111-120

An adaptive algorithm selection framework for reduction parallelization
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2006, vol. 17, no. 10, pp. 1084-1096
Authors: Yu, Hao; Rauchwerger, Lawrence
Affiliations: IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598, USA; Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843, USA
Irregular and dynamic memory reference patterns can cause performance variations for low level algorithms in general and for parallel algorithms in particular. In this paper, we present an adaptive algorithm selection...

IMPROVING MEMORY UTILIZATION IN CACHE COHERENCE DIRECTORIES
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1993, vol. 4, no. 10, pp. 1130-1146
Authors: LILJA, DJ; YEW, PC
Affiliations: UNIV ILLINOIS, CTR SUPERCOMP RES & DEV, URBANA, IL 61801
Efficiently maintaining cache coherence is a major problem in large-scale shared memory multiprocessors. Hardware directory coherence schemes have very high memory requirements, while software-directed schemes must re...

Efficient and safe-for-space closure conversion
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2000, vol. 22, no. 1, pp. 129-161
Authors: Shao, Z; Appel, AW
Affiliations: Yale Univ, Dept Comp Sci, New Haven, CT 06520, USA; Princeton Univ, Dept Comp Sci, Princeton, NJ 08544, USA
Modern compilers often implement function calls (or returns) in two steps: first, a "closure" environment is properly installed to provide access for free variables in the target program fragment; second, the...

Tsoa: a two-stage optimization approach for GCC compilation options to minimize execution time
AUTOMATED SOFTWARE ENGINEERING, 2024, vol. 31, no. 2, pp. 1-36
Authors: Ni, Youcong; Du, Xin; Yuan, Yuan; Xiao, Ruliang; Chen, Gaolin
Affiliations: Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou, Peoples R China; Beihang Univ, Beijing, Peoples R China
The open-source compiler GCC offers numerous options to improve execution time. Two categories of approaches, machine learning-based and design space exploration, have emerged for selecting the optimal set of options....

Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2006, vol. 66, no. 5, pp. 659-673
Authors: Krishnan, S; Krishnamoorthy, S; Baumgartner, G; Lam, CC; Ramanujam, J; Sadayappan, P; Choppella, V
Affiliations: Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210, USA; Louisiana State Univ, Dept Comp Sci, Baton Rouge, LA 70803, USA; Louisiana State Univ, Dept Elect & Comp Engn, Baton Rouge, LA, USA; Indian Inst Informat Technol & Management, Thiruvananthapuram 695581, Kerala, India
We address the problem of efficient out-of-core code generation for a special class of imperfectly nested loops encoding tensor contractions arising in quantum chemistry computations. These loops operate on arrays too...

Compiler-based I/O prefetching for out-of-core applications
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2001, vol. 19, no. 2, pp. 111-170
Authors: Brown, AD; Mowry, TC; Krieger, O
Affiliations: Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213, USA; IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598, USA
Current operating systems offer poor performance when a numeric application's working set does not fit in main memory. As a result, programmers who wish to solve "out-of-core" problems efficiently are ty...

Compiler controlled prefetching for multiprocessors using low-overhead traps and prefetch engines
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2000, vol. 60, no. 5, pp. 585-615
Authors: Skeppstedt, J; Dubois, M
Affiliations: Univ Lund, Dept Comp Sci, S-22100 Lund, Sweden; Univ So Calif, Dept Elect Engn Syst, Los Angeles, CA 90089, USA
In this paper we propose and evaluate a new data-prefetching technique for cache coherent multiprocessors. Prefetches are issued by a functional unit called a prefetch engine which is controlled by the compiler. We le...

PMU Guided Structure Data-Layout Optimization
Tsinghua Science and Technology, 2011, vol. 16, no. 2, pp. 145-150
Authors: 闫家年; 陈文光; 郑纬民
Affiliations: Department of Computer Science and Technology, Tsinghua University
Existing methods of obtaining runtime feedback for structure data-layout optimization have several drawbacks, such as large overhead and difficulty composing training sets. As a result, structure data-layout optimizat...

Allo: A Programming Model for Composable Accelerator Design
PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2024, vol. 8, issue PLDI, pp. 593-620
Authors: Chen, Hongzheng; Zhang, Niansong; Xiang, Shaojie; Zeng, Zhichen; Dai, Mengjia; Zhang, Zhiru
Affiliations: Cornell Univ, Ithaca, NY 14853, USA; Univ Sci & Technol China, Hefei, Peoples R China
Special-purpose hardware accelerators are increasingly pivotal for sustaining performance improvements in emerging applications, especially as the benefits of technology scaling continue to diminish. However, designer...

STATIC SCHEDULING FOR BARRIER MIMD ARCHITECTURES
JOURNAL OF SUPERCOMPUTING, 1992, vol. 5, no. 4, pp. 263-289
Authors: DIETZ, HG; ZAAFRANI, A; OKEEFE, MT
Affiliations: UNIV MINNESOTA, DEPT ELECT ENGN, MINNEAPOLIS, MN 55455
In a SIMD or VLIW machine, conceptual synchronizations are accomplished by using a static code schedule that does not require run-time synchronization. The lack of run-time synchronization overhead makes these machine...