ISBN: (Print) 3540897399
The proceedings contain 24 papers. The topics discussed include: CUDA-lite: reducing GPU programming complexity; MCUDA: an efficient implementation of CUDA kernels for multi-core CPUs; on the scalability of an automatically parallelized irregular application; statistically analyzing execution variance for soft real-time applications; minimum lock assignment: a method for exploiting concurrency among critical sections; set-congruence dynamic analysis for thread-level speculation (TLS); thread safety through partition and effect agreements; a fully parallel LISP2 compactor with preservation of the sliding properties; design for interoperability in STAPL: pMatrices and linear algebra algorithms; implementation of sensitivity analysis for automatic parallelization; just-in-time locality and percolation for optimizing irregular applications on a manycore architecture; and exploring the optimization space of dense linear algebra kernels.
ISBN: (Print) 9783030346263
The proceedings contain 14 papers. The special focus in this conference is on languages and compilers for parallel computing. The topics include: GASNet-EX: a high-performance, portable communication library for exascale; nested parallelism with algorithmic skeletons; HDArray: parallel array interface for distributed heterogeneous devices; automating the exchangeability of shared data abstractions; design and performance analysis of real-time dynamic streaming applications; a similarity measure for GPU kernel subgraph matching; new opportunities for compilers in computer security; footmark: a new formulation for working set statistics; towards an achievable performance for the loop nests; extending index-array properties for data dependence analysis; optimized sound and complete data race detection in structured parallel programs; and compiler optimizations for parallel programs.
ISBN: (Print) 9783540897392
Dense linear algebra kernels such as matrix multiplication have been used as benchmarks to evaluate the effectiveness of many automated compiler optimizations. However, few studies have looked at collectively applying the transformations and parameterizing them for external search. In this paper, we take a detailed look at the optimization space of three dense linear algebra kernels. We use a transformation scripting language (POET) to implement each kernel-level optimization as applied by ATLAS. We then extensively parameterize these optimizations from the perspective of a general-purpose compiler and use a stand-alone empirical search engine to explore the optimization space using several different search strategies. Our exploration of the search space reveals key interactions among several transformations that must be considered by compilers to approach the level of efficiency obtained through manual tuning of kernels.
ISBN: (Print) 9780769528984
A study is presented in applying optimistic parallel discrete event simulation techniques using reverse execution to perform instruction-level simulations of distributed memory multi-processor systems. A static program analysis approach is described that optimizes pre-processed simulated applications in order to remove certain overheads associated with forward event execution and to enable reversible execution. Reverse execution of floating point operations is also considered. Preliminary performance measurements are presented indicating this approach offers promise in speeding up parallel multi-processor simulations.
ISBN: (Print) 9783540897392
Demand for instruction level parallelism calls for increasing register bandwidth without increasing the number of register ports. Emerging architectures address this need by partitioning registers into multiple distributed banks, which offers a technology scalable substrate but a challenging compilation target. This paper introduces a register allocator for spatially partitioned architectures. The allocator performs bank assignment together with allocation. It minimizes spill code and optimizes bank selection based on a priority function. This algorithm is unique because it must reason about multiple competing resource constraints and dependencies exposed by these architectures. We demonstrate an algorithm that uses critical path estimation, delays from registers to consuming functional units, and hardware resource constraints. We evaluate the algorithm on TRIPS, a functional, partitioned, tiled processor with register banks distributed on top of a 4 x 4 grid of ALUs. These results show that the priority banking algorithm implements a number of policies that improve performance, that performance is sensitive to bank assignment, and that the compiler manages this resource well.
ISBN: (Print) 9783540897392
As hardware systems move toward multicore and multi-threaded architectures, programmers increasingly rely on automated tools to help with both the parallelization of legacy codes and effective exploitation of all available hardware resources. Thread-level speculation (TLS) has been proposed as a technique to parallelize the execution of serial codes or serial sections of parallel codes. One of the key aspects of TLS is task selection for speculative execution. In this paper we propose a cost model for compiler-driven task selection for TLS. The model employs profile-based analysis of may-dependences to estimate the probability of successful speculation. We discuss two techniques to eliminate potential inter-task dependences, thereby improving the rate of successful speculation. We also present a profiling tool, DProf, that is used to provide run-time information about may-dependences to the compiler and map dynamic dependences to the source code. This information is also made available to the programmer to assist in code rewriting and/or algorithm redesign. We used DProf to quantify the potential of this approach and we present results on selected applications from the SPEC CPU2006 and SEQUOIA benchmarks.
ISBN: (Print) 9783540897392
CUDA is a data parallel programming model that supports several key abstractions - thread blocks, hierarchical memory and barrier synchronization - for writing applications. This model has proven effective in programming GPUs. In this paper we describe a framework called MCUDA, which allows CUDA programs to be executed efficiently on shared-memory, multi-core CPUs. Our framework consists of a set of source-level compiler transformations and a runtime system for parallel execution. Preserving program semantics, the compiler transforms threaded SPMD functions into explicit loops, performs fission to eliminate barrier synchronizations, and converts scalar references to thread-local data into replicated vector references. We describe an implementation of this framework and demonstrate performance approaching that achievable from manually parallelized and optimized C code. With these results, we argue that CUDA can be an effective data-parallel programming model for more than just GPU architectures.
ISBN: (Print) 9783540897392
Programming paradigms are designed to express algorithms elegantly and efficiently. There are many parallel programming paradigms, each suited to a certain class of problems. Selecting the best parallel programming paradigm for a problem minimizes programming effort and maximizes performance. Given the increasing complexity of parallel applications, no one paradigm may be suitable for all components of an application. Today, most parallel scientific applications are programmed with a single paradigm and the challenge of multi-paradigm parallel programming remains unmet in the broader community. We believe that each component of a parallel program should be programmed using the most suitable paradigm. Furthermore, it is not sufficient to simply bolt modules together: programmers should be able to switch between paradigms easily, and resource management across paradigms should be automatic. We present a pre-existing adaptive run-time system (ARTS) and show how it can be used to meet these challenges by allowing the simultaneous use of multiple parallel programming paradigms and supporting resource management across all of them. We discuss the implementation of some common paradigms within the ARTS and demonstrate the use of multiple paradigms within our feature-rich unstructured mesh framework. We show how this approach boosts performance and productivity for an application developed using this framework.
ISBN: (Print) 9783540852605
Application performance is heavily dependent on compiler optimizations. Modern compilers rely largely on the information made available to them at the time of compilation. In this regard, specializing the code according to input values is an effective way to communicate necessary information to the compiler. However, static specialization suffers from possible code explosion, and dynamic specialization requires runtime compilation activities that may degrade the overall performance of the application. This article proposes an automated approach for specializing code that is able to address both the problem of code size increase and the overhead of runtime activities. We first obtain optimized code through specialization performed at static compile time and then generate a template that can work for a large set of values through runtime specialization. Our experiments show significant improvement for different SPEC benchmarks on Itanium-II (IA-64) and Pentium-IV processors using the icc and gcc compilers.