检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

488 篇 会议
16 篇 期刊文献
9 册 图书

馆藏范围

513 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

211 篇 工学
- 197 篇 计算机科学与技术...
- 98 篇 软件工程
- 42 篇 电气工程
- 12 篇 电子科学与技术（可...
- 8 篇 信息与通信工程
- 6 篇 控制科学与工程
- 6 篇 生物工程
- 4 篇 机械工程
- 3 篇 力学（可授工学、理...
- 3 篇 化学工程与技术
- 1 篇 材料科学与工程（可...
- 1 篇 冶金工程
- 1 篇 动力工程及工程热...
- 1 篇 土木工程
- 1 篇 水利工程
- 1 篇 生物医学工程（可授...
55 篇 理学
- 35 篇 数学
- 6 篇 生物学
- 4 篇 系统科学
- 3 篇 物理学
- 3 篇 化学
- 1 篇 大气科学
- 1 篇 统计学（可授理学、...
16 篇 管理学
- 9 篇 管理科学与工程(可...
- 7 篇 图书情报与档案管...
- 5 篇 工商管理
2 篇 经济学
- 2 篇 应用经济学
2 篇 法学
- 2 篇 社会学

主题

97 篇 programming
82 篇 parallel process...
79 篇 parallel archite...
71 篇 parallel program...
63 篇 concurrent compu...
59 篇 computer archite...
46 篇 hardware
43 篇 computational mo...
39 篇 programming prof...
39 篇 algorithm design...
36 篇 parallel algorit...
34 篇 computer science
26 篇 dynamic programm...
24 篇 runtime
24 篇 heuristic algori...
22 篇 program processo...
22 篇 partitioning alg...
21 篇 costs
21 篇 instruction sets
21 篇 clustering algor...

机构

6 篇 school of comput...
5 篇 school of comput...
4 篇 school of comput...
3 篇 department of co...
3 篇 department of co...
3 篇 college of infor...
3 篇 school of electr...
2 篇 department of co...
2 篇 univ minnesota d...
2 篇 school of scienc...
2 篇 soochow univ sch...
2 篇 college of optoe...
2 篇 school of comput...
2 篇 beijing research...
2 篇 department of co...
2 篇 school of comput...
2 篇 department of co...
2 篇 department of co...
2 篇 department of co...
2 篇 vision computing...

作者

9 篇 zhong cheng
6 篇 cheng zhong
6 篇 jigang wu
5 篇 hui li
4 篇 shikai guo
4 篇 yeh-cheng chen
4 篇 zhang jinxiong
4 篇 yidong li
4 篇 rong chen
4 篇 sivasankaran raj...
4 篇 hong shen
3 篇 chen danyang
3 篇 wei liu
3 篇 liu jun
3 篇 ruey-shun chen
3 篇 wang shunxu
3 篇 rajamanickam siv...
3 篇 naixue xiong
3 篇 guangzhong sun
3 篇 tonglai liu

语言

510 篇 英文
2 篇 其他
2 篇 中文

检索条件"任意字段=Seventh International Symposium on Parallel Architectures, Algorithms and Programming"

共 513 条记录，以下是191-200 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Convergence and Scalarization for Data-parallel architectures

Convergence and Scalarization for Data-Parallel Architecture...

引用

11th IEEE/ACM international symposium on Code Generation and Optimization (CGO)

作者： Lee, Yunsup Krashinsky, Ronny Grover, Vinod Keckler, Stephen W. Asanovic, Krste Univ Calif Berkeley Berkeley CA 94720 USA NVIDIA Santa Clara CA USA Univ Texas Austin Austin TX 78712 USA

ISBN: (纸本)9781467355254;9781467355247

Modern throughput processors such as GPUs achieve high performance and efficiency by exploiting data parallelism in application kernels expressed as threaded code. One draw-back of this approach compared to conventional vector architectures is redundant execution of instructions that are common across multiple threads, resulting in energy inefficiency due to excess instruction dispatch, register file accesses, and memory operations. This paper proposes to alleviate these overheads while retaining the threaded programming model by automatically detecting the scalar operations and factoring them out of the parallel code. We have developed a scalarizing compiler that employs convergence and variance analyses to statically identify values and instructions that are invariant across multiple threads. Our compiler algorithms are effective at identifying convergent execution even in programs with arbitrary control flow, identifying two-thirds of the opportunity captured by a dynamic oracle. The compile-time analysis leads to a reduction in instructions dispatched by 29%, register file reads and writes by 31%, memory address counts by 47%, and data access counts by 38%.

关键词： CUDA GPU Scalarization

来源：评论

学校读者我要写书评

暂无评论

Modular design of data-parallel graph algorithms

Modular design of data-parallel graph algorithms

引用

2013 11th international Conference on High Performance Computing and Simulation, HPCS 2013

作者： Dash, Santanu Kumar Scholz, Sven-Bodo Christianson, Bruce University of Hertfordshire Hatfield United Kingdom Heriot-Watt University Edinburgh United Kingdom

ISBN: (纸本)9781479908363

Amorphous Data parallelism has proven to be a suitable vehicle for implementing concurrent graph algorithms effectively on multi-core architectures. In view of the growing complexity of graph algorithms for information analysis, there is a need to facilitate modular design techniques in the context of Amorphous Data parallelism. In this paper, we investigate what it takes to formulate algorithms possessing Amorphous Data parallelism in a modular fashion enabling a large degree of code re-use. Using the betweenness centrality algorithm, a widely popular algorithm in the analysis of social networks, we demonstrate that a single optimisation technique can suffice to enable a modular programming style without loosing the efficiency of a tailor-made monolithic implementation. © 2013 IEEE.

关键词： Indium compounds

来源：评论

学校读者我要写书评

暂无评论

Portable Mapping of Data parallel Programs to OpenCL for Heterogeneous Systems

Portable Mapping of Data Parallel Programs to OpenCL for Het...

引用

11th IEEE/ACM international symposium on Code Generation and Optimization (CGO)

作者： Grewe, Dominik Wang, Zheng O'Boyle, Michael F. P. Univ Edinburgh Sch Informat Edinburgh EH8 9YL Midlothian Scotland

ISBN: (纸本)9781467355254;9781467355247

General purpose GPU based systems are highly attractive as they give potentially massive performance at little cost. Realizing such potential is challenging due to the complexity of programming. This paper presents a compiler based approach to automatically generate optimized OpenCL code from data-parallel OpenMP programs for GPUs. Such an approach brings together the benefits of a clear high level language (OpenMP) and an emerging standard (OpenCL) for heterogeneous multi-cores. A key feature of our scheme is that it leverages existing transformations, especially data transformations, to improve performance on GPU architectures and uses predictive modeling to automatically determine if it is worthwhile running the OpenCL code on the GPU or OpenMP code on the multi-core host. We applied our approach to the entire NAS parallel benchmark suite and evaluated it on two distinct GPU based systems: Core i7/NVIDIA GeForce GTX 580 and Core i7/AMD Radeon 7970. We achieved average (up to) speedups of 4.51x and 4.20x (143x and 67x) respectively over a sequential baseline. This is, on average, a factor 1.63 and 1.56 times faster than a hand-coded, GPU-specific OpenCL implementation developed by independent expert programmers.

关键词： GPU OpenCL Machine-Learning Mapping

来源：评论

学校读者我要写书评

暂无评论

Proceedings - 2012 5th international symposium on parallel architectures, algorithms and programming, PAAP 2012

Proceedings - 2012 5th International Symposium on Parallel A...

引用

2012 5th international symposium on parallel architectures, algorithms and programming, PAAP 2012

ISBN: (纸本)9780769548982

The proceedings contain 44 papers. The topics discussed include: reduce data coherence cost with an area efficient double layer counting bloom filter;synchronization-aware dynamic thread scheduling for improving performance and saving energy in multi-core embedded systems;efficient and secure trust negotiation over the Internet;design a low-power scheduling mechanism for a multicore android system;energy-aware scheduling for weakly-hard real-time system with I/O device;sparse matrix-vector multiplication based on network-on-chip: on data mapping;monoecism watermarking algorithm;a new piecewise chaotic mapping and its application in image secure communication;formulistic detection of malicious fast-flux domains;task scheduling prediction algorithms for dynamic hardware/software partitioning;and triggering cascades on strongly connected directed graphs.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Interactive Debugging of Dynamic Dataflow Embedded Applications

Interactive Debugging of Dynamic Dataflow Embedded Applicati...

引用

IEEE international symposium on parallel and Distributed Processing Workshops and Phd Forum (IPDPSW)

作者： Kevin Pouget Patricia López Cueva Miguel Santana Jean-François Méhaut STMicroelectronics Crolles France LIG University of Grenoble Grenoble France

Debugging parallel and concurrent applications is well-recognized as a time-consuming task, which often requires a significant part of the application development process. In the context of embedded systems, Multi-Processor-System-on-Chip(MPSoC) architectures feature numerous multicore processors which may be coupled with heterogeneous processors like Digital Signal Processors (DSPs) and/or application-specific accelerators. In this situation, it is important that developers are provided with high-level programming environments able to efficiently exploit these architectures, as well as suitable debugging tools. Dataflow programming models were explicitly designed to program parallel architectures and they have the ability to abstract away heterogeneous computing complexity. In addition, the stream-processing aspect of multimedia algorithms naturally exhibits data-dependency graphs, which simplifies application design and implementation. In this paper, we propose a new approach for interactive debugging of dataflow applications. Going beyond the long-established ability of interactive debuggers to support sequential programming languages, we describe the functionalities they should be able to provide to debug embedded and parallel dataflow applications. Then we demonstrate our solution to this problem with a proof-of-concept debugger targeting the dataflow framework used on an industrial MPSoC platform. We also explain the development challenges we faced during the implementation of this GDB-based debugger and illustrate its efficiency through a case study of a video decoder debugging session.

关键词： Debugging Object oriented modeling Program processors programming Computer architecture Data models Context

来源：评论

学校读者我要写书评

暂无评论

Convergence and scalarization for data-parallel architectures

Convergence and scalarization for data-parallel architecture...

引用

international symposium on Code Generation and Optimization (CGO)

作者： Yunsup Lee Ronny Krashinsky Vinod Grover Stephen W. Keckler Krste Asanović University of California at Berkeley USA NVIDIA USA University of Texas at Austin USA

ISBN: (纸本)9781467355247

Modern throughput processors such as GPUs achieve high performance and efficiency by exploiting data parallelism in application kernels expressed as threaded code. One draw-back of this approach compared to conventional vector architectures is redundant execution of instructions that are common across multiple threads, resulting in energy inefficiency due to excess instruction dispatch, register file accesses, and memory operations. This paper proposes to alleviate these overheads while retaining the threaded programming model by automatically detecting the scalar operations and factoring them out of the parallel code. We have developed a scalarizing compiler that employs convergence and variance analyses to statically identify values and instructions that are invariant across multiple threads. Our compiler algorithms are effective at identifying convergent execution even in programs with arbitrary control flow, identifying two-thirds of the opportunity captured by a dynamic oracle. The compile-time analysis leads to a reduction in instructions dispatched by 29%, register file reads and writes by 31% memory address counts by 47%, and data access counts by 38%.

关键词： Instruction sets Convergence Registers Kernel Computer architecture Graphics processing units Algorithm design and analysis

来源：评论

学校读者我要写书评

暂无评论

A parallel method for generalized eigenvalue problems based on Multi-core Platform

A parallel method for generalized eigenvalue problems based ...

引用

5th international symposium on parallel architectures, algorithms and programming (PAAP)

作者： Wang, Shunxu Huaihai Inst Technol Sch Sci Lianyungang 222005 Jiangsu Peoples R China

ISBN: (纸本)9780769548982;9781467345668

In this paper, a parallel method for solving generalized eigenvalue problem based on multi-core platform is presented, which can provide parts of the eigenpairs in parallel. Compared with traditional numerical method, the parallel method in this paper using numerical integration, numerical experiments are implemented with a quad-core computer under the programming environment of Matlab parallel toolbox. The problems of computing the frequencies of a plane wing and aircraft pylon are taken as examples, which show the efficiency and applicability of our scheme.

关键词： parallel Computing Generalized Eigenvalue Problems Numerical Integration

来源：评论

学校读者我要写书评

暂无评论

Performance Analysis and Optimization of the Tiled Cholesky Factorization on NUMA Machines

Performance Analysis and Optimization of the Tiled Cholesky ...

引用

5th international symposium on parallel architectures, algorithms and programming (PAAP)

作者： Jeannot, Emmanuel Inria Bordeaux Sud Ouest LaBRI Bordeaux France

We discuss some performance issues of the tiled Cholesky factorization on non-uniform memory access-time (NUMA) shared memory machines. We show how to optimize thread placement and data placement in order to achieve p... 详细信息

ISBN: (纸本)9780769548982;9781467345668

关键词： matrix decomposition parallel processing performance evaluation shared memory systems

来源：评论

学校读者我要写书评

暂无评论

A divide-and-conquer algorithm of Delaunay triangulation with GPGPU

A divide-and-conquer algorithm of Delaunay triangulation wit...

引用

5th international symposium on parallel architectures, algorithms and programming (PAAP)

作者： Chen, Min-Bin China Univ Technol Taipei 116 Taiwan

ISBN: (纸本)9780769548982;9781467345668

In this study, we will parallelize the D&C algorithm with CUDA. In stead of recursive programming in D&C, the recursive stack is implemented on the host side (CPU) and the merge operation is executes on GPU in parallel. Since the recursive stack is a fully binary tree in this algorithm, the merge operations on the nodes in each layer of the binary tree can be performed synchronously. In this data-parallel computation, with the careful management of data structure, the data of each node can be arranged in the same block and no need to share data between threads, so the parallelism is not broken.

关键词： divide and conquer methods graphics processing units mesh generation parallel architectures tree data structures

来源：评论

学校读者我要写书评

暂无评论

The state minimization problem for nondeterministic finite automata: the parallel implementation of the truncated branch and bound method

The state minimization problem for nondeterministic finite a...

引用

5th international symposium on parallel architectures, algorithms and programming (PAAP)

作者： Melnikov, Boris Tsyganov, Andrey Togliatti State Univ Tolyatti Russia Ulyanovsk State Pedag Univ Ulyanovsk Russia

ISBN: (纸本)9780769548982;9781467345668

In this paper we present an approach to the parallel implementation of the state minimization problem for nondeterministic finite automata. This approach is based on the truncated branch and bound method and also on the usage of basis and COM automata for the given language. Minimum state automata are searched as sub-automata of the COM automaton. Some sufficient conditions for their equivalence to the given nondeterministic automaton are proved in terms of the loops of the basis automaton. We suggest exact and heuristic state minimization algorithms, discuss their implementation details and provide some experimental results.

关键词： finite automata nondeterminisitc automata state minimization parallelism OpenMP MPI

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共52页 << < 16 17 18 19 20 21 22 23 24 25 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：