ISBN: (Print) 9781509036837
Hardware performance counters are used as effective proxies to estimate power consumption and runtime. In this paper we present a performance counter-based power and performance modeling and optimization method, and use the method to model four metrics: runtime, system power, CPU power and memory power. The performance counters that compose the models are used to explore some counter-guided optimizations with two large-scale scientific applications: an earthquake simulation and an aerospace application. We demonstrate the use of the method using two power-aware supercomputers, Mira at Argonne National Laboratory and SystemG at Virginia Tech. The counter-guided optimizations result in a reduction in energy by an average of 18.28% on up to 32,768 cores on Mira and 11.28% on up to 128 cores on SystemG for the aerospace application. For the earthquake simulation, the average energy reductions achieved are 48.65% on up to 4,096 cores on Mira and 30.67% on up to 256 cores on SystemG.
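The counter-based power modeling described above can be sketched as a linear fit of measured power against counter rates. The counters, sample values, and model form below are illustrative assumptions, not the paper's actual model:

```python
import numpy as np

# Synthetic "runs": rows are executions, columns are hypothetical counter
# rates (e.g. instructions/s, cache misses/s) -- names are illustrative only.
counters = np.array([
    [1.0e9, 2.0e6],
    [2.0e9, 1.0e6],
    [1.5e9, 4.0e6],
    [0.5e9, 8.0e6],
])
measured_power = np.array([110.0, 150.0, 140.0, 90.0])  # watts

# Fit power ≈ w0 + w1*c1 + w2*c2 with ordinary least squares.
X = np.hstack([np.ones((counters.shape[0], 1)), counters])
coef, *_ = np.linalg.lstsq(X, measured_power, rcond=None)

predicted = X @ coef
print(np.round(predicted, 1))
```

Once fitted, such a model predicts power for new runs from their counters alone, which is what makes counter-guided optimization possible without per-run power instrumentation.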
ISBN: (Print) 9781509036837
This paper studies the effects on energy consumption, power draw, and runtime of a modern compute GPU when changing the core and memory clock frequencies, enabling or disabling ECC, using alternate implementations, and varying the program inputs. We evaluate 34 applications from 5 benchmark suites and measure their power draw over time on a K20c GPU. Our results show that changing the frequency or the program implementation can alter the energy, power, and performance by a factor of two or more. Interestingly, some changes affect these three aspects very unevenly. ECC can greatly increase the runtime and energy consumption, but only on memory-bound codes. Compute-bound codes tend to behave quite differently from memory-bound codes, in particular regarding their power draw. On irregular programs, a small change in frequency can result in a large change in runtime and energy consumption.
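The uneven effect of frequency scaling on energy, power, and runtime follows from energy being the power-runtime product. A toy model (the wattages and runtimes below are made up, not K20c measurements) shows why lowering the core clock can cut power yet raise energy on compute-bound code, while helping memory-bound code:

```python
def energy(power_w, runtime_s):
    # Energy in joules is the power-runtime product.
    return power_w * runtime_s

# Compute-bound kernel: runtime stretches roughly with 1/f_core.
compute_hi = energy(120.0, 10.0)   # full core clock
compute_lo = energy(70.0, 20.0)    # half core clock: less power, MORE energy

# Memory-bound kernel: runtime barely moves when only the core clock drops.
memory_hi = energy(110.0, 10.0)
memory_lo = energy(75.0, 10.5)     # near-flat runtime: lower power wins

print(compute_hi, compute_lo, memory_hi, memory_lo)
```

The asymmetry between the two kernel classes is exactly the kind of behavior the paper's measurements expose.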
ISBN: (Print) 9781509036837
High-level tools for analyzing and predicting the performance of GPU-accelerated applications are scarce, at best. Although performance modeling approaches for GPUs exist, their complexity makes them virtually impossible to use to quickly analyze the performance of real-life applications and obtain easy-to-use, readable feedback. This is why, although GPUs are significant performance boosters in many HPC domains, performance prediction is still based on extensive benchmarking, and performance bottleneck analysis remains a nonsystematic, experience-driven process. In this context, we propose a tool for bottleneck analysis and performance prediction for GPU-accelerated applications. Based on random forest modeling, and using hardware performance counter data, our method can be used to quickly and accurately evaluate application performance on GPU-based systems for different problem characteristics and different hardware generations. We illustrate the benefits of our approach with three detailed use cases: a simple step-by-step example on a parallel reduction kernel, and two classical benchmarks (matrix multiplication and sequence alignment). Our results so far indicate that our statistical modeling is a quick, easy-to-use method to grasp the performance characteristics of applications running on GPUs. Our current work focuses on tackling some of its applicability limitations (more applications, more platforms) and improving its usability (full automation from input to user feedback).
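A minimal sketch of the random-forest-on-counters idea, using scikit-learn on synthetic data (the counter names, the data, and the use of `RandomForestRegressor` are assumptions for illustration, not the paper's pipeline):

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Synthetic training set: each row is one kernel run described by three
# hypothetical GPU counters (occupancy, DRAM throughput, issued instructions);
# the target is runtime. Real inputs would come from a profiler.
X = rng.uniform(size=(200, 3))
y = 2.0 * X[:, 1] + 0.5 * X[:, 2] + rng.normal(scale=0.01, size=200)

model = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)

# Feature importances hint at the bottleneck: here the DRAM-throughput
# column dominates, suggesting a memory-bound kernel.
print(np.round(model.feature_importances_, 2))
```

Reading feature importances as bottleneck hints is one way such a model can yield the "readable feedback" the abstract aims for.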
ISBN: (Print) 9781509021413
Many high-performance distributed memory applications rely on point-to-point messaging using the Message Passing Interface (MPI). Due to the latency of the network, and other costs, this communication can limit the scalability of an application when run on high node counts of distributed memory supercomputers. Communication costs are further increased on modern multi- and many-core architectures, when using more than one MPI process per node, as each process sends and receives messages independently, inducing multiple latencies and contention for resources. In this paper, we use shared memory constructs available in the MPI 3.0 standard to implement an aggregated communication method to minimize the number of inter-node messages to reduce these costs. We compare the performance of this Minimal Aggregated SHared Memory (MASHM) messaging to the standard point-to-point implementation on large-scale supercomputers, where we see that MASHM leads to enhanced strong scalability of a weighted Jacobi relaxation. For this application, we also see that the use of shared memory parallelism through MASHM and MPI 3.0 can be more efficient than using Open Multi-Processing (OpenMP). We then present a model for the communication costs of MASHM which shows that this method achieves its goal of reducing latency costs while also reducing bandwidth costs. Finally, we present MASHM as an open source library to facilitate the integration of this efficient communication method into existing distributed memory applications.
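The benefit of aggregation can be captured with a simple latency-bandwidth (alpha-beta) cost sketch. The constants and the model form below are illustrative assumptions, not the paper's MASHM cost model:

```python
# Toy alpha-beta model: ALPHA is per-message latency, BETA per-byte time.
# Values are made up for illustration.
ALPHA = 1e-6   # seconds per message
BETA = 1e-9    # seconds per byte

def p2p_cost(ppn, msg_bytes):
    # Every one of the ppn processes on a node sends its own inter-node
    # message, so the latency term is paid ppn times.
    return ppn * (ALPHA + BETA * msg_bytes)

def aggregated_cost(ppn, msg_bytes):
    # Processes pack into a shared-memory buffer; one aggregated message
    # leaves the node, paying the latency term once.
    return ALPHA + BETA * (ppn * msg_bytes)

ppn, nbytes = 16, 1024
print(p2p_cost(ppn, nbytes), aggregated_cost(ppn, nbytes))
```

In this toy form the bytes moved are identical and only latency shrinks; the paper's model additionally accounts for bandwidth savings (e.g. from fewer headers and less contention).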
ISBN: (Print) 9781509036837
As the data-driven economy evolves, enterprises have come to realize a competitive advantage in being able to act on high volume, high velocity streams of data. Technologies such as distributed message queues and stream processing platforms that can scale to thousands of data stream partitions on commodity hardware have emerged in response. However, the programming API provided by these systems is often low-level, requiring substantial custom code that adds to the programmer learning curve and maintenance overhead. Additionally, these systems often lack the SQL querying capabilities that have proven popular on Big Data systems like Hive, Impala or Presto. We define a minimal set of extensions to standard SQL for data stream querying and manipulation. These extensions are prototyped in SamzaSQL, a new tool for streaming SQL that compiles streaming SQL into physical plans that are executed on Samza, an open-source distributed stream processing framework. We compare the performance of streaming SQL queries against native Samza applications and discuss usability improvements. SamzaSQL is a part of the open source Apache Samza project and will be available for general use.
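A core streaming-SQL construct is the windowed aggregation. A minimal Python sketch of what a tumbling-window `GROUP BY` count computes over a stream (the event stream and window size are made up; this is the semantics, not SamzaSQL's implementation):

```python
from collections import Counter
from itertools import islice

def tumbling_counts(stream, window_size):
    # Consume the stream in fixed-size, non-overlapping windows and emit
    # per-key counts for each window -- the effect of a streaming
    # "SELECT key, COUNT(*) ... GROUP BY key" over tumbling windows.
    it = iter(stream)
    while True:
        window = list(islice(it, window_size))
        if not window:
            return
        yield dict(Counter(window))

events = ["click", "view", "click", "view", "view", "click"]
print(list(tumbling_counts(events, window_size=3)))
# → [{'click': 2, 'view': 1}, {'view': 2, 'click': 1}]
```

The point of a streaming SQL layer is to let users write the declarative query and have the system generate this kind of per-partition windowed plan for them.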
This paper describes the use of domain decomposition methods for accelerating wave physics simulation. Numerical wave-based methods provide more accurate simulation than geometrical methods, but at a higher computational cost as well. In the context of virtual reality, the quality of the results is judged by human perception, which makes geometrical methods an interesting approach for achieving real-time physically-based rendering. Here, we investigate a geometrical method based on both beam and ray tracing, which we enhance with two levels of parallel processing. Techniques from domain decomposition methods are coupled with classical parallel computing on both shared and distributed memory. Both optical and acoustic renderings are evaluated to assess the acceleration impact of the domain decomposition scheme. Speedup measurements clearly show the efficiency of using domain decomposition methods for real-time simulation of wave physics. Copyright (C) 2015 John Wiley & Sons, Ltd.
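The domain decomposition idea can be illustrated in miniature: partition the scene along one axis so each worker only tests rays against the primitives in its subdomain. The scene, partition count, and 1-D layout below are made-up simplifications, not the paper's scheme:

```python
def subdomain_index(x, xmin, xmax, nparts):
    # Map a position to one of nparts equal-width subdomains along x.
    i = int((x - xmin) / (xmax - xmin) * nparts)
    return min(max(i, 0), nparts - 1)

# Hypothetical scene primitives: (x position, identifier).
primitives = [(0.5, "wall"), (3.2, "panel"), (7.9, "screen")]
nparts = 4

# Bucket primitives by subdomain; each bucket could then go to one worker,
# so a ray only tests primitives in the subdomains it actually crosses.
buckets = {i: [] for i in range(nparts)}
for x, name in primitives:
    buckets[subdomain_index(x, 0.0, 8.0, nparts)].append(name)

print(buckets)
```

In the full method, this spatial partitioning is what allows the shared- and distributed-memory levels of parallelism to each work on a bounded piece of the scene.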
ISBN: (Print) 9781509036837
Suzaku is a pattern programming framework that enables programmers to create pattern-based parallel MPI programs without writing the MPI message-passing code implicit in the patterns. The purpose of this framework is to simplify message-passing programming and create better structured programs based upon established parallel design patterns. The focus for developing Suzaku is on teaching parallel programming. This paper covers the main features of Suzaku and describes our experiences using it in parallel programming classes.
ISBN: (Print) 9781509036837
In this paper we develop a theory of visualizing a parallel execution through the entropy of the phase space induced by its traces. This metric is then shown, both theoretically and practically, to be able to find program issues, such as starvation of one of its threads waiting on data.
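The intuition can be sketched with Shannon entropy over observed thread states: a healthy execution phase visits varied states, while a starving phase collapses toward a single waiting state, lowering the entropy. The trace states below are hypothetical, and this is only the entropy computation, not the paper's phase-space construction:

```python
import math
from collections import Counter

def shannon_entropy(states):
    # Entropy (in bits) of the empirical distribution of observed states.
    counts = Counter(states)
    total = len(states)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

# Hypothetical per-timestep states of one thread from a trace.
busy_phase = ["compute", "send", "recv", "compute", "send", "recv"]
starved_phase = ["wait", "wait", "wait", "wait", "compute", "wait"]

print(shannon_entropy(busy_phase), shannon_entropy(starved_phase))
```

A sharp drop in this quantity over a window of the trace is the kind of signal that flags the starvation issue the abstract mentions.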
ISBN: (Print) 9781509036837
Graphics Processing Units (GPUs) have been seeing widespread adoption in the field of scientific computing, owing to the performance gains provided on computation-intensive applications. In this paper, we present the design and implementation of a Hessenberg reduction algorithm immune to simultaneous soft errors, capable of taking advantage of hybrid GPU-CPU platforms. These soft errors are detected and corrected on the fly, preventing the propagation of the error to the rest of the data. Our design is at the intersection between several fault tolerant techniques and employs the algorithm-based fault tolerance technique, diskless checkpointing, and reverse computation to achieve its goal. By utilizing the idle time of the CPUs, and by overlapping both host-side and GPU-side workloads, we minimize the resilience overhead. Experimental results have validated our design decisions as our algorithm introduced less than 2% performance overhead compared to the optimized, but fault-prone, hybrid Hessenberg reduction.
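The algorithm-based fault tolerance ingredient rests on checksum invariants: maintain a checksum alongside the data, and a soft error breaks the invariant, revealing where it struck. A simplified row-checksum sketch (not the paper's Hessenberg-specific scheme; the matrix and injected error are synthetic):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
checksum = A.sum(axis=1)          # one checksum per row, kept alongside A

A_faulty = A.copy()
A_faulty[2, 1] += 5.0             # emulate a bit-flip-like soft error

# Re-checking the invariant localizes the corrupted row.
residual = np.abs(A_faulty.sum(axis=1) - checksum)
bad_row = int(np.argmax(residual))
print(bad_row)  # → 2
```

Combining row and column checksums additionally pins down the column, which is what lets ABFT schemes correct, not just detect, single errors on the fly.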