ISBN (print): 9781538637906
On-demand hardware resource provisioning is an efficient way to save energy in traditional data centers. However, when workloads burst and exceed the capacity of the provisioned resources, a temporary capacity deficit arises, because increasing the quantity of resources takes time; the result is performance degradation. To alleviate this problem, this paper proposes a peak load regulation method that improves the QoS of workloads in traditional energy-efficient DCs. In this method, overloaded workloads (peak loads) are regulated to improve the response time of critical requests and increase the number of QoS-guaranteed requests. Experimental results show that with this method the energy consumption of the data center can be reduced by about 25% compared with the baseline. Moreover, the method significantly improves the QoS of workloads.
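The regulation idea can be sketched as a simple admission policy: when requests exceed the provisioned capacity, critical requests are admitted first and the overflow is deferred. This is a toy illustration; the paper's actual regulation method and its parameters are not reproduced here, so the priority scheme below is an assumption.

```python
def regulate(requests, capacity):
    """Admit at most `capacity` requests in the current slot, critical
    requests first; the overflow (the peak load) is deferred. A toy
    admission policy -- an assumption, not the paper's actual method."""
    admitted, deferred = [], []
    for req in sorted(requests, key=lambda r: r["priority"]):  # 0 = critical
        (admitted if len(admitted) < capacity else deferred).append(req)
    return admitted, deferred
```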
ISBN (print): 9781728174457
Many parallel scientific applications spend a significant amount of time reading and writing data files. Collective I/O operations make it possible to optimize the file access of a process group by redistributing data across processes to match the data layout on the file system. In most parallel I/O libraries, collective I/O operations are implemented with the two-phase I/O algorithm, which consists of a communication phase and a file access phase. This paper evaluates various design options for overlapping the two internal cycles of the two-phase I/O algorithm, and explores different data transfer primitives for the shuffle phase, including non-blocking two-sided communication and multiple versions of one-sided communication. The results indicate that overlap algorithms incorporating asynchronous I/O outperform overlapping approaches that rely only on non-blocking communication. In the vast majority of test cases, however, one-sided communication did not lead to performance improvements over two-sided communication.
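A minimal sketch of the two-phase idea, with Python dictionaries standing in for message buffers and a list standing in for the file (an illustrative simplification under assumed data layouts, not any library's actual implementation):

```python
def two_phase_write(local_data, nprocs, total_len):
    """Two-phase collective I/O sketch.
    Phase 1 (communication): each value is routed to the aggregator
    process that owns its contiguous file region.
    Phase 2 (file access): each aggregator writes its region with one
    sequential access instead of many small strided ones."""
    region = total_len // nprocs
    agg_bufs = [dict() for _ in range(nprocs)]
    for rank in range(nprocs):                       # shuffle phase
        for offset, value in local_data[rank].items():
            agg_bufs[offset // region][offset] = value
    out_file = [None] * total_len
    for buf in agg_bufs:                             # file access phase
        for offset in sorted(buf):
            out_file[offset] = buf[offset]
    return out_file
```

The paper's question is how much of phase 1 can be hidden behind phase 2 when both are made asynchronous.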
ISBN (print): 9780769561493
Today's supercomputers are moving towards the deployment of many-core processors such as the Intel Xeon Phi Knights Landing (KNL) to deliver high compute and memory capacity. Applications executing on such many-core platforms with improved vectorization require high memory bandwidth. To improve performance, architectures like Knights Landing include a high-bandwidth but low-capacity in-package high bandwidth memory (HBM) in addition to the high-capacity but low-bandwidth DDR4. Other architectures, such as Nvidia's Pascal GPU, expose similar stacked DRAM. On nodes with heterogeneous memory types, efficient allocation and data movement can yield improved performance and energy savings if data requests are served from the high bandwidth memory. In this paper, we propose a memory-heterogeneity-aware runtime system that guides data prefetch and eviction so that data can be accessed at high bandwidth even for applications whose entire working set does not fit within the high bandwidth memory and whose data must therefore be moved among memory types. We implement a runtime-managed data movement mechanism that allows applications to run efficiently on architectures with a heterogeneous memory hierarchy with trivial code changes. We show up to 2x improvement in execution time for Stencil3D and Matrix Multiplication, two important HPC kernels.
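The staging-and-eviction policy can be caricatured as a small software-managed cache. This is a hypothetical simplification: the real runtime system prefetches ahead of use and manages HBM versus DDR4 allocations, not Python dictionaries.

```python
from collections import OrderedDict

class FastMemManager:
    """Toy runtime that stages data blocks into a small 'fast memory'
    on access and evicts the least-recently-used block when capacity
    is exceeded (illustrative stand-in for HBM management)."""

    def __init__(self, capacity_blocks):
        self.capacity = capacity_blocks
        self.fast = OrderedDict()          # block_id -> data, LRU order
        self.evictions = 0

    def access(self, block_id, slow_mem):
        if block_id in self.fast:          # served from fast memory
            self.fast.move_to_end(block_id)
        else:                              # stage in from slow memory
            if len(self.fast) >= self.capacity:
                self.fast.popitem(last=False)   # evict LRU block
                self.evictions += 1
            self.fast[block_id] = slow_mem[block_id]
        return self.fast[block_id]
```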
ISBN (print): 9780769561493
The subgraph enumeration problem asks us to find all subgraphs of a target graph that are isomorphic to a given pattern graph. Determining whether even one such isomorphic subgraph exists is NP-complete, and therefore finding all such subgraphs (if they exist) is a time-consuming task. Subgraph enumeration has applications in many fields, including biochemistry and social networks; interestingly, the fastest algorithms for solving the problem on biochemical inputs are sequential. Since they depend on depth-first tree traversal, an efficient parallelization is far from trivial. Nevertheless, since important applications produce data sets of increasing difficulty, parallelism seems beneficial. We thus present a shared-memory parallelization of the state-of-the-art subgraph enumeration algorithms RI and RI-DS (a variant of RI for dense graphs) by Bonnici et al. [BMC Bioinformatics, 2013]. Our strategy uses work stealing, and our implementation demonstrates a significant speedup on real-world biochemical data despite a highly irregular data access pattern. We also improve RI-DS by pruning the search space better; this further improves the empirical running times compared with the already highly tuned RI-DS.
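The depth-first search underlying this family of algorithms can be sketched as plain backtracking (a simplified assumption-laden sketch: RI additionally orders pattern nodes to prune earlier, and the paper distributes the resulting irregular search tree with work stealing):

```python
def enumerate_subgraphs(pattern, target):
    """Enumerate all mappings of `pattern` onto subgraphs of `target`
    (graphs given as adjacency dicts: node -> set of neighbors)."""
    order = list(pattern)          # fixed matching order of pattern nodes
    results = []

    def extend(mapping):
        if len(mapping) == len(order):
            results.append(dict(mapping))
            return
        u = order[len(mapping)]
        for v in target:
            if v in mapping.values():
                continue
            # every already-mapped pattern neighbor of u must map to a
            # target neighbor of v, otherwise this branch is pruned
            if all(mapping[w] in target[v] for w in pattern[u] if w in mapping):
                mapping[u] = v
                extend(mapping)
                del mapping[u]

    extend({})
    return results
```

The subtrees spawned at each `extend` call vary wildly in size, which is exactly why static partitioning fails and work stealing is needed.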
ISBN (print): 9781424416936
Significant performance gains have been reported by exploiting the specialized characteristics of hybrid computing architectures for a number of streaming applications. While it is straightforward to physically construct these hybrid systems, application development is often quite difficult. We have built an application development environment, Auto-Pipe, that targets streaming applications deployed on hybrid architectures. Here, we describe some of the current and future characteristics of the Auto-Pipe environment that facilitate an understanding of the performance of an application that is deployed on a hybrid system.
ISBN (print): 9781479986484
With the increased failure rate expected in future extreme-scale supercomputers, process replication might become a viable alternative to checkpointing. By default, the workload efficiency of replication is limited to 50% because of the additional resources used to execute the replicas of the application's processes. In this paper, we introduce intra-parallelization, a solution that avoids replicating all computation by introducing work-sharing between replicas. We show on a representative set of benchmarks that intra-parallelization achieves more than 50% efficiency without compromising fault tolerance.
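The work-sharing idea can be illustrated in a few lines: instead of both replicas computing every element, each computes half and the partial results are exchanged. This is an illustrative simplification under assumed even/odd splitting; the paper's mechanism operates on MPI processes, not Python lists.

```python
def intra_parallelized_map(f, xs):
    """Work-sharing between two replicas: replica 0 computes the even
    indices, replica 1 the odd indices, and the partial results are
    exchanged so both replicas end with the full output."""
    replica0 = {i: f(x) for i, x in enumerate(xs) if i % 2 == 0}
    replica1 = {i: f(x) for i, x in enumerate(xs) if i % 2 == 1}
    # exchange phase: each replica receives the half it did not compute
    full = [None] * len(xs)
    for partial in (replica0, replica1):
        for i, y in partial.items():
            full[i] = y
    return full
```

Each replica now performs roughly half the computation, which is what lifts efficiency above the 50% bound of plain replication.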
ISBN (print): 9780769561493
The majority of mainstream programming languages support parallel computing via extended libraries that require restructuring of sequential code. Library-based features are portable, but tend to be verbose and usually reduce the understandability and modifiability of code. In contrast, approaches based on language constructs promote simple code structures, hide the complexity of parallelization and avoid boilerplate code. However, language constructs normally impose additional development concepts and compilation requirements that may sacrifice ease of use and portability. Therefore, frameworks that offer simple and intuitive concepts, and constructs recognized by a language's standard compiler, can gain priority over other approaches. In this paper we discuss @PT (Annotation Parallel Task), a parallel programming framework that uses Java annotations, which are standard Java components, as its language constructs. @PT takes an intuitive object-oriented approach to the asynchronous execution of tasks, with a special focus on GUI-responsive applications. This paper presents the annotation-based programming interface of the framework and its fundamental parallelization concepts. Furthermore, it studies @PT in different parallel programming patterns, and evaluates its efficiency by comparing @PT with other Java parallelization approaches on a set of standard benchmarks.
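As a rough analog of the annotation-based style (sketched in Python, since @PT itself is a Java framework and its actual annotation names are not reproduced here), a decorator can mark a function for asynchronous execution while the call site keeps its sequential shape:

```python
from concurrent.futures import ThreadPoolExecutor

_pool = ThreadPoolExecutor(max_workers=4)

def task(fn):
    """Mark a function as an asynchronous task: calling it submits the
    work to a pool and immediately returns a future (a hypothetical
    Python stand-in for @PT's Java annotations)."""
    def submit(*args, **kwargs):
        return _pool.submit(fn, *args, **kwargs)
    return submit

@task
def render_thumbnail(size):
    # stand-in for long-running work kept off the GUI thread
    return f"thumbnail@{size}px"

future = render_thumbnail(64)       # returns immediately
result = future.result()            # block only when the value is needed
```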
ISBN (print): 9780769546766
Computers with hardware accelerators, also referred to as hybrid-core systems, speed up applications by offloading certain compute operations that can run faster on accelerators. Thus, it is not surprising that many Top500 supercomputers use accelerators. However, in addition to the procurement cost, significant programming and porting effort is required to realize the potential benefit of such accelerators. Hence, before building such a system, it is prudent to answer the question: what is the projected performance benefit from accelerators for the workloads of interest? We address this question by way of a performance-modeling framework that predicts realizable application performance on accelerators rapidly and accurately, without the considerable effort of porting and tuning. The framework first automatically identifies compute patterns commonly found in scientific applications, which we term idioms, that may benefit from accelerator technology. Next, the framework models the predicted speedup of those idioms if they were ported to and run on hardware accelerators. As a proof of concept, we characterize two kinds of accelerators: 1) the FPGA accelerators of a Convey HC-1 system, and 2) an NVIDIA Fermi GPU accelerator. We model the performance of the gather/scatter and stream idioms, and our predictions show that where these occur in two full-scale HPC applications, MILC and HYCOM, gather/scatter speeds up by as much as 15x and stream by as much as 14x, while the overall compute time of MILC improves by 3.4% and that of HYCOM by 20%.
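The two modeling steps can be caricatured in a few lines: classify a memory-access trace as a stream or gather/scatter idiom, then project whole-application speedup with an Amdahl-style formula. Both functions are illustrative assumptions, far simpler than the paper's framework.

```python
def classify_idiom(indices):
    """Label an access trace: unit-stride accesses form a 'stream'
    idiom; anything irregular is treated as 'gather/scatter'."""
    strides = {b - a for a, b in zip(indices, indices[1:])}
    return "stream" if strides <= {1} else "gather/scatter"

def predicted_app_speedup(idiom_fraction, idiom_speedup):
    """Amdahl-style projection: only the idiom's share of the runtime
    accelerates, which is why a 15x idiom speedup can translate into
    only a few percent at the application level."""
    return 1.0 / ((1.0 - idiom_fraction) + idiom_fraction / idiom_speedup)
```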
ISBN (print): 9781538643686
The sophisticated nature of parallel computing concepts makes parallel programming challenging. This has encouraged higher-level frameworks that conceal much of the complexity behind abstraction layers. Paradigms in this category are mostly performance-centric and pay less attention to the robustness of asynchronous executions, while current applications demand consistency in addition to fast performance. Therefore, programming environments that offer high-level support for asynchronous exception handling will have a higher chance of popularity. This paper discusses our latest enhancements to @PT, a parallel programming environment based on Java annotations. The proposed concept promotes the robustness of parallelized programs by adhering to the familiar exception-handling standards of sequential code and reducing asynchronous execution concerns at the API level. This study suggests that the concept simplifies the efficient management of asynchronous exceptions, which remains a challenge in parallel programming.
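The problem can be shown with a small Python sketch: an exception raised inside an asynchronous task stays hidden until its future is inspected. Registering a handler up front preserves the shape of sequential try/except code. Python futures stand in for @PT's Java tasks here; the handler-registration style is an assumption, not the framework's API.

```python
from concurrent.futures import ThreadPoolExecutor

def fetch(url):
    # a task whose failure would normally surface far from the call site
    raise ConnectionError(f"cannot reach {url}")

def run_with_handler(fn, arg, on_error):
    """Run `fn` asynchronously, routing any exception to a handler
    registered at submission time instead of scattering checks over
    the program."""
    with ThreadPoolExecutor(max_workers=1) as pool:
        future = pool.submit(fn, arg)
        exc = future.exception()       # waits; None if the task succeeded
        return on_error(exc) if exc is not None else future.result()
```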
ISBN (print): 9781538639146
We discuss early results with Toucan, a source-to-source translator that automatically restructures C/C++ MPI applications to overlap communication with computation. We co-designed the translator and runtime system to enable dynamic, dependence-driven execution of MPI applications, requiring only a modest amount of programmer annotation. Co-design was essential to realizing overlap through dynamic code block reordering and to avoiding the limitations of static code relocation and inlining. We demonstrate that Toucan hides significant communication in four representative applications running on up to 24K cores of NERSC's Edison platform. Using Toucan, we have hidden from 33% to 85% of the communication overhead, with performance meeting or exceeding that of painstakingly hand-written overlap variants.
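The kind of restructuring involved can be illustrated with a hand-written pipeline in which the next block's "communication" is launched before computing on the current block (a Python sketch with a thread pool standing in for MPI; Toucan derives such reordering automatically for C/C++ from dependence annotations):

```python
from concurrent.futures import ThreadPoolExecutor

def exchange(block):
    """Stand-in for an MPI communication step, e.g. a halo exchange."""
    return [x + 100 for x in block]

def compute(block):
    """Stand-in for the local computation on an exchanged block."""
    return sum(block)

def overlapped_pipeline(blocks):
    """Keep the next exchange in flight while computing on the current
    block, so communication is hidden behind computation."""
    results = []
    with ThreadPoolExecutor(max_workers=1) as pool:
        fut = pool.submit(exchange, blocks[0])    # start first exchange
        for i in range(len(blocks)):
            ready = fut.result()                  # wait for this block
            if i + 1 < len(blocks):               # overlap: next comm runs
                fut = pool.submit(exchange, blocks[i + 1])
            results.append(compute(ready))        # ... during this compute
    return results
```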