ISBN (e-book): 9783319527093
ISBN (print): 9783319527093; 9783319527086
Programmers are faced with many challenges in obtaining performance on machines with increasingly capable, yet increasingly complex, hardware. A trend towards task-parallel and asynchronous many-task programming models aims to alleviate the burden of parallel programming on a vast array of current and future platforms. One such model, Concurrent Collections (CnC), provides a programming paradigm that emphasizes the separation of concerns: domain experts concentrate on their algorithms and correctness, whereas performance experts handle mapping and tuning to a target platform. A deep understanding of parallel constructs and behavior is not necessary to write parallel applications that run on various multi-threaded and multi-core platforms when using the CnC model. However, performance can vary greatly depending on the granularity of the tasks and data declared by the programmer. These program-specific decisions are not part of the CnC tuning capabilities and must be tuned within the program itself. We analyze the performance behavior of a CnC implementation of the LULESH application as the elements in each collection are tuned, and demonstrate the effects of different techniques for modifying task and data granularity in CnC collections. Our fully tiled CnC implementation outperforms its OpenMP counterpart by 3x on 48 processors. Finally, we propose guidelines for emulating the techniques used to obtain high performance while improving programmability.
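The granularity trade-off described here can be pictured without CnC. The sketch below is plain C++ with OpenMP tasks, not the paper's CnC code: it coarsens per-element work into one task per tile, the same kind of tiling decision the paper tunes; the tile size B and the element update are illustrative placeholders.

```cpp
// Hypothetical illustration of coarsening task granularity by tiling (compile with -fopenmp).
#include <vector>
#include <cstddef>

void update_tiled(std::vector<double>& a, std::size_t n, std::size_t B) {
    #pragma omp parallel
    #pragma omp single
    for (std::size_t start = 0; start < n; start += B) {
        std::size_t end = start + B < n ? start + B : n;
        // One task per tile of B elements instead of one task per element,
        // trading scheduling overhead against available parallelism.
        #pragma omp task firstprivate(start, end) shared(a)
        for (std::size_t i = start; i < end; ++i)
            a[i] = 0.5 * a[i] + 1.0;    // placeholder element update
    }
    // The implicit barrier at the end of the parallel region waits for all tasks.
}
```

Choosing B is exactly the kind of program-specific decision the abstract points out: too small and scheduling overhead dominates, too large and parallelism is lost.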
In an introductory computational physics class of the type that many of us give, time constraints lead to hard choices on topics. Everyone likes to include their own research in such a class, but an overview of many areas is paramount. Parallel programming with MPI is one important topic. Both the principles and the need to break the "fear barrier" of using a large machine with a queuing system via ssh must be successfully passed on. Due to the plateau in chip development and to power considerations, future HPC hardware choices will include heavy use of GPUs, so the need to introduce these at the level of an introductory course has arisen. Just as for parallel coding, an explanation of the benefits and simple examples to guide the hesitant first-time user should be selected. Several student projects using GPUs that include how-to pages were proposed at the Technion. Two of the more successful ones were a lattice Boltzmann code and a finite element code, and we present these in detail.
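For readers who want a concrete idea of the "simple examples to guide the hesitant first-time user", a minimal MPI program of the usual classroom kind is sketched below; the file name and build line in the comments are assumptions, not taken from the course.

```cpp
// Minimal MPI "first contact" example.
// Compile: mpicxx hello_mpi.cpp -o hello_mpi     Run: mpirun -np 4 ./hello_mpi
#include <mpi.h>
#include <cstdio>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);                  // start the MPI runtime
    int rank = 0, size = 0;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);    // this process's id
    MPI_Comm_size(MPI_COMM_WORLD, &size);    // total number of processes
    std::printf("Hello from rank %d of %d\n", rank, size);
    MPI_Finalize();                          // shut the runtime down cleanly
    return 0;
}
```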
ISBN (print): 9780769561493
In this paper, we provide a comparison of the language features and runtime systems of commonly used threading parallel programming models for high performance computing, including OpenMP, Intel Cilk Plus, Intel TBB, OpenACC, Nvidia CUDA, OpenCL, C++11 and PThreads. We then report our performance comparison of OpenMP, Cilk Plus and C++11 for data and task parallelism on CPUs using benchmarks. The results show that performance varies with factors such as runtime scheduling strategies, the overhead of enabling parallelism and synchronization, load balancing, and the uniformity of task workloads among threads. Our study summarizes and categorizes the latest development of threading programming APIs for supporting existing and emerging computer architectures, and provides tables comparing the features of the different APIs. It can serve as a guide for users choosing an API for their applications according to the features, interfaces and performance reported.
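To make the kind of comparison concrete, the sketch below writes the same vector-scaling kernel once with an OpenMP parallel for and once with hand-partitioned C++11 threads; it is an illustrative stand-in, not one of the paper's benchmarks.

```cpp
// The same data-parallel kernel expressed in two of the compared models (compile with -fopenmp).
#include <algorithm>
#include <cstddef>
#include <thread>
#include <vector>

void scale_openmp(std::vector<double>& v, double s) {
    #pragma omp parallel for                       // schedule chosen by the OpenMP runtime
    for (std::size_t i = 0; i < v.size(); ++i) v[i] *= s;
}

void scale_threads(std::vector<double>& v, double s, unsigned nthreads) {
    if (nthreads == 0) nthreads = 1;
    std::vector<std::thread> pool;
    std::size_t chunk = (v.size() + nthreads - 1) / nthreads;
    for (unsigned t = 0; t < nthreads; ++t) {
        std::size_t lo = t * chunk;
        std::size_t hi = std::min(v.size(), lo + chunk);
        // Fixed block partitioning done by hand; differences in scheduling and
        // thread-management overhead are among the factors the study measures.
        pool.emplace_back([&v, s, lo, hi] {
            for (std::size_t i = lo; i < hi; ++i) v[i] *= s;
        });
    }
    for (auto& th : pool) th.join();
}
```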
ISBN (print): 9781538626528
The ubiquity of multi- and many-core processors means that many general-purpose programmers are beginning to face the difficult task of using runtime systems designed for large-scale parallelism. Not only do they have to find and exploit irregular parallelism through tasking, but they also have to deal with runtime systems that require expert tuning of task granularity and scheduling for performance. This paper provides hands-on experience to help programmers select an appropriate tasking model and design their programs. It investigates the scheduling strategies of three different runtime tasking models: Cilk, OpenMP and High Performance ParalleX (HPX-5). Six simple benchmarks are used to expose how well each runtime performs when given untuned implementations of irregular code fragments. The benchmarks, which have irregular and dynamic structures, provide information about the pros and cons of each system's runtime model, particularly the differences that help-first and work-first scheduling present to the programmer.
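The help-first versus work-first distinction is easiest to see on an untuned recursive benchmark. The sketch below is a deliberately cutoff-free Fibonacci written with OpenMP tasks, in the spirit of such benchmarks but not taken from the paper.

```cpp
// Untuned recursive task benchmark: every call spawns a task, no granularity cutoff.
#include <cstdio>

long fib(int n) {
    if (n < 2) return n;
    long x, y;
    #pragma omp task shared(x)
    x = fib(n - 1);       // spawned child: a work-first runtime dives into it,
                          // a help-first runtime queues it for stealing
    y = fib(n - 2);       // continuation executed by the spawning thread
    #pragma omp taskwait  // wait for the child before combining results
    return x + y;
}

int main() {
    long r = 0;
    #pragma omp parallel
    #pragma omp single
    r = fib(30);
    std::printf("fib(30) = %ld\n", r);
    return 0;
}
```

With no cutoff, the cost of the scheduling policy itself dominates, which is precisely what makes such untuned codes useful for exposing differences between runtimes.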
ISBN (print): 9780769561493
Cost models play an important role in the efficient implementation of software systems. These models can be embedded in operating systems and execution environments to optimize execution at run time. Even though non-uniform memory access (NUMA) architectures dominate today's server landscape, there is still a lack of parallel cost models that represent NUMA systems sufficiently. Therefore, the existing NUMA models are analyzed, and a two-step performance assessment strategy is proposed that incorporates low-level hardware counters as performance indicators. To support the two-step strategy, multiple tools are developed, each accumulating and enriching specific hardware event counter information, to explore, measure, and visualize these low-overhead performance indicators. The tools are showcased and discussed alongside specific experiments in the realm of performance assessment.
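As a rough illustration of the locality effect such a cost model has to capture, the sketch below times the same traversal over memory bound to each NUMA node using libnuma; the paper's own tooling relies on hardware event counters instead, and the buffer size here is an arbitrary choice. Link with -lnuma and pin the thread (e.g. via numactl) to decide which node counts as local.

```cpp
// Time a read traversal over memory placed on each NUMA node in turn.
#include <numa.h>
#include <chrono>
#include <cstddef>
#include <cstdio>

int main() {
    if (numa_available() < 0) { std::puts("no NUMA support"); return 1; }
    const std::size_t n = std::size_t(1) << 24;                 // 16 Mi doubles (128 MiB)
    for (int node = 0; node <= numa_max_node(); ++node) {
        double* buf = static_cast<double*>(numa_alloc_onnode(n * sizeof(double), node));
        if (!buf) continue;
        for (std::size_t i = 0; i < n; ++i) buf[i] = 1.0;       // touch pages on that node
        auto t0 = std::chrono::steady_clock::now();
        double sum = 0.0;
        for (std::size_t i = 0; i < n; ++i) sum += buf[i];      // traversal being timed
        auto t1 = std::chrono::steady_clock::now();
        std::printf("node %d: %.2f ms (sum=%g)\n", node,
                    std::chrono::duration<double, std::milli>(t1 - t0).count(), sum);
        numa_free(buf, n * sizeof(double));
    }
    return 0;
}
```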
ISBN (print): 9788394625375
The work describes a flexible framework built to generate various (parallel) software versions and to benchmark them. The framework is written in Python with some support from the gnuplot plotting program. An example use of this tool shows the tuning of a matrix factorization on different architectures (Intel Haswell and Intel Knights Corner) with various parameters for parallelization, vectorization, blocking, etc.
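The underlying idea, sweeping a tuning parameter over generated variants and timing each one, can be reduced to a few lines. The C++ sketch below varies only a block size for a blocked matrix transpose; the actual framework is written in Python, generates full source variants, and plots the results with gnuplot, so everything here (kernel, sizes, block values) is illustrative.

```cpp
// Benchmark one tuning knob: the block size of a blocked out-of-place transpose.
#include <chrono>
#include <cstddef>
#include <cstdio>
#include <vector>

static void transpose_blocked(const std::vector<double>& a, std::vector<double>& b,
                              std::size_t n, std::size_t B) {
    for (std::size_t ii = 0; ii < n; ii += B)
        for (std::size_t jj = 0; jj < n; jj += B)
            for (std::size_t i = ii; i < ii + B && i < n; ++i)
                for (std::size_t j = jj; j < jj + B && j < n; ++j)
                    b[j * n + i] = a[i * n + j];     // column-wise writes, blocked for cache
}

int main() {
    const std::size_t n = 2048;
    std::vector<double> a(n * n, 1.0), b(n * n, 0.0);
    for (std::size_t B : {16u, 32u, 64u, 128u, 256u}) {       // the swept parameter
        auto t0 = std::chrono::steady_clock::now();
        transpose_blocked(a, b, n, B);
        auto t1 = std::chrono::steady_clock::now();
        std::printf("B=%zu: %.2f ms\n", B,
                    std::chrono::duration<double, std::milli>(t1 - t0).count());
    }
    return 0;
}
```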
ISBN (print): 9781538640081
String matching refers to the search for each and every occurrence of a string in another string. Nowadays, this problem arises in a great many areas, from standard programs for text editing and processing, through databases, to various applications in other sciences. There are numerous efficient algorithms to solve this problem. One of them is the Rabin-Karp algorithm, which has a complexity of O(m(n-m+1)), whereas the complexity of the proposed advanced Rabin-Karp algorithm is O(n-m). However, the main focus of this research is to apply the concepts of parallelism to improve the performance of the algorithm. There are many parallel processing application programming interfaces (APIs) available, such as OpenMP, MPI, CUDA, MapReduce, etc.; of these we have chosen OpenMP and CUDA to achieve parallelism. Comparing the results of the serial and parallel implementations gives us insight into how performance and efficiency are achieved through various techniques of parallelism.
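A hedged sketch of the OpenMP side of such a parallelization follows: the candidate start positions are split into ranges and each thread runs its own rolling hash over its range. The base, modulus and number of chunks are illustrative choices, not taken from the paper, and the CUDA variant would partition the positions analogously.

```cpp
// Rabin-Karp over disjoint ranges of start positions, one rolling hash per range
// (compile with -fopenmp).
#include <cstddef>
#include <string>
#include <vector>

std::vector<std::size_t> rk_search(const std::string& text, const std::string& pat) {
    const long long d = 256, q = 1000000007LL;       // base and modulus (illustrative)
    const std::size_t n = text.size(), m = pat.size();
    std::vector<std::size_t> hits;
    if (m == 0 || n < m) return hits;
    const unsigned char* t = reinterpret_cast<const unsigned char*>(text.data());
    const unsigned char* p = reinterpret_cast<const unsigned char*>(pat.data());

    long long ph = 0, hmax = 1;                      // pattern hash and d^(m-1) mod q
    for (std::size_t i = 0; i + 1 < m; ++i) hmax = hmax * d % q;
    for (std::size_t i = 0; i < m; ++i) ph = (ph * d + p[i]) % q;

    const std::size_t nchunks = 8;                   // fixed split of start positions
    #pragma omp parallel
    {
        std::vector<std::size_t> local;
        #pragma omp for schedule(static) nowait
        for (int c = 0; c < (int)nchunks; ++c) {
            std::size_t lo = std::size_t(c) * (n - m + 1) / nchunks;
            std::size_t hi = std::size_t(c + 1) * (n - m + 1) / nchunks;
            if (lo >= hi) continue;
            long long h = 0;                         // rolling hash local to this range
            for (std::size_t i = lo; i < lo + m; ++i) h = (h * d + t[i]) % q;
            for (std::size_t s = lo; s < hi; ++s) {
                if (h == ph && text.compare(s, m, pat) == 0) local.push_back(s);
                if (s + 1 < hi)                      // slide the window by one position
                    h = ((h - t[s] * hmax % q + q) % q * d + t[s + m]) % q;
            }
        }
        #pragma omp critical
        hits.insert(hits.end(), local.begin(), local.end());   // unordered across chunks
    }
    return hits;
}
```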
ISBN (print): 9780983567875
The FMCAD Student Forum provides a platform for graduate students at any career stage to introduce their research to the wider Formal Methods community and solicit feedback. In 2017, the event took place in Vienna, Austria, as an integral part of the FMCAD conference. Thirteen students were invited to give a short talk and present a poster illustrating their work. The presentations covered a broad range of topics in the field of verification, such as automated reasoning; model checking of hardware, software, and parameterized systems; verification of concurrent programs; and checking of floating-point properties.
The BSP model (Bulk Synchronous Parallel) simplifies the construction and evaluation of parallel algorithms with its simplified synchronization structure and cost model. Nevertheless, imperative BSP programs can suffer from synchronization errors. Programs with textually aligned barriers are free from such errors, and this structure eases program comprehension. We propose a simplified formalization of barrier inference as a data flow analysis that statically verifies whether an imperative BSP program has replicated synchronization, a sufficient condition for textual barrier alignment.
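The property being verified can be illustrated with the classic BSPlib C interface (an assumption here; the analysis targets imperative BSP programs in general, not this library): a barrier guarded by a replicated condition is textually aligned, while one guarded by the process id is not.

```cpp
// Both functions are assumed to run inside an SPMD section (bsp_begin/bsp_end).
#include <bsp.h>

void replicated_sync() {
    // Replicated synchronization: the condition evaluates identically on every
    // process, so all processes reach the same textual bsp_sync() call.
    if (bsp_nprocs() > 1) {
        /* ... superstep work ... */
        bsp_sync();
    }
}

void misaligned_sync() {
    // Not replicated: only process 0 executes this bsp_sync(), so the other
    // processes wait at a different barrier (or none) -- the synchronization
    // error the static analysis is designed to rule out.
    if (bsp_pid() == 0) {
        bsp_sync();
    }
}
```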
ISBN (print): 9783319654829; 9783319654812
We describe our approach to augmenting the BEAGLE library for high-performance statistical phylogenetic inference to support concurrent computation of independent partial-likelihood arrays. Our solution involves identifying independent likelihood estimates in analyses of partitioned datasets and in proposed tree topologies, and configuring concurrent computation of these likelihoods via the CUDA and OpenCL frameworks. We evaluate the effect of each increase in concurrency on the throughput of our partial-likelihood kernel for a four-state nucleotide substitution model on a variety of parallel computing hardware, such as NVIDIA and AMD GPUs and Intel multicore CPUs, observing up to 16-fold speedups over our previous implementation. Finally, we evaluate the effect of these gains on a domain application program, MrBayes. For a partitioned nucleotide-model analysis we observe an average speedup in overall run time of 2.1-fold over our previous parallel implementation and 10-fold over native MrBayes with SSE.
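The concurrency pattern, evaluating the likelihoods of independent partitions at the same time and combining them afterwards, can be sketched on the CPU with standard C++. The code below is such an analogue only, not BEAGLE's CUDA/OpenCL implementation, and the partition contents and the per-partition function are placeholders.

```cpp
// CPU analogue: evaluate independent partition likelihoods concurrently.
#include <cstdio>
#include <functional>
#include <future>
#include <numeric>
#include <vector>

// Stand-in for a per-partition partial-likelihood evaluation.
double partition_log_likelihood(const std::vector<double>& site_terms) {
    return std::accumulate(site_terms.begin(), site_terms.end(), 0.0);
}

int main() {
    std::vector<std::vector<double>> partitions = {
        std::vector<double>(1000, -0.1),
        std::vector<double>(2000, -0.2),
        std::vector<double>(1500, -0.3),
    };
    std::vector<std::future<double>> jobs;
    for (const auto& p : partitions)                        // one concurrent evaluation
        jobs.push_back(std::async(std::launch::async,       // per independent partition
                                  partition_log_likelihood, std::cref(p)));
    double total = 0.0;
    for (auto& j : jobs) total += j.get();                  // combine when all have finished
    std::printf("total log-likelihood: %f\n", total);
    return 0;
}
```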