Chapel's high-level data-parallel constructs make parallel programming productive for general programmers. This talk introduces the 'Chapel on Accelerators' project, which proposes compiler enhancements to...
ISBN (print): 9780738143057
Peachy Parallel Assignments are high-quality assignments for teaching parallel and distributed computing. They are selected competitively for presentation at the Edu* workshops. All of the assignments have been successfully used in class, and they are selected based on their ease of adoption by other instructors and for being cool and inspirational to students. This paper presents a paper-and-pencil assignment asking students to analyze the performance of different system configurations, and an assignment in which students parallelize a simulation of the evolution of simple living organisms.
ISBN (print): 9783030238735; 9783030238728
The Cellular Potts Model (CPM) is a mathematical model used to simulate biological systems across a wide range of scales, from cells to organs. The model uses a Monte Carlo approach to determine, for each cell, a new state and actions such as mitosis, movement, or the emission of pseudopods. The literature shows multiple implementations of the CPM, some incorporating parallel processing. These works use a data-division approach that requires taking locks on data structures or spreading information between tasks, slowing down simulations. This work proposes a fast CPM implementation that uses software transactional memory to synchronize parallel tasks and applies it to ductal carcinoma in situ (DCIS) of the breast. Execution times and speedups are calculated; results show appreciable speedups.
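For readers unfamiliar with the model, the following is a minimal single-threaded sketch in Python/NumPy of the core CPM Monte Carlo step, assuming a plain adhesion-energy Hamiltonian and Metropolis acceptance. It is only an illustration of the update the abstract refers to; the paper's contribution is the software-transactional-memory synchronization around such updates, which is not shown here.

```python
import numpy as np

def adhesion_energy(lattice, x, y, cell_id, J=1.0):
    """Local adhesion energy: cost J per 4-neighbour holding a different cell id."""
    n, m = lattice.shape
    e = 0.0
    for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        if lattice[(x + dx) % n, (y + dy) % m] != cell_id:
            e += J
    return e

def cpm_copy_attempt(lattice, temperature, rng):
    """One Metropolis copy attempt: a random site tries to adopt the cell id of a
    random neighbour; the change is accepted if it lowers the energy, or with
    Boltzmann probability exp(-dH/T) otherwise."""
    n, m = lattice.shape
    x, y = rng.integers(n), rng.integers(m)
    dx, dy = ((1, 0), (-1, 0), (0, 1), (0, -1))[rng.integers(4)]
    nx, ny = (x + dx) % n, (y + dy) % m
    new_id = lattice[nx, ny]
    if new_id == lattice[x, y]:
        return False
    d_h = (adhesion_energy(lattice, x, y, new_id)
           - adhesion_energy(lattice, x, y, lattice[x, y]))
    if d_h <= 0 or rng.random() < np.exp(-d_h / temperature):
        lattice[x, y] = new_id
        return True
    return False

rng = np.random.default_rng(0)
lattice = rng.integers(0, 4, size=(64, 64))   # toy lattice with 4 cell ids
for _ in range(10_000):
    cpm_copy_attempt(lattice, temperature=2.0, rng=rng)
```

In a parallel implementation, concurrent copy attempts touching neighbouring sites are exactly the conflicts that locks or, as in this work, transactional memory must resolve.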
ISBN (print): 9780738110868
Python has been gaining traction for years in the world of scientific applications. However, the high-level abstraction it provides may not allow developers to use machines to their peak performance. To address this, multiple strategies, sometimes complementary, have been developed to enrich the software ecosystem, either by relying on additional libraries dedicated to efficient computation (e.g., NumPy) or by providing a framework to better use HPC-scale infrastructures (e.g., PyCOMPSs). In this paper, we present a Python extension based on SharedArray that enables support for system-provided shared memory, and its integration into the PyCOMPSs programming model as an example of integration into a complex Python environment. We also evaluate the impact such a tool may have on performance in two types of distributed execution flows: one for linear algebra, with a blocked matrix multiplication application, and the other in the context of data clustering, with a k-means application. We show that, with very little modification of the original decorator of the task-based application (3 lines of code to be modified), the performance gain can rise above 40% for tasks relying heavily on data reuse in a distributed environment, especially when loading the data is prominent in the execution time.
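As a rough illustration of the idea, the sketch below uses only the Python standard library (multiprocessing.shared_memory) rather than the SharedArray package or the PyCOMPSs @task decorator the paper actually extends: blocks of a matrix live in a system shared-memory segment, so tasks placed on the same node can access them without serialization or copies. Names and functions here are hypothetical, not the paper's API.

```python
import numpy as np
from multiprocessing import shared_memory

def create_shared_block(name, shape, dtype=np.float64):
    """Allocate a system shared-memory segment and view it as a NumPy array."""
    nbytes = int(np.prod(shape)) * np.dtype(dtype).itemsize
    shm = shared_memory.SharedMemory(name=name, create=True, size=nbytes)
    return shm, np.ndarray(shape, dtype=dtype, buffer=shm.buf)

def attach_shared_block(name, shape, dtype=np.float64):
    """Attach to an existing segment without copying the data."""
    shm = shared_memory.SharedMemory(name=name)
    return shm, np.ndarray(shape, dtype=dtype, buffer=shm.buf)

def matmul_block_task(a_name, b_name, shape):
    """A 'task' multiplying one block pair; with shared memory, a worker on the
    same node reads A and B in place instead of receiving serialized copies."""
    shm_a, a = attach_shared_block(a_name, shape)
    shm_b, b = attach_shared_block(b_name, shape)
    c = a @ b
    del a, b                      # drop the views before closing the segments
    shm_a.close()
    shm_b.close()
    return c

if __name__ == "__main__":
    shape = (512, 512)
    shm_a, a = create_shared_block("blk_a", shape)
    shm_b, b = create_shared_block("blk_b", shape)
    a[:] = np.random.rand(*shape)
    b[:] = np.random.rand(*shape)
    c = matmul_block_task("blk_a", "blk_b", shape)
    print(c.shape)
    del a, b
    for shm in (shm_a, shm_b):
        shm.close()
        shm.unlink()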
ISBN (print): 9781728174457
This talk will present the ongoing work of developing a Chapel implementation of Random Forest, a popular ensemble learning method used both for predictive modeling and for feature selection. Language features in Chapel make it possible to easily express shared-memory and distributed-memory implementations of this algorithm. Furthermore, Chapel's built-in Python interoperability made it easier to implement a Python front end, making the implementation accessible from a language popular among data scientists.
ISBN (print): 9783030483401; 9783030483395
The stream processing paradigm is present in several applications that apply computations over continuous data flowing in the form of streams (e.g., video feeds, image processing, and data analytics). Employing self-adaptivity in stream processing applications can provide higher-level programming abstractions and autonomic resource management. However, there are cases where the performance is suboptimal. In this paper, the goal is to optimize parallelism adaptations in terms of stability and accuracy, which can improve the performance of parallel stream processing applications. Therefore, we present a new optimized self-adaptive strategy that is experimentally evaluated. The proposed solution provided high-level programming abstractions, reduced the adaptation overhead, and achieved performance competitive with the best static executions.
ISBN (print): 9781728174457
Since their introduction, Java streams have been quickly embraced by industry and are now used at large scale. The parallelism they enable is very easy to achieve, but it is constrained either by the underlying parallelism model (in some cases) or by the set of operations that can be specified using streams. In this paper we investigate the possibility of enhancing the types of computation that can be defined with the Java streams API by introducing PowerList-theory-based computation into this infrastructure. Powerlists are recursive data structures that, together with their associated algebraic theory, offer both abstractions that ease the development of parallel applications and a methodology for designing parallel algorithms. The Java streaming infrastructure can be adapted to support them to a great extent. We present such an adaptation, analyse and discuss its advantages and constraints, and illustrate the analysis with application examples.
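The paper itself targets the Java streams API; as a language-neutral illustration of the powerlist idea, here is a small Python sketch, assuming only the standard powerlist constructors (tie = concatenation, zip = interleaving of two lists of equal power-of-two length) and a balanced reduction defined by structural recursion:

```python
def tie(p, q):
    """Powerlist 'tie' constructor: concatenation of two powerlists of equal length."""
    return p + q

def unzip(p):
    """Inverse of the powerlist 'zip' constructor: split into even/odd positions."""
    return p[0::2], p[1::2]

def pl_reduce(op, p):
    """Balanced (divide-and-conquer) reduction over a powerlist.

    A singleton is its own reduction; otherwise reduce both halves of the zip
    decomposition and combine them.  The two recursive calls are independent,
    which is what makes the scheme natural to express as parallel operations
    (e.g., over parallel streams)."""
    if len(p) == 1:
        return p[0]
    evens, odds = unzip(p)
    return op(pl_reduce(op, evens), pl_reduce(op, odds))

if __name__ == "__main__":
    data = list(range(8))                      # length must be a power of two
    assert pl_reduce(lambda a, b: a + b, data) == sum(data)
    assert tie([0, 1], [2, 3]) == [0, 1, 2, 3]
```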
ISBN (print): 9781728174457
A broad set of data science and engineering questions may be organized as graphs, providing a powerful means for describing relational data. Although experts now routinely compute graph algorithms on huge, unstructured graphs using high-performance computing (HPC) or cloud resources, this practice has not yet broken into the mainstream. Such computations require great expertise, yet users often need rapid prototyping and development to quickly customize existing code. Toward that end, we are exploring the use of the Chapel programming language as a means of making some important graph analytics more accessible, examining the breadth of characteristics that would make for a productive programming environment: one that is expressive, performant, portable, and robust. In this talk we describe our early explorations of this space, based on miniTri [4], a miniapp from the Mantevo suite [1], and the mean hitting time algorithm [2], one of the analytics being explored within Grafiki [3], both of which are designed for use on distributed-memory parallel processing environments. These implementations have been posed in terms of key linear algebra operations and algorithms, specifically sparse matrix-matrix multiplication operating on integer datatypes, and the Conjugate Gradient method based on a graph Laplacian matrix.
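To make the linear-algebra formulation concrete, the toy Python/SciPy sketch below (not the talk's Chapel code) shows the two ingredients the abstract names: triangle counting via sparse matrix-matrix products, and a hitting-time-style solve of a grounded graph Laplacian with Conjugate Gradient. The small example graph is my own.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.csgraph import laplacian
from scipy.sparse.linalg import cg

# Toy undirected graph: a ring of 8 vertices plus the chord (0, 2).
edges = [(i, (i + 1) % 8) for i in range(8)] + [(0, 2)]
rows = [u for u, v in edges] + [v for u, v in edges]
cols = [v for u, v in edges] + [u for u, v in edges]
A = sp.csr_matrix((np.ones(len(rows), dtype=np.int64), (rows, cols)), shape=(8, 8))

# Triangle counting via sparse matrix-matrix multiplication:
# trace(A^3) counts each triangle six times in a simple undirected graph.
triangles = (A @ A @ A).diagonal().sum() // 6
print("triangles:", triangles)                       # -> 1 (triangle 0-1-2)

# Expected hitting times to a target vertex t satisfy L_g h = deg on the
# "grounded" Laplacian (target row/column removed); solved here with CG.
L = sp.csr_matrix(laplacian(A.astype(np.float64)))
deg = np.asarray(A.sum(axis=1)).ravel().astype(np.float64)
t = 0
keep = [i for i in range(8) if i != t]
L_g = L[keep][:, keep]                                # grounded Laplacian
h, info = cg(L_g, deg[keep])                          # hitting times to t
print("CG converged:", info == 0)
```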
ISBN (print): 9783030581442; 9783030581435
We present a novel approach to parallelize the SpMV kernel included in the LASs (Linear Algebra routines on OmpSs) library, after a deep review and analysis of several well-known approaches. LASs is based on OmpSs, a task-based runtime that extends OpenMP directives, providing more flexibility to apply new strategies. Based on tasking and nesting, and with the aim of reducing the workload imbalance inherent to the SpMV operation, we present a strategy especially useful for highly imbalanced input matrices. In this approach, the number of created tasks is decided dynamically in order to maximize the use of the platform's resources. Throughout this paper, the behavior of SpMV under each strategy (state-of-the-art and proposed) is analyzed in depth, laying the groundwork for a future auto-tunable code able to select the most suitable approach for a given input matrix. The experiments were carried out on a set of 12 matrices from the SuiteSparse Matrix Collection, all with different sparsity characteristics, on a node of the MareNostrum 4 supercomputer (two Intel Xeon sockets, 24 cores each) and on a node of the Dibona cluster (one ARM ThunderX2 socket with 32 cores). Our tests show that, for Intel Xeon, the best parallelization strategy reduces the execution time of the reference multi-threaded MKL version by up to 67%. On ARM ThunderX2, the reduction is up to 56% with respect to the OmpSs parallel reference.
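The load-balancing idea behind such strategies can be sketched independently of OmpSs: partition the rows of a CSR matrix into blocks with roughly equal nonzero counts (rather than equal row counts), so that each task carries comparable work even when a few rows are much denser than the rest. The Python/SciPy sketch below illustrates that partitioning; it is not the LASs implementation, and the task creation itself (OmpSs pragmas, nesting) is only hinted at in the comments.

```python
import numpy as np
import scipy.sparse as sp

def nnz_balanced_row_blocks(A_csr, num_tasks):
    """Row-block boundaries with roughly equal nonzeros per block.

    Equal-row partitioning leaves tasks imbalanced when a few rows are very
    dense; splitting by cumulative nnz (the CSR indptr array) equalizes work."""
    indptr = A_csr.indptr
    targets = np.linspace(0, indptr[-1], num_tasks + 1)
    boundaries = np.searchsorted(indptr, targets)
    boundaries[0], boundaries[-1] = 0, A_csr.shape[0]
    return np.unique(boundaries)              # drop duplicates (empty blocks)

def spmv_by_blocks(A_csr, x, boundaries):
    """Block-wise SpMV; each (start, end) slice would become one task."""
    y = np.empty(A_csr.shape[0])
    for start, end in zip(boundaries[:-1], boundaries[1:]):
        y[start:end] = A_csr[start:end] @ x
    return y

if __name__ == "__main__":
    A = sp.random(2000, 2000, density=0.01, format="csr", random_state=0)
    x = np.random.default_rng(0).random(2000)
    blocks = nnz_balanced_row_blocks(A, num_tasks=8)
    assert np.allclose(spmv_by_blocks(A, x, blocks), A @ x)
```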
ISBN (print): 9781728199986
GPGPUs are widely used in high-performance computing systems to accelerate scientific and machine learning workloads. Developing efficient GPU kernels is critically important to obtain "bare-metal" performance on GPU-based clusters. In this paper, we describe the design and implementation of GVProf, the first value profiler that pinpoints value-related inefficiencies in applications running on NVIDIA GPU-based clusters. The novelty of GVProf resides in its ability to detect temporal and spatial value redundancies, which provides useful information to guide code optimization. GVProf can monitor production multi-node, multi-GPU executions in clusters. Our experiments with well-known GPU benchmarks and HPC applications show that GVProf incurs acceptable overhead and scales to large executions. Using GVProf, we optimized several HPC and machine learning workloads on one NVIDIA V100 GPU. In one case study of LAMMPS, optimizations based on information from GVProf led to whole-program speedups ranging from 1.37x on a single GPU to 1.08x on 64 GPUs.
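To give an intuition for the two metrics, here is a toy Python sketch of value-redundancy counting over a synthetic access trace. It is emphatically not GVProf, which instruments real GPU binaries; the definitions below are simplified approximations of the ideas named in the abstract.

```python
from collections import defaultdict

def value_redundancy(trace):
    """Simplified redundancy metrics over a trace of access batches.

    * temporal redundancy: an access carries the same value that the same
      address already held on its previous access;
    * spatial redundancy: within one batch (think: one warp-level memory
      operation), several addresses carry the identical value.
    """
    last_value = {}
    temporal = spatial = total = 0
    for batch in trace:
        seen = defaultdict(int)
        for addr, value in batch:
            total += 1
            if last_value.get(addr) == value:
                temporal += 1
            last_value[addr] = value
            seen[value] += 1
        spatial += sum(c - 1 for c in seen.values() if c > 1)
    return temporal / total, spatial / total

# Example: address 0x10 keeps holding 0.0 (temporal redundancy), and the second
# batch reads the same value 5.0 from four different addresses (spatial redundancy).
trace = [
    [(0x10, 0.0), (0x14, 1.0), (0x18, 2.0)],
    [(0x10, 0.0), (0x20, 5.0), (0x24, 5.0), (0x28, 5.0), (0x2C, 5.0)],
]
print(value_redundancy(trace))
```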