检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

215 篇 会议
5 篇 期刊文献
2 册 图书

馆藏范围

222 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

153 篇 工学
- 141 篇 计算机科学与技术...
- 113 篇 软件工程
- 9 篇 电气工程
- 8 篇 电子科学与技术（可...
- 7 篇 生物工程
- 5 篇 控制科学与工程
- 4 篇 信息与通信工程
- 4 篇 生物医学工程（可授...
- 3 篇 机械工程
- 2 篇 动力工程及工程热...
- 2 篇 建筑学
- 1 篇 力学（可授工学、理...
- 1 篇 光学工程
- 1 篇 仪器科学与技术
- 1 篇 土木工程
- 1 篇 化学工程与技术
- 1 篇 安全科学与工程
34 篇 理学
- 21 篇 数学
- 7 篇 生物学
- 6 篇 系统科学
- 4 篇 物理学
- 2 篇 化学
- 1 篇 大气科学
- 1 篇 统计学（可授理学、...
18 篇 管理学
- 11 篇 管理科学与工程(可...
- 7 篇 图书情报与档案管...
- 1 篇 工商管理
5 篇 教育学
- 5 篇 教育学

主题

44 篇 parallel program...
9 篇 parallel process...
8 篇 programming prof...
8 篇 message passing
6 篇 object oriented ...
6 篇 application prog...
6 篇 hardware
5 篇 computer science
5 篇 parallelism
5 篇 pattern recognit...
4 篇 parallel archite...
4 篇 programming
4 篇 graphics process...
4 篇 parallel
4 篇 multicore
4 篇 design patterns
4 篇 dynamic programm...
3 篇 genetic programm...
3 篇 parallel design ...
3 篇 concurrent compu...

机构

3 篇 calvin college g...
2 篇 macalester colle...
2 篇 university of il...
2 篇 vsb tech univ os...
2 篇 nus graduate sch...
2 篇 los alamos natio...
2 篇 univ marburg fac...
2 篇 st. olaf college...
1 篇 univ of naples
1 篇 university of he...
1 篇 computer science...
1 篇 mathematik forsc...
1 篇 university at bu...
1 篇 nanyang technol ...
1 篇 parlab eecs depa...
1 篇 sandia natl labs...
1 篇 stmicroelectroni...
1 篇 department of co...
1 篇 clermont univers...
1 篇 univ politecn ca...

作者

5 篇 wolf felix
3 篇 joel c. adams
3 篇 sirianni marco
3 篇 elizabeth shoop
3 篇 ricca francesco
3 篇 perri simona
2 篇 kale vivek
2 篇 kessler christop...
2 篇 alves tiago a. o...
2 篇 scholz sven-bodo
2 篇 tichy walter f.
2 篇 pulka andrzej
2 篇 hamidouche khale...
2 篇 sanders beverly ...
2 篇 johnson ralph
2 篇 calotoiu alexand...
2 篇 richard a. brown
2 篇 pankratius victo...
2 篇 loogen rita
2 篇 b. di martino

语言

221 篇 英文
1 篇 俄文

检索条件"任意字段=Proceedings of the 2010 Workshop on Parallel Programming Patterns"

共 222 条记录，以下是1-10 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Typed Design patterns for the Functional Era 1

Typed Design Patterns for the Functional Era

引用

1st ACM SIGPLAN International workshop on Functional Software Architecture (FUNARCH) - Functional programming in the Large

作者： Crichton, Will Brown Univ Providence RI 02912 USA

ISBN: (纸本)9798400702976

This paper explores how design patterns could be revisited in the era of mainstream functional programming languages. I discuss the kinds of knowledge that ought to be represented as functional design patterns: architectural concepts that are relatively self-contained, but whose entirety cannot be represented as a language-level abstraction. I present four concrete examples embodying this idea: the Witness, the State Machine, the parallel Lists, and the Registry. Each pattern is implemented in Rust to demonstrate how careful use of a sophisticated type system can better model each domain construct and thereby catch user mistakes at compile-time.

关键词： design patterns domain-driven design rust

来源：评论

学校读者我要写书评

暂无评论

Towards a SYCL API for Approximate Computing 23

Towards a SYCL API for Approximate Computing

引用

International Conference on OpenCL (IWOCL)

作者： Carpentieri, Lorenzo Cosenza, Biagio Univ Salerno Salerno Italy

ISBN: (纸本)9798400707452

Approximate computing is a well-known method [7] to achieve higher performance or lower energy consumption while accepting a loss of output accuracy. Many applications such as image processing and neural networks, are tolerant of a certain amount of error, and have the potential for significant improvements in terms of execution time and energy consumption. The most advanced software approximation techniques are mixed precision, which uses a lower precision data representation for both integer and floating point variables [1, 4]; perforation, which skips instruction blocks in a program, iterations in a loop, or data in buffers assuming that nearby data have similar values [2, 5, 6, 8]; and relaxed synchronization which removes synchronization points that represent one of the major bottleneck in parallel applications [3, 9]. These approximate approaches differ in performance achieved and also in error produced. Usually, perforation and synchronization elision have higher performance compared with mixed precision but produce more errors. In particular, synchronization elision introduces non-deterministic errors that are complex to handle. Support for approximate computing is provided by the SYCL heterogeneous programming model often used for developing portable HPC applications. SYCL supports approximate computing by providing a set of built-in functions and data types that can be used to perform approximate operations, such as half-floating-point reductions and bit-level operations. In this technical talk, we present SYprox, a SYCL-based API supporting a broad set of approximation techniques in modern C++. SYprox introduces a set of semantics that extend SYCL’s buffers and accessors to provide a high-level easy-to-use programming API. It supports data perforation and elision patterns for efficient approximation, as well as signal reconstruction algorithms for error mitigation. Figure 1 (a) depicts the accurate execution of an application while Figure 1 (b) shows the

关键词： Approximate Computing programming Models Data Perforation Data Reconstruction Synchronization Elision

来源：评论

学校读者我要写书评

暂无评论

Ghostwriter: A Cache Coherence Protocol for Error-Tolerant Applications 21

Ghostwriter: A Cache Coherence Protocol for Error-Tolerant A...

引用

50th International Conference on parallel Processing (ICPP)

作者： Kao, Henry San Miguel, Joshua Jerger, Natalie Enright Univ Toronto Toronto ON Canada Univ Wisconsin Madison WI USA Huawei Technol Canada Markham ON Canada

ISBN: (纸本)9781450384414

Coherence induced cache misses are an important aspect limiting the scalability of shared memory parallel programs. Many coherence misses are avoidable, namely misses due to false sharing when different threads write to different memory addresses that are contained within the same cache block causing unnecessary invalidations. Past work has proposed numerous ways to mitigate false sharing from coherence protocols optimized for certain sharing patterns, to software tools for false-sharing detection and repair. Our work leverages approximate computing and store value similarity in error-tolerant multi-threaded applications. We introduce a novel cache coherence protocol which implements an approximate store instruction and coherence states to allow some limited incoherence within approximatable shared data to mitigate both coherence misses and coherence traffic for various sharing patterns. For applications from the Phoenix and AxBench suites, we see dynamic energy improvements within the NoC and memory hierarchy of up to 50.1% and speedup of up to 37.3% with low output error for approximate applications that exhibit false sharing.

关键词： Cache Coherence Approximate Computing parallel programming

来源：评论

学校读者我要写书评

暂无评论

Evolutionary Acquisition of Multiple TTSP Graph patterns with Wildcards by Clustering TTSP Graphs 12

Evolutionary Acquisition of Multiple TTSP Graph Patterns wit...

引用

12th IEEE International workshop on Computational Intelligence and Applications, IWCIA 2021

作者： Kawasaki, Yuma Miyahara, Tetsuhiro Kuboyama, Tetsuji Suzuki, Yusuke Uchida, Tomoyuki Hiroshima City University Graduate School of Information Sciences Hiroshima731-3194 Japan Gakushuin University Computer Centre Tokyo171-8588 Japan

ISBN: (纸本)9781665444255

Knowledge acquisition from graph structured data is an important task in machine learning and data mining. TTSP (Two-Terminal Series parallel) graphs are used as data models for electric networks and scheduling. We propose a multiple TTSP graph pattern, which is a finite set of TTSP graph patterns, with wildcards and give a clustering procedure of TTSP graphs. Then we propose an evolutionary learning method for obtaining characteristic multiple TTSP graph patterns with wildcards, from positive and negative TTSP graph data by clustering positive TTSP graphs. Experimental results show that our proposed evolutionary learning method obtains characteristic multiple TTSP graph patterns with wildcards. © 2021 IEEE.

关键词： Genetic programming

来源：评论

学校读者我要写书评

暂无评论

Towards Generic parallel programming in Computer Science Education with Kokkos

Towards Generic Parallel Programming in Computer Science Edu...

引用

workshop on Education for High Performance Computing (EduHPC)

作者： Ciesko, Jan Poliakoff, David Hollman, Daisy S. Trott, Christian C. Lebrun-Grandie, Damien Sandia Natl Labs Comp Sci Res Inst POB 5800 Albuquerque NM 87185 USA Sandia Natl Labs Comp Sci Res Inst Livermore CA USA Oak Ridge Natl Lab Computat Sci & Engn Oak Ridge TN USA

ISBN: (纸本)9780738143057

parallel patterns, views, and spaces are promising abstractions to capture the programmer's intent as well as the contextual information that can be used by an underlying runtime to efficiently map software to parallel hardware. These abstractions can be valuable in cases where an algorithm must accommodate requirements of code and performance portability across hardware architectures and vendor programming models. Kokkos is a parallel programming model for host- and accelerator architectures that relies on these abstractions and targets these requirements. It consists of a pure C++ interface, a specification, and a programming library. The programming library exposes patterns and types and maps them to an underlying abstract machine model. The abstract machine model offers a generic view of parallel hardware. While Kokkos is gaining popularity in large-scale HPC applications at some DOE laboratories, we believe that the implemented concepts are of interest to a broader audience including academia as they may contribute to a generic, vendor, and architecture-independent education of parallel programming. In this work, we give an insight into the design considerations of this programming model and list important abstractions. Further, we document best practices obtained from giving virtual classes on Kokkos and give pointers to resources that the reader may consider valuable for a lecture on generic parallel programming for students with preexisting knowledge on this matter.

关键词： parallel programming Kokkos C plus

来源：评论

学校读者我要写书评

暂无评论

High-Throughput Stream Processing with Actors 10

High-Throughput Stream Processing with Actors

引用

10th International workshop on programming Based on Actors, Agents, and Decentralized Control (AGERE)

作者： Rinaldi, Luca Torquati, Massimo Mencagli, Gabriele Danelutto, Marco Univ Pisa Comp Sci Dept Pisa Italy

ISBN: (纸本)9781450381857

The steady growth of data volume produced as continuous streams makes paramount the development of software capable of providing timely results to the users. The Actor Model (AM) offers a high-level of abstraction suited for developing scalable message-passing applications. It allows the application developer to focus on the application logic moving the burden of implementing fast and reliable inter-Actors message-exchange to the implementation framework. In this paper, we focus on evaluating the model in high data rate streaming applications targeting scale-up servers. Our approach leverages parallel Pattern (PP) abstractions to model streaming computations and introduces optimizations that otherwise could be challenging to implement without violating the Actor Model's semantics. The experimental analysis demonstrates that the new implementation skeletons we propose for our PPs can bring significant performance boosts (more than 2x) in high data rate streaming applications implemented in CAF.

关键词： Actor Model parallel patterns Data Stream Processing programming Model Multi-Cores

来源：评论

学校读者我要写书评

暂无评论

Usability and Performance Improvements in Hatchet 7

Usability and Performance Improvements in Hatchet

引用

IEEE/ACM International workshop on HPC User Support Tools (HUST) / workshop on programming and Performance Visualization Tools (ProTools)

作者： Brink, Stephanie Lumsden, Ian Scully-Allison, Connor Williams, Katy Pearce, Olga Gamblin, Todd Taufer, Michela Isaacs, Katherine E. Bhatele, Abhinav Lawrence Livermore Natl Lab Livermore CA 94550 USA Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN USA Univ Arizona Dept Comp Sci Tucson AZ 85721 USA Univ Maryland Dept Comp Sci College Pk MD 20742 USA

ISBN: (纸本)9780738110707

Performance analysis is critical for pinpointing bottlenecks in parallel applications. Several profilers exist to instrument parallel programs on HPC systems and gather performance data. Hatchet is an open-source Python library that can read profiling output of several tools, and enables the user to perform a variety of programmatic analyses on hierarchical performance profiles. In this paper, we augment Hatchet to support new features: a query language for representing call path patterns that can be used to filter a calling context tree, visualization support for displaying and interacting with performance profiles, and new operations for performing analyses on multiple datasets. Additionally, we present performance optimizations in Hatchet's HPCToolkit reader and the unify operation to enable scalable analysis of large datasets.

关键词： performance analysis tools parallel profiles calling context tree call graph graph analytics

来源：评论

学校读者我要写书评

暂无评论

Empirical Modeling of Spatially Diverging Performance 7

Empirical Modeling of Spatially Diverging Performance

引用

IEEE/ACM International workshop on HPC User Support Tools (HUST) / workshop on programming and Performance Visualization Tools (ProTools)

作者： Calotoiu, Alexandru Geisenhofer, Markus Kummer, Florian Ritter, Marcus Weber, Jens Hoefler, Torsten Oberlack, Martin Wolf, Felix Swiss Fed Inst Technol Dept Comp Sci Zurich Switzerland Tech Univ Darmstadt Dept Mech Engn Darmstadt Germany

ISBN: (纸本)9780738110707

A common simplification made when modeling the performance of a parallel program is the assumption that the performance behavior of all processes or threads is largely uniform. Empirical performance-modeling tools such as Extra-P exploit this common pattern to make their modeling process more noise resilient, mitigating the effect of outliers by summarizing performance measurements of individual functions across all processes. While the underlying assumption does not equally hold for all applications, knowing the qualitative differences in how the performance of individual processes changes as execution parameters are varied can reveal important performance bottlenecks such as malicious patterns of load imbalance. A challenge for empirical modeling tools, however, arises from the fact that the behavioral class of a process may depend on the process configuration, letting process ranks migrate between classes as the number of processes grows. In this paper, we introduce a novel approach to the problem of modeling of spatially diverging performance based on a certain type of process clustering. We apply our technique to identify a previously unknown performance bottleneck in the BoSSS fluid-dynamics code. Removing it made the code regions in question running up to 20 times and the application as a whole run up to 4.5 times faster.

关键词： parallel programming performance modeling fluid dynamics

来源：评论

学校读者我要写书评

暂无评论

Accelerating Domain Propagation: an Efficient GPU-parallel Algorithm over Sparse Matrices 10

Accelerating Domain Propagation: an Efficient GPU-Parallel A...

引用

10th IEEE/ACM workshop on Irregular Applications - Architectures and Algorithms (IA3)

作者： Sofranac, Boro Gleixner, Ambros Pokutta, Sebastian Berlin Inst Technol Berlin Germany Zuse Inst Berlin Berlin Germany HTW Berlin Berlin Germany

ISBN: (纸本)9781665415576

Fast domain propagation of linear constraints has become a crucial component of today's best algorithms and solvers for mixed integer programming and pseudo-boolean optimization to achieve peak solving performance. Irregularities in the form of dynamic algorithmic behaviour, dependency structures, and sparsity patterns in the input data make efficient implementations of domain propagation on GPUs and, more generally, on parallel architectures challenging. This is one of the main reasons why domain propagation in state-of-the-art solvers is single thread only. In this paper, we present a new algorithm for domain propagation which (a) avoids these problems and allows for an efficient implementation on GPUs, and is (b) capable of running propagation rounds entirely on the GPU, without any need for synchronization or communication with the CPU. We present extensive computational results which demonstrate the effectiveness of our approach and show that ample speedups are possible on practically relevant problems: on state-of-theart GPUs, our geometric mean speed-up for reasonably-large instances is around 10x to 20x and can be as high as 195x on favorably-large instances.

关键词： Mixed Integer Linear programming MIP GPU Domain Propagation Bound Tightening parallel Algorithms

来源：评论

学校读者我要写书评

暂无评论

Evaluating FPGA Accelerator Performance with a Parameterized OpenCL Adaptation of Selected Benchmarks of the HPCChallenge Benchmark Suite 6

Evaluating FPGA Accelerator Performance with a Parameterized...

引用

6th IEEE/ACM International workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC)

作者： Meyer, Marius Kenter, Tobias Plessl, Christian Paderborn Univ Dept Comp Sci Paderborn Germany Paderborn Univ Paderborn Ctr Parallel Comp PC2 Paderborn Germany

ISBN: (纸本)9780738123547

FPGAs have found increasing adoption in data center applications since a new generation of high-level tools have become available which noticeably reduce development time for FPGA accelerators and still provide high-quality results. There is, however, no high-level benchmark suite available, which specifically enables a comparison of FPGA architectures, programming tools, and libraries for HPC applications. To fill this gap, we have developed an OpenCL-based open-source implementation of the HPCC benchmark suite for Xilinx and Intel FPGAs. This benchmark can serve to analyze the current capabilities of FPGA devices, cards, and development tool flows, track progress over time, and point out specific difficulties for FPGA acceleration in the HPC domain. Additionally, the benchmark documents proven performance optimization patterns. We will continue optimizing and porting the benchmark for new generations of FPGAs and design tools and encourage active participation to create a valuable tool for the community.

关键词： FPGA OpenCL High Level Synthesis HPC benchmarking

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共23页 << < 1 2 3 4 5 6 7 8 9 10 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：