ISBN (digital): 9781728148946
ISBN (print): 9781728148953
In this paper, we present teaching experiences from offering a course on large-scale parallel computing using the Message Passing Interface (MPI). The course was offered to undergraduates and graduates in the Department of Computer Science and Engineering, Indian Institute of Technology Kanpur, for the first time in a very long time. We present the topics covered, how the course content was decided, the class demographics, and the resources made available for students to run their MPI jobs, and we discuss the outcomes of the course. We also discuss the stumbling blocks encountered while offering a parallel computing systems course without much support from teaching assistants, and the lessons we carried forward to the next offering of the course.
ISBN (print): 9781538659892
We present an implementation of the Jaccard Index for graphs on the Migratory Memory-Side Processing Emu architecture. This index was designed to find similarities between different vertices in a graph and is often used to identify communities. The Emu architecture is a parallel system based on a partitioned global address space, with threads that automatically migrate within the memory system. We introduce the parallel programming model used to exploit it, detail our implementation of the algorithm, and analyze simulated performance results as well as early hardware tests. We discuss its application to large-scale problems.
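For reference, the quantity being computed is the neighborhood Jaccard similarity |N(u) ∩ N(v)| / |N(u) ∪ N(v)| for a vertex pair (u, v). The sketch below illustrates that definition on sorted adjacency lists in plain C++; it is only an illustration of the index itself, not the migratory-thread Emu implementation described in the paper.

```cpp
// Illustrative computation of the graph Jaccard index for one vertex pair,
// assuming sorted adjacency lists. This is not the Emu implementation from
// the paper, just the definition the paper parallelizes.
#include <cstddef>
#include <vector>

double jaccard(const std::vector<int>& nu, const std::vector<int>& nv) {
    std::size_t i = 0, j = 0, common = 0;
    while (i < nu.size() && j < nv.size()) {            // merge-style intersection
        if (nu[i] == nv[j])     { ++common; ++i; ++j; }
        else if (nu[i] < nv[j]) { ++i; }
        else                    { ++j; }
    }
    std::size_t uni = nu.size() + nv.size() - common;   // |N(u) ∪ N(v)|
    return uni == 0 ? 0.0 : static_cast<double>(common) / uni;
}
```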
ISBN (print): 9781538633656
Low latency is a fundamental requirement for Virtual Reality (VR) systems to reduce the potential risks of cybersickness and to increase effectiveness, efficiency, and user experience. In contrast to the effects of uniform latency degradation, the influence of latency jitter on user experience in VR is not well researched, although today's consumer VR systems are vulnerable in this respect. In this work we report on the impact of latency jitter on cybersickness in HMD-based VR environments. Test subjects were given a search task in Virtual Reality that provoked both head rotation and translation. One group experienced artificially added latency jitter in the tracking data of their head-mounted display. The introduced jitter pattern was a replication of real-world latency behavior extracted and analyzed from an existing example VR system. The effects of the introduced latency jitter were measured using the self-report Simulator Sickness Questionnaire (SSQ) and physiological measurements. We found a significant increase in self-reported simulator sickness. We therefore argue that measuring and controlling latency based on average values taken at a few time intervals is not enough to assure the required timeliness behavior, and that latency jitter needs to be considered when designing experiences for Virtual Reality.
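As a rough illustration of the kind of jitter injection described (the class, member names, and timing scheme below are invented for this sketch and are not taken from the paper), a recorded trace of per-sample delays can be replayed onto the head-tracking stream before it reaches the renderer:

```cpp
// Hypothetical sketch: replay a recorded latency-jitter trace onto incoming
// head-tracking samples. All names and the timing scheme are assumptions for
// illustration; the paper's actual injection mechanism is not reproduced here.
#include <chrono>
#include <cstddef>
#include <deque>
#include <utility>
#include <vector>

struct Pose { double px, py, pz, qw, qx, qy, qz; };

class JitterInjector {
public:
    JitterInjector(std::vector<std::chrono::microseconds> jitter_trace,
                   std::chrono::microseconds base_latency)
        : trace_(std::move(jitter_trace)), base_(base_latency) {}

    // Called whenever the tracker delivers a new pose.
    void push(const Pose& p) {
        auto jitter = trace_.empty() ? std::chrono::microseconds{0}
                                     : trace_[index_++ % trace_.size()];
        queue_.push_back({p, std::chrono::steady_clock::now() + base_ + jitter});
    }

    // Called by the renderer each frame; returns false until a sample is due.
    bool pop(Pose& out) {
        if (queue_.empty() ||
            std::chrono::steady_clock::now() < queue_.front().second)
            return false;
        out = queue_.front().first;
        queue_.pop_front();
        return true;
    }

private:
    std::vector<std::chrono::microseconds> trace_;  // replayed real-world jitter
    std::chrono::microseconds base_;                // constant baseline latency
    std::deque<std::pair<Pose, std::chrono::steady_clock::time_point>> queue_;
    std::size_t index_ = 0;
};
```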
ISBN (digital): 9783030038014
ISBN (print): 9783030038014; 9783030038007
Accurate background subtraction is an essential tool for high-level computer vision applications. However, as research continues to increase the accuracy of background subtraction algorithms, computational efficiency has often suffered as a result of increased complexity. Consequently, many sophisticated algorithms are unable to maintain real-time speeds with increasingly high-resolution video inputs. To combat this unfortunate reality, we propose to exploit the inherently parallelizable nature of background subtraction algorithms by making use of NVIDIA's parallel computing platform, CUDA. By using the CUDA interface to execute parallel tasks on the Graphics Processing Unit (GPU), we are able to achieve up to a two-orders-of-magnitude speedup over traditional techniques. Moreover, the proposed GPU algorithm achieves over an 8x speedup over the CPU-based background subtraction implementation proposed in our previous work [1].
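To make the "inherently parallelizable" point concrete, the sketch below shows a simple running-average background model in plain C++ (a generic example, not the algorithm evaluated in the paper): every pixel is processed independently, which is exactly what lets a GPU version assign one CUDA thread per pixel.

```cpp
// Illustrative per-pixel background subtraction with a running-average model.
// This is a generic example, not the paper's algorithm; on the GPU the loop
// body would become the per-thread work of a CUDA kernel (one thread / pixel).
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

void subtract_background(const std::vector<float>& frame,   // grayscale, 0..255
                         std::vector<float>& background,    // running model
                         std::vector<std::uint8_t>& mask,   // 255 = foreground
                         float alpha, float threshold) {
    for (std::size_t i = 0; i < frame.size(); ++i) {        // independent per pixel
        float diff = std::fabs(frame[i] - background[i]);
        mask[i] = diff > threshold ? 255 : 0;
        // Update the model only where the pixel currently looks like background.
        if (mask[i] == 0)
            background[i] = (1.0f - alpha) * background[i] + alpha * frame[i];
    }
}
```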
ISBN (print): 9781538623176
DEPSO-Scout is a hybrid optimization algorithm combining Differential Evolution (DE), Particle Swarm Optimization (PSO), and Artificial Bee Colony (ABC). Solution convergence is balanced between the exploration of PSO and the exploitation of DE, and premature convergence to suboptimal solutions is reduced by the scout-bee behavior of ABC. DEPSO-Scout outperforms traditional DE, PSO, and ABC. However, in higher-dimensional search spaces, the accuracy of DEPSO-Scout is maintained while its search speed decreases significantly; in our experiments, the computational time varies with the complexity of the problem. To improve the time performance of DEPSO-Scout, parallelization techniques become of interest. By modifying the DEPSO-Scout algorithm with a parallel approach, the speed of the algorithm is significantly improved while the correctness of solutions is maintained. The experiments and an analysis of speedup and algorithm efficiency are presented, and opportunities for further improving parallel DEPSO-Scout are discussed in the last section.
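The sketch below gives a rough idea of how such a hybrid per-particle update can combine a PSO velocity term, a DE-style mutation, and an ABC scout-bee reset; the function, parameters, and blending scheme are assumptions for illustration and are not the DEPSO-Scout update rules from the paper.

```cpp
// Hypothetical sketch of a hybrid DE/PSO/ABC-style update step; the exact
// DEPSO-Scout rules and parameters are not reproduced from the paper.
#include <cstddef>
#include <random>
#include <vector>

struct Particle {
    std::vector<double> x, v, best_x;   // position, velocity, personal best
    int stagnation = 0;                 // iterations without improvement
};

// One update of a single particle.
void hybrid_step(Particle& p,
                 const std::vector<double>& gbest,   // swarm-best position
                 const std::vector<double>& a,       // three distinct particles
                 const std::vector<double>& b,       // used for DE mutation
                 const std::vector<double>& c,
                 double w, double c1, double c2, double F,
                 int scout_limit, double lo, double hi, std::mt19937& rng) {
    std::uniform_real_distribution<double> u(0.0, 1.0);
    for (std::size_t d = 0; d < p.x.size(); ++d) {
        // PSO velocity update (exploration).
        p.v[d] = w * p.v[d] + c1 * u(rng) * (p.best_x[d] - p.x[d])
                            + c2 * u(rng) * (gbest[d] - p.x[d]);
        // DE-style mutant blended into the new position (exploitation).
        double de = a[d] + F * (b[d] - c[d]);
        p.x[d] = 0.5 * (p.x[d] + p.v[d]) + 0.5 * de;
    }
    // ABC scout-bee behaviour: reinitialize a particle that stopped improving.
    if (p.stagnation > scout_limit) {
        std::uniform_real_distribution<double> r(lo, hi);
        for (auto& xi : p.x) xi = r(rng);
        p.stagnation = 0;
    }
}
```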
Author: Zafari, Afshin
Uppsala Univ, Div Comp Sci, Dept Informat Technol, Lagerhyddsvagen 2, S-75237 Uppsala, Sweden
ISBN (print): 9783319780245; 9783319780238
Task-based parallel programming has shown competitive outcomes in many aspects of parallel programming, such as efficiency, performance, productivity, and scalability. Different software development frameworks use different approaches to provide these outcomes to the programmer while making the underlying hardware architecture transparent. However, since programs are not portable between these frameworks, choosing one framework over another remains a critical decision for a programmer concerned with the expandability, adaptivity, maintainability, and interoperability of the programs. In this work, we propose a unified programming interface that a programmer can use to work with different task-based parallel frameworks transparently. In this approach we abstract the common concepts of task-based parallel programming and provide them to the programmer uniformly for all frameworks through a single programming interface. We have tested the interface by running programs that implement matrix operations within frameworks optimized for shared- and distributed-memory architectures and accelerators, with the cooperation between frameworks configured externally and no need to modify the programs. Further possible extensions of the interface and potential future research are also described.
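The sketch below illustrates the general idea of programming against one task interface with pluggable backends; all class and function names are invented for this illustration and are not the interface proposed in the paper.

```cpp
// Hypothetical illustration of a unified task-based interface: the application
// submits tasks against one abstract interface, and a backend adapter forwards
// them to a concrete framework (shared-memory, distributed, or accelerator).
#include <functional>
#include <vector>

class TaskBackend {                          // one adapter per underlying framework
public:
    virtual ~TaskBackend() = default;
    virtual void submit(std::function<void()> task) = 0;
    virtual void wait_all() = 0;
};

class SequentialBackend : public TaskBackend {   // trivial reference backend
public:
    void submit(std::function<void()> task) override {
        pending_.push_back(std::move(task));
    }
    void wait_all() override {
        for (auto& t : pending_) t();
        pending_.clear();
    }
private:
    std::vector<std::function<void()>> pending_;
};

// Application code is written once against the abstract interface; swapping
// the backend object changes the framework without touching this function.
void scale_blocks(TaskBackend& rt, std::vector<std::vector<double>>& blocks, double s) {
    for (auto& blk : blocks)
        rt.submit([&blk, s] { for (auto& x : blk) x *= s; });
    rt.wait_all();
}
```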
ISBN (print): 9781479970612
In this paper, we present a new distributed algorithm for minimizing a sum of not necessarily differentiable convex functions composed with arbitrary linear operators. The overall cost function is assumed to be strongly convex. Each involved function is associated with a node of a hypergraph, and each node can communicate with neighboring nodes sharing the same hyperedge. Our algorithm relies on a primal-dual splitting strategy with established convergence guarantees. We show how it can be efficiently implemented to take full advantage of a multicore architecture. The good numerical performance of the proposed approach is illustrated on a video sequence denoising problem, where a significant speedup is achieved.
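For reference, the problem class described can be written in standard notation as follows (a generic formulation assumed here, not necessarily the paper's exact one):

```latex
% Generic statement of the problem class (standard notation, assumed here):
\[
  \min_{x \in \mathbb{R}^n} \; \sum_{i=1}^{m} f_i(L_i x),
\]
% where each f_i is convex but not necessarily differentiable, each L_i is a
% linear operator, each term is associated with a node of the hypergraph, and
% the overall sum is strongly convex.
```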
ISBN (print): 9783319672205; 9783319672199
In the paper we present parallel implementations, as well as execution times and speedups, of three different algorithms run in various environments, such as a workstation with multi-core CPUs and a cluster. The parallel codes, implementing the master-slave model in C+MPI, differ in their computation-to-communication ratios. The considered problems include a genetic algorithm with various ratios of master processing time to communication and fitness evaluation times, matrix multiplication, and numerical integration. We present how the codes scale in the aforementioned systems. For the numerical integration code, which scales very well, we also show performance in a hybrid CPU + Xeon Phi environment.
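As a concrete, minimal illustration of the master-slave pattern with MPI (not one of the codes measured in the paper), the sketch below integrates 4/(1+x^2) over [0, 1] with the master handing out subintervals on demand and accumulating the workers' partial sums; it assumes at least two MPI ranks.

```cpp
// Minimal master-slave numerical-integration sketch with MPI (illustrative
// only). Run with at least two ranks: rank 0 is the master, the rest compute
// partial trapezoid sums over subintervals handed out on demand.
#include <mpi.h>
#include <cstdio>

static double f(double x) { return 4.0 / (1.0 + x * x); }    // integral over [0,1] is pi

static double trapezoid(double a, double b, int n) {
    double h = (b - a) / n, s = 0.5 * (f(a) + f(b));
    for (int i = 1; i < n; ++i) s += f(a + i * h);
    return s * h;
}

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    const int tasks = 256;                        // subintervals of [0,1]
    const double width = 1.0 / tasks;

    if (rank == 0) {                              // master: distribute tasks, collect sums
        double total = 0.0;
        int next = 0, active = size - 1;
        MPI_Status st;
        while (active > 0) {
            double partial;
            MPI_Recv(&partial, 1, MPI_DOUBLE, MPI_ANY_SOURCE, 0, MPI_COMM_WORLD, &st);
            total += partial;
            int task = (next < tasks) ? next++ : -1;   // -1 tells the worker to stop
            if (task < 0) --active;
            MPI_Send(&task, 1, MPI_INT, st.MPI_SOURCE, 0, MPI_COMM_WORLD);
        }
        std::printf("integral ~ %.12f\n", total);
    } else {                                      // worker: request work until told to stop
        double partial = 0.0;                     // first send carries no real result
        int task;
        while (true) {
            MPI_Send(&partial, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD);
            MPI_Recv(&task, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            if (task < 0) break;
            partial = trapezoid(task * width, (task + 1) * width, 1000);
        }
    }
    MPI_Finalize();
    return 0;
}
```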
ParaSail is a language specifically designed to simplify the construction of programs that make full, safe use of parallel hardware even while manipulating potentially irregular data structures. As parallel hardware h...
In the framework of Network Function Virtualization (NFV), we address in this work the performance analysis of virtualized network functions (VNFs), with the virtualization of the radio access network (namely, Cloud-RAN) as the driving use case. The overarching principle of network virtualization consists of replacing network functions, which so far ran on dedicated and proprietary hardware, with open software applications running on shared general-purpose servers. The complexity of virtualization lies in the softwarization of low-layer network functions (namely, PHY functions), because their execution must meet strict latency requirements. Throughout this work, we evaluate the performance of VNFs in terms of latency, considering the total amount of time required to process VNFs in cloud computing systems. We notably investigate the relevance of resource pooling and statistical multiplexing when the available cores in a data center are shared by all active VNFs. We model VNFs by means of stochastic service systems. The proposed queuing models reveal the behavior of high-performance computing architectures based on parallel processing and enable dimensioning of the required computing capacity in data centers.
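As a small illustration of the resource-pooling argument (a generic M/M/c calculation, not one of the paper's models), the Erlang-C formula gives the probability that an arriving job must wait when c pooled cores are shared; at the same per-core utilization, a larger shared pool waits less often, which is the statistical-multiplexing gain.

```cpp
// Generic M/M/c (Erlang-C) waiting probability for c pooled cores serving a
// Poisson arrival stream; illustrative only, not the paper's queuing models.
// offered_load = lambda / mu (Erlangs); lambda / (c * mu) must be < 1.
#include <cstdio>

double erlang_c(int c, double offered_load) {
    double rho = offered_load / c;
    if (rho >= 1.0) return 1.0;                 // unstable: every job waits
    double term = 1.0, sum = 0.0;               // term tracks A^k / k!
    for (int k = 0; k < c; ++k) {
        sum += term;
        term *= offered_load / (k + 1);
    }
    double tail = term / (1.0 - rho);           // A^c / (c! (1 - rho))
    return tail / (sum + tail);
}

int main() {
    // Pooling effect: same per-core load, larger shared pool -> jobs wait less
    // often (statistical multiplexing).
    std::printf("P(wait) with  4 cores at 70%% load: %.4f\n", erlang_c(4, 2.8));
    std::printf("P(wait) with 16 cores at 70%% load: %.4f\n", erlang_c(16, 11.2));
    return 0;
}
```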