Search results - Inner Mongolia University Library

arXiv, 2018

Authors: Zafari, Afshin; Larsson, Elisabeth; Tillenius, Martin. Uppsala University, Department of Information Technology, Box 337, Uppsala SE-751 05, Sweden

Current high-performance computer systems used for scientific computing typically combine shared memory computational nodes in a distributed memory environment. Extracting high performance from these complex systems requires tailored approaches. Task based parallel programming has been successful both in simplifying the programming and in exploiting the available hardware parallelism for shared memory systems. In this paper we focus on how to extend task parallel programming to distributed memory systems. We use a hierarchical decomposition of tasks and data in order to accommodate the different levels of hardware. We test the proposed programming model on two different applications, a Cholesky factorization, and a solver for the Shallow Water Equations. We also compare the performance of our implementation with that of other frameworks for distributed task parallel programming, and show that it is competitive. Copyright © 2018, The Authors. All rights reserved.
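To make the idea of decomposing a Cholesky factorization into tasks concrete, the sketch below lists the tile-level tasks of a standard right-looking tiled Cholesky factorization. It is only an illustration under stated assumptions: the function name `cholesky_tasks`, the tuple layout, and the flat (single-level) tiling are hypothetical choices made here, and the abstract does not say that the authors' hierarchical framework is organised this way.

```python
from typing import List, Tuple

# A task is (kernel name, tile indices). These names follow the usual
# BLAS/LAPACK kernel names; the tuple layout is an assumption of this sketch.
Task = Tuple[str, Tuple[int, ...]]


def cholesky_tasks(t: int) -> List[Task]:
    """List the tile tasks of a right-looking tiled Cholesky on a t x t tile grid."""
    tasks: List[Task] = []
    for k in range(t):
        # Factor the diagonal tile (k, k).
        tasks.append(("potrf", (k, k)))
        # Solve each tile below the diagonal in column k against tile (k, k).
        # These solves are mutually independent, so they can run in parallel.
        for i in range(k + 1, t):
            tasks.append(("trsm", (i, k)))
        # Update the trailing lower-triangular tiles using column k.
        for i in range(k + 1, t):
            for j in range(k + 1, i + 1):
                kind = "syrk" if i == j else "gemm"
                tasks.append((kind, (i, j, k)))
    return tasks
```

A task-based runtime would derive dependencies from the tiles each task reads and writes, then run independent tasks concurrently; a hierarchical scheme could apply the same decomposition again inside each tile.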

Keywords: parallel programming


Detection of Simulated Brain Strokes Using Microwave Tomography


IEEE JOURNAL OF ELECTROMAGNETICS RF AND MICROWAVES IN MEDICINE AND BIOLOGY, 2019, vol. 3, no. 4, pp. 254-260

Authors: Coli, Vanna Lisa; Tournier, Pierre-Henri; Dolean, Victorita; El Kanfoud, Ibtissam; Pichot, Christian; Migliaccio, Claire; Blanc-Feraud, Laure. Univ Cote dAzur, CNRS, CEPAM, F-06357 Nice 4, France; Univ Paris Diderot, SPC, Univ Sorbonne, CNRS, INRIA, LJLL, F-75005 Paris, France; Univ Cote dAzur, LJAD, CNRS, F-06108 Nice 2, France; Univ Strathclyde, Dept Math & Stat, Glasgow G1 1XH, Lanark, Scotland; Univ Cote dAzur, CNRS, LEAT, F-06903 Sophia Antipolis, France; Univ Cote dAzur, CNRS, INRIA, I3S, F-06900 Sophia Antipolis, France

Brain strokes are one of the leading causes of disability and mortality in adults in developed countries. Ischemic stroke (85% of total cases) and hemorrhagic stroke (15%) must be treated with opposing therapies, and thus, the nature of the stroke must be determined quickly in order to apply the appropriate treatment. Recent studies in biomedical imaging have shown that strokes produce variations in the complex electric permittivity of brain tissues, which can be detected by means of microwave tomography. Here, we present some synthetic results obtained with an experimental microwave tomography-based portable system for the early detection and monitoring of brain strokes. The determination of electric permittivity first requires the solution of a coupled forward-inverse problem. We make use of massive parallel computation from domain decomposition method and regularization techniques for optimization methods. Synthetic data are obtained with electromagnetic simulations corrupted by noise, which have been derived from measurement errors of the experimental imaging system. Results demonstrate the possibility to detect hemorrhagic strokes with microwave systems when applying the proposed reconstruction algorithm with edge preserving regularization.

Keywords: Microwave imaging; biomedical imaging; inverse problems; mathematical programming; optimization methods; signal reconstruction; dielectric constant; medical image processing; parallel programming; brain stroke imaging; domain-specific language; gradient based minimization algorithm; regularization methods; total variation; hemorrhagic brain stroke detection; high-speed parallel computing; iterative microwave tomographic imaging; massively parallel computing; numerical modeling; open source FreeFem++ solver; whole-microwave measurement system; brain modeling; computational modeling; tomography


Scheduling Mutual Exclusion Accesses in Equal-Length Jobs


ACM TRANSACTIONS ON PARALLEL COMPUTING, 2019, vol. 6, no. 2, pp. 1–26

Authors: Kagaris, Dimitri; Dutta, Sourav. Southern Illinois Univ, Elect & Comp Engn Dept, 1230 Lincoln Dr, Carbondale, IL 62901, USA

A fundamental problem in parallel and distributed processing is the partial serialization that is imposed due to the need for mutually exclusive access to common resources. In this article, we investigate the problem of optimally scheduling (in terms of makespan) a set of jobs, where each job consists of the same number L of unit-duration tasks, and each task either accesses exclusively one resource from a given set of resources or accesses a fully shareable resource. We develop and establish the optimality of a fast polynomial-time algorithm to find a schedule with the shortest makespan for any number of jobs and for any number of resources for the case of $L = 2$. In the notation commonly used for job-shop scheduling problems, this result means that the problem $J \mid d_{ij} = 1, n_j = 2 \mid C_{\max}$ is polynomially solvable, adding to the polynomial solutions known for the problems $J2 \mid n_j \le 2 \mid C_{\max}$ and $J2 \mid d_{ij} = 1 \mid C_{\max}$ (whereas other closely related versions such as $J2 \mid n_j \le 3 \mid C_{\max}$, $J2 \mid d_{ij} \in \{1,2\} \mid C_{\max}$, $J3 \mid d_{ij} = 1 \mid C_{\max}$, $J3 \mid d_{ij} = 1 \mid$ and $J \mid d_{ij} = 1, n_j \le 3 \mid C_{\max}$ are all known to be NP-complete). For the general case $L > 2$ (i.e., for the job-shop problem $J \mid d_{ij} = 1, n_j = L > 2 \mid C_{\max}$) we present a competitive heuristic and provide experimental comparisons with other heuristic versions and, when possible, with the ideal integer linear programming formulation.
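The problem setting described above can be sketched in a few lines. The code below is a simple greedy list scheduler, not the polynomial-time algorithm or the heuristic from the article, whose details the abstract does not give. The function name `greedy_makespan`, the use of `None` to mark the fully shareable resource, and the "most remaining tasks first" priority rule are all assumptions made for illustration.

```python
from typing import List, Optional

# A job is an ordered list of unit-duration tasks. Each task names the
# exclusive resource it needs, or None for the fully shareable resource.
Job = List[Optional[str]]


def greedy_makespan(jobs: List[Job]) -> int:
    """Schedule tasks one time step at a time and return the makespan."""
    next_task = [0] * len(jobs)  # index of each job's next unscheduled task
    time = 0
    while any(next_task[j] < len(jobs[j]) for j in range(len(jobs))):
        busy = set()  # exclusive resources already claimed in this step
        # Hypothetical priority rule: jobs with more remaining tasks go first.
        order = sorted(
            range(len(jobs)),
            key=lambda j: len(jobs[j]) - next_task[j],
            reverse=True,
        )
        for j in order:
            if next_task[j] == len(jobs[j]):
                continue  # job already finished
            resource = jobs[j][next_task[j]]
            if resource is None or resource not in busy:
                if resource is not None:
                    busy.add(resource)  # enforce mutual exclusion
                next_task[j] += 1
        time += 1
    return time
```

For example, with three jobs of length L = 2, `greedy_makespan([["A", "B"], ["A", "B"], ["B", "A"]])` returns 3, which matches the lower bound imposed by resource A being requested three times. A greedy rule like this carries no optimality guarantee in general.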

Keywords: Job-shop scheduling; mutual exclusion; critical resources; parallel programming; polynomial-time algorithm


The programming of sequences of saccades


EXPERIMENTAL BRAIN RESEARCH, 2019, vol. 237, no. 4, pp. 1009-1018

Authors: McSorley, Eugene; Gilchrist, Iain D.; McCloy, Rachel. Univ Reading, Sch Psychol & Clin Language Sci, Reading RG6 6AL, Berks, England; Univ Bristol, Sch Expt Psychol, Bristol BS8 1TU, Avon, England

Saccadic eye movements move the high-resolution fovea to point at regions of interest. Saccades can only be generated serially (i.e., one at a time). However, what remains unclear is the extent to which saccades are programmed in parallel (i.e., a series of such moments can be planned together) and how far ahead such planning occurs. In the current experiment, we investigate this issue with a saccade contingent preview paradigm. Participants were asked to execute saccadic eye movements in response to seven small circles presented on a screen. The extent to which participants were given prior information about target locations was varied on a trial-by-trial basis: participants were aware of the location of the next target only, the next three, five, or all seven targets. The addition of new targets to the display was made during the saccade to the next target in the sequence. The overall time taken to complete the sequence was decreased as more targets were available up to all seven targets. This was a result of a reduction in the number of saccades being executed and a reduction in their saccade latencies. Surprisingly, these results suggest that, when faced with a demand to saccade to a large number of target locations, saccade preparation about all target locations is carried out in parallel.

Keywords: Saccade Sequences parallel programming Eye movements


Improvement of Real-Time Hybrid Simulation Using Parallel Finite-Element Program


JOURNAL OF EARTHQUAKE ENGINEERING, 2020, vol. 24, no. 10, pp. 1547-1565

Authors: Lu, Li-Qiao; Wang, Jin-Ting; Zhu, Fei. Tsinghua Univ, State Key Lab Hydrosci & Engn, Beijing 100084, Peoples R China; Changjiang Inst Survey Planning Design & Res, Wuhan, Peoples R China

This paper proposes a novel framework to efficiently calculate a large-scale finite element (FE) numerical substructure in real-time hybrid simulation (RTHS). It is composed of a non-real-time Windows computer and a real-time Target Computer. The Windows computer is used to solve the FE numerical substructure by parallel computing in soft real-time, while the real-time Target Computer generates displacement signals for the controller in real time. Based on the proposed framework, a RTHS with numerical substructure simulated in Windows environment is developed. It is demonstrated that the computational efficiency of the RTHS could be greatly improved by parallel programming.

Keywords: Real-Time Hybrid Simulation; Windows Calculation System; Real-Time Blockset; Soft Real-Time; Interpolation Algorithm; parallel programming


Parallel programming in Haskell almost for free: An embedding of Intel's array building blocks


1st ACM SIGPLAN Workshop on Functional High Performance Computing, FHPC 2012

Authors: Svensson, Bo Joel; Sheeran, Mary. Chalmers University of Technology, Sweden

ISBN: (print) 9781450315777

Nowadays, performance in processors is increased by adding more cores or wider vector units, or by combining accelerators like GPUs and traditional cores on a chip. Programming for these diverse architectures is a challenge. We would like to exploit all the resources at hand without putting too much burden on the programmer. Ideally, the programmer should be presented with a machine model abstracted from the specific number of cores, SIMD width or the existence of a GPU or not. Intel's Array Building Blocks (ArBB) is a system that takes on these challenges. ArBB is a language for data parallel and nested data parallel programming, embedded in C++. By offering a retargetable dynamic compilation framework, it provides vectorisation and threading to programmers without the need to write highly architecture specific code. We aim to bring the same benefits to the Haskell programmer by implementing a Haskell frontend (embedding) of the ArBB system. We call this embedding EmbArBB. We use standard Haskell embedded language procedures to provide an interface to the ArBB functionality in Haskell. EmbArBB is work in progress and does not currently support all of the ArBB functionality. Some small programming examples illustrate how the Haskell embedding is used to write programs. ArBB code is short and to the point in both C++ and Haskell. Matrix multiplication has been benchmarked in sequential C++, ArBB in C++, EmbArBB and the Repa library. The C++ and the Haskell embeddings have almost identical performance, showing that the Haskell embedding does not impose any large extra overheads. Two image processing algorithms have also been benchmarked against Repa. In these benchmarks at least, EmbArBB performance is much better than that of the Repa library, indicating that building on ArBB may be a cheap and easy approach to exploiting data parallelism in Haskell. © 2012 ACM.

Keywords: parallel programming


On parallelisation of image dehazing with OpenMP


INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2019, vol. 11, no. 4, pp. 427-439

Authors: Weng, Tien-Hsiung; Chen, Yi-Siang; Lu, Huimin; Marino, Mario Donato; Li, Kuan-Ching. Providence Univ, Dept Comp Sci & Informat Engn, Taichung 43301, Taiwan; Kyushu Inst Technol, Dept Mech & Control Engn, Kitakyushu, Fukuoka, Japan; Leeds Beckett Univ, Sch Comp Creat Technol & Engn, City Campus, Leeds LS1 3HE, W Yorkshire, England; Guangzhou Univ, Guangzhou, Guangdong, Peoples R China

In this paper, we present our learning experience on the design and implementation of image dehazing parallel code with OpenMP developed from existing fast sequential version. The aim of this work is to present an analysis of a case study showing the development of parallel haze removal with practical and efficient use of shared memory multi-core servers. Implementation technique and result discussions in terms of program improvements that may be needed to support parallel application developers with similar high performance goals are presented. Preliminary studies, results and experiments on haze removal application program are executed on multi-core shared memory platforms, and results show that the performance of the proposed parallel code is promising.

Keywords: OpenMP; image haze removal; multicores; parallel programming


Parallel programming methods based on the multi-core DSP TMS320C6670


2012 International Applied Mechanics, Mechatronics Automation and System Simulation Meeting, AMMASS 2012

Authors: Li, Zhiyong; Ye, Zhenliang; Liu, Chentao. Hefei Xinxing Technology Institute, No.451 Huang Shan road, Hefei, China; Military Representative Bureau ZONGZHUANG, QingDao, China

While the frame rate is higher and the image size is larger, sequence images processing is harder. Good real-time can be ensured by the multi-core DSP in the embedded image processing system. TMS320C6670 which is the multi-core DSP designed by TI corporation is selected as study object. Based on hardware characteristics analyzed, the Data Flow model is adopted as the multicore processing model. Two data processing subtasks assigning methods are analyzed by comparing their advantages and disadvantages on the system idle time and memory requirements. The data processing subtask assigning flow is designed for a serial sequence images processing example. An inter-core data transfer flow design idea is put forward. Using methods and occasion of two kinds of data buffer establishing techniques is studied and defined. An inter-core notification flow design idea is put forward. Using methods and occasion of three notification methods based on the interrupt controller and the Semaphore2 module is studied and defined. © (2012) Trans Tech Publications, Switzerland.

Keywords: parallel programming


Teaching programming in the 21st Century


Data Processor for Better Business Education, 2023, vol. 63, no. 4

Author: Francisco Fernández de Vega. Universidad de Extremadura, Mérida, Spain

Although the teaching of programming has evolved over 50 years, all methodologies rely on a simple structure that was born a long time ago: the loop, shared by all high-level programming languages, and the preferred choice for any repetitive task programmers face. We analyze here how “loops” skew the way programmers solve problems, and prevent them from taking advantage of the available parallel/distributed computing architectures. To do so, we state our initial hypothesis: eliminating loops will allow a more natural parallel programming approach. The idea is to mimic a common practice today that was established in the past for a different purpose: prohibiting goto statements to improve code maintainability. This paper describes a new computer programming teaching strategy that we tested for 7 years and provides evidence on how loop prohibition, in the context of Functional programming, makes students aware of data dependencies and produces 21st-century programmers who benefit from widely available parallel architectures.
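The loop-free style argued for here can be illustrated with a small map/reduce sketch. It is not taken from the paper: the function names `square` and `sum_of_squares` and the choice of Python's `concurrent.futures` are assumptions made for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce
from operator import add
from typing import Iterable


def square(x: int) -> int:
    """Pure function: the result depends only on the argument."""
    return x * x


def sum_of_squares(values: Iterable[int]) -> int:
    """Compute the sum of squares without writing an explicit loop."""
    # map: every element is handled independently, so there is no
    # loop-carried dependency and the calls may run concurrently.
    with ThreadPoolExecutor() as pool:
        squares = list(pool.map(square, values))
    # reduce: addition is associative, so the partial results could be
    # combined in any grouping; here they are folded left to right.
    return reduce(add, squares, 0)
```

Calling `sum_of_squares([1, 2, 3, 4])` returns 30. Threads are used only to show the structure; for CPU-bound work in CPython, a process pool would be the more realistic choice.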

Keywords: programming methodologies; loops; parallel programming


Parallel programming for the web


4th USENIX Workshop on Hot Topics in Parallelism, HotPar 2012

Authors: Herhut, Stephan; Hudson, Richard L.; Shpeisman, Tatiana; Sreeram, Jaswanth. Intel Labs

Parallel hardware is today's reality and language extensions that ease exploiting its promised performance flourish. For most mainstream languages, one or more tailored solutions exist that address the specific needs of the language to access parallel hardware. Yet, one widely used language is still stuck in the sequential past: JavaScript, the lingua franca of the web. Our position is that existing solutions do not transfer well to the world of JavaScript due to differences in programming models, the additional requirements of the web, like safety, and to developer expectations. To address this we propose River Trail, a new parallel programming API designed specifically for JavaScript and we show how it satisfies the needs of the web. To prove that our approach is viable, we have implemented a prototype JIT compiler in Firefox that shows an order of magnitude performance improvement for a realistic web application. © HotPar 2012.

Keywords: parallel programming

