检索结果-内蒙古大学图书馆

IEEE Conference on Electromagnetic Field Computation

作者： Takeshi Iwashita Akihiro Ida Takeshi Mifune Yasuhito Takahashi Hokkaido Univcrsity Sapporo Japan The University of Tokyo Tokyo Japan Kyoto University Kyoto Japan Doshisha University Kyoto Japan

We developed a software framework for boundary element analyses. The software supports a hybrid parallel programming model and is equipped with a hierarchical matrix (H-matrix) library to accelerate the BEM analysis.

ISBN: (纸本)9781509010332

关键词： Software Libraries Computational modeling parallel programming Integral equations Boundary conditions Mathematical model

来源：评论

学校读者我要写书评

暂无评论

Hyper-threading technology: Not a good choice for speeding up CPU-bound code

Hyper-threading technology: Not a good choice for speeding u...

引用

International Conference on Electronic Design (ICED)

作者： Ng Hui Qun Z.I.A Khalib M. N. Warip M. Elshaikh Elobaid R. Mostafijur N.A.H. Zahri Puteh Saad Embedded Network and Advanced Computing Research Cluster University of Malaysia Perlis Arau Perlis

ISBN: (纸本)9781509021611

Hyper-threading (HT) technology allows one thread to execute its task while another thread is stalled waiting for shared resource or other operations to complete. Thus, this reduces the idle time of a processor. If HT is enabled, an operating system would see two logical cores per each physical core. This gives one physical core the ability to run two threads simultaneously. However, it does not necessarily speed up the performance of a parallel code twice the number of physical cores. This happens when two threads are trying to access the shared CPU resource. The instructions could only be executed one after another at any given time. In this case, parallel CPU-bound code could attain a little improvement in terms of speedup from HT on a quad-core platform, which is Intel i5-2410M@2.30GHz.

关键词： Algorithm design and analysis Instruction sets parallel processing Multicore processing Heuristic algorithms Scalability parallel programming

来源：评论

学校读者我要写书评

暂无评论

Applying parallel design patterns to embarassingly parallel problem

Applying parallel design patterns to embarassingly parallel ...

引用

Colossal Data Analysis and Networking (CDAN)

作者： Nilesh Maltare Chetan Chudasama Department of Information Technology MBICT India

ISBN: (纸本)9781509006700

This paper present experiment done with mapping of Algorithmic structure pattern with implementation pattern. Selection of implementation patterns and data structures needs to consider parallel platform for which they are developed and they also affects the performance of program. The experiment results supports need of Adaptive patterns for parallel programming to develop software's runs on different parallel environments.

关键词： Software parallel processing Algorithm design and analysis parallel programming Data structures Concurrent computing Software algorithms

来源：评论

学校读者我要写书评

暂无评论

Placement of Smart Mobile Access Points in Wireless Sensor Networks and Cyber-Physical Systems using Fog Computing

Placement of Smart Mobile Access Points in Wireless Sensor N...

引用

IEEE International Conference on Ubiquitous Intelligence and Computing

作者： Amin Majd Golnaz Sahebi Masoud Daneshtalab Juha Plosila Hannu Tenhunen Department of Information Technology University of Turku Royal Institute of Technology (KTH) Royal Institute of Technology University of Turku

ISBN: (纸本)9781509027729

Increasingly sophisticated, complex, and energy-efficient cyber-physical systems and wireless sensor networks are emerging, facilitated by recent advances in computing and sensor technologies. Integration of cyberphysical systems and wireless sensor networks with other contemporary technologies, such as unmanned aerial vehicles and fog or edge computing, enable creation of completely new smart solutions. We present the concept of a Smart Mobile Access Point (SMAP), which is a key building block for a smart network, and propose an efficient placement approach for such SMAPs. SMAPs predict the behavior of the network, based on information collected from the network, and select the best approach to support the network at any given time. When needed, they autonomously change their positions to obtain a better configuration from the network performance perspective. Therefore, placement of SMAPs is an important issue in such a system. Initial placement of SMAPs is an NP problem, and evolutionary algorithms provide an efficient means to solve it. Specifically, we present a parallel implementation of the imperialistic competitive algorithm and an efficient evaluation or fitness function to solve the initial placement of SMAPs in the fog computing context.

关键词： Smart mobile access point Fog computing Wireless sensor networks Cyber-physical systems Multi-objective optimization Evolutionary computing parallel approaches ICA parallel programming Multi-population Placement

来源：评论

学校读者我要写书评

暂无评论

Priority-grouping method for parallel multi-scheduling in Grid

引用

JOURNAL OF COMPUTER AND SYSTEM SCIENCES 2015年第6期81卷 943-957页

作者： Abraham, Goodhead Tomvie James, Anne Yaacob, Norlaily Coventry Univ Distributed Syst & Modelling Grp Coventry CV1 5FB W Midlands England

This article presents a method of enhancing the efficiency of Grid scheduling algorithms by employing a job grouping method based on priorities and also grouping of Grid machines based on their configuration before implementing a suitable scheduling algorithm within paired groups. The Priority method is employed to group jobs into four groups, while two different methods, Similar Together and Evenly Distributed, are employed to group machines into four groups before implementing the Min Min Grid scheduling algorithm simultaneously. Implementing the scheduling algorithms simultaneously within paired groups (multi-scheduling) ensures a high degree of parallelism, increases throughput and improves the overall performance of scheduling algorithms. Two sets of controlled experiments were carried out on an HPC system. Analysis of results shows that the Priority Grouping method improved the scheduling efficiency by very large margins over the non-grouping method. (C) 2014 Elsevier Inc. All rights reserved.

关键词： Grid computing Scheduling parallel programming Multicore-systems Multi-scheduling

来源：评论

学校读者我要写书评

暂无评论

Evaluation of Movement Facilitating Techniques for Finite Element Analysis of Magnetically Geared Electrical Machines

引用

IEEE TRANSACTIONS ON MAGNETICS 2015年第2期51卷 1-6页

作者： Gerber, Stiaan Wang, Rong-Jie Univ Stellenbosch Dept Elect & Elect Engn ZA-7600 Stellenbosch South Africa

The simulation of magnetically geared electrical machines using the finite element method is an especially demanding task when movement has to be considered. Several methods that facilitate movement exist. In this paper, two of these methods, the macro air-gap element (AGE) and the moving band (MB) are applied in a time-stepped static simulation of a magnetically geared machine (MGM). The methods are evaluated in terms of accuracy and computational efficiency, vitally important factors for numerical optimization. The implementation of both methods exploit the multi-core architecture of modern CPUs to solve several steps in parallel, drastically reducing the simulation time. Nevertheless, the computational cost of the AGE is prohibitively high in the simulation of MGMs. The MB is computationally efficient and good accuracy can be achieved using a multilayer approach.

关键词： Air gaps air-gap element (AGE) computational electromagnetics electric machines electromagnetics finite element analysis magnetic gears moving band (MB) parallel programming permanent magnet machines

来源：评论

学校读者我要写书评

暂无评论

Safe Data parallelism for General Streaming

引用

IEEE TRANSACTIONS ON COMPUTERS 2015年第2期64卷 504-517页

作者： Schneider, Scott Hirzel, Martin Gedik, Bugra Wu, Kun-Lung IBM TJ Watson Res Ctr Yorktown Hts NY 10598 USA Bilkent Univ Dept Comp Engn TR-06800 Ankara Turkey

Streaming applications process possibly infinite streams of data and often have both high throughput and low latency requirements. They are comprised of operator graphs that produce and consume data tuples. General streaming applications use stateful, selective, and user-defined operators. The stream programming model naturally exposes task and pipeline parallelism, enabling it to exploit parallel systems of all kinds, including large clusters. However, data parallelism must either be manually introduced by programmers, or extracted as an optimization by compilers. Previous data parallel optimizations did not apply to selective, stateful and user-defined operators. This article presents a compiler and runtime system that automatically extracts data parallelism for general stream processing. Data-parallelization is safe if the transformed program has the same semantics as the original sequential version. The compiler forms parallel regions while considering operator selectivity, state, partitioning, and graph dependencies. The distributed runtime system ensures that tuples always exit parallel regions in the same order they would without data parallelism, using the most efficient strategy as identified by the compiler. Our experiments using 100 cores across 14 machines show linear scalability for parallel regions that are computation-bound, and near linear scalability when tuples are shuffled across parallel regions.

关键词： Data processing distributed computing parallel programming

来源：评论

学校读者我要写书评

暂无评论

A parallel optimisation approach for the realisation problem in intensity modulated radiotherapy treatment planning

引用

COMPUTATIONAL OPTIMIZATION AND APPLICATIONS 2015年第2期60卷 441-477页

作者： Mason, Luke R. Mak-Hau, Vicky H. Ernst, Andreas T. Biarri Windsor Vic 3181 Australia Deakin Univ Sch Informat Technol Burwood Vic 3125 Australia CSIRO Math Informat & Stat Clayton Vic 3169 Australia

We propose a parallel algorithm for computing exact solutions to the problem of minimizing the number of multileaf collimator apertures needed in step-and-shoot intensity modulated radiotherapy. These problems are very challenging particularly as the problem size increases. Here, we investigate how advanced parallel computing methods can be applied to these problems with a focus on the issues that are peculiar to parallel search algorithms and do not arise in their serial counterparts. A previous paper by the authors presented the MU-RD method for solving such problems using a serial constraint programming based search method. This method is being used as the starting point for a parallel implementation. The key challenges in creating a parallel implementation are ensuring that the CPUs are not starved of work and avoiding unnecessary computation due to the rearrangement of the search order in the parallel version. We show that efficient parallel optimisation is possible by dynamically changing the way work is split with potentially multiple tree search processes as well as parallel search of nodes. A weakly sorted queueing system is used to ensure appropriate prioritisation of tasks. Numerical results are presented to demonstrate the effectiveness of our algorithms in scaling from 8 to 64 CPUs.

关键词： IMRT Combinatorial optimization Constraint programming parallel programming

来源：评论

学校读者我要写书评

暂无评论

A parallel Ant Colony Optimization for the Maximum-Weight Clique Problem

A Parallel Ant Colony Optimization for the Maximum-Weight Cl...

引用

IEEE International Symposium on parallel and Distributed Processing Workshops and Phd Forum (IPDPSW)

作者： Didier El Baz Mhand Hifi Lei Wu Xiaochuan Shi LAAS Universite Federale Toulouse Midi-Pyrenees Toulouse Occitanie FR Laboratory EPROAD Université de Picardie Jules Verne Amiens France School of International Software Wuhan University Wuhan China

ISBN: (纸本)9781509036837

In this paper, we propose a parallel ant colony optimization based metaheuristic for solving the maximum-weight clique problem, which is a variation of the maximum clique problem. The advised parallel computing model is based on concept of cooperation among multiple ant colonies system. The cooperation system consists of a message center and a number of ant colonies. Each ant colony attempts to explore the solution space by using its own search strategy. The message center first collects the solution information from different ant colonies, and then it shares the current best solution with them. The performance of the proposed method was evaluated on a set of the standard benchmark instances from literature. The obtained results were compared to those reached by the Cplex solver and the best solutions reported in the literature. From the experimental results, one can observe that encouraging results have been obtained.

关键词： Ant colony optimization parallel processing Optimization Computational modeling Buildings Message passing parallel programming

来源：评论

学校读者我要写书评

暂无评论

Using GPUs to speed-up Levenshtein edit distance computation

Using GPUs to speed-up Levenshtein edit distance computation

引用

International Conference on Information and Communication Systems (ICICS)

作者： Khaled Balhaf Mohammed A. Shehab Wala'a T. Al-Sarayrah Mahmoud Al-Ayyoub Mohammed Al-Saleh Yaser Jararweh Jordan University of Science and Technology Irbid Jordan

ISBN: (纸本)9781467386159

Sequence comparison problems such as sequence alignment and approximate string matching are part of the fundamental problems in many fields such as natural language processing, data mining and bioinformatics. However, the algorithms proposed to address these problems suffer from high computational complexities prohibiting them from being widely used in practical large-scale settings. Many researchers used parallel programming to reduce the execution time of these algorithms. In this paper, we follow this approach and use the parallelism capabilities of the Graphics Processing Unit (GPU) to accelerate one of the most common algorithms to compute the edit distance between two strings, which is known as the Levenshtein distance. To take full advantage of the large number of cores in a GPU, we employ a diagonal-based tracing technique which results in even greater improvements in terms of the running time. In fact, our CUDA implementation of the Levenshtein algorithm is about 11X faster than the sequential implementation. This is achieved without affecting the accuracy.

关键词： Graphics processing units Central Processing Unit Bioinformatics Acceleration parallel programming DNA Instruction sets

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：