检索结果-内蒙古大学图书馆

GPU Acceleration of Finding Maximum Eigenvalue of Positive Matrices 1

14th international conference on algorithms and architectures for parallel processing (ica3pp)

作者： Tian, Ning Guo, Longjiang Ai, Chunyu Ren, Meirui Li, Jinbao Heilongjiang Univ Sch Comp Sci & Technol Harbin Peoples R China Key Lab Database & Parallel Comp Heilongjiang Peoples R China Univ S Carolina Upstate Div Math & Comp Sci Upstate Peoples R China

ISBN: (数字)9783319111940

ISBN: (纸本)9783319111940;9783319111933

Matrix eigenvalue theory has become an important analysis tool in scientific computing. Sometimes, people do not need to find all eigenvalues but only the maximum eigenvalue. Existing algorithms of finding the maximum eigenvalue of matrices are implemented sequentially. With the increasing of the orders of matrices, the workload of calculation is getting heavier. therefore, traditional sequential methods are unable to meet the need of fast calculation for large matrices. this paper proposes a parallel algorithm named PA-ST to find the maximum eigenvalue of positive matrices by using similarity transformation which is implemented by CUDA (Computer Unified Device Architecture) on GPU (Graphic Process Unit). To the best of our knowledge, this is the first CUDA based parallel algorithm of calculating maximum eigenvalue of matrices. In order to improve the performance, optimization techniques are applied in this paper such as using the shared memory rather than the global memory to improve the speed of computation, avoiding bank conflicts by setting the span index, satisfying the principle of coalesced memory access, and by using single-precision floating-point arithmetic and the pinned memory to reduce the copy operation and obtain higher data transfer bandwidth between the host and the GPU device. the experimental results show that the similarity transformation technique can significantly shorten the running time compared to the sequential algorithm and the speedup ratio is nearly stable when the number of iterations increases. As the matrix order increases, the running time of the sequential algorithm and PA-ST increases correspondingly. Experiments also show that the speedup ratio of the PA-ST is between 2.85 and 35.028.

关键词： Maximum Eigenvalue Positive Matrix Similarity Transformation GPU CUDA

来源：评论

学校读者我要写书评

暂无评论

Message from the ica3pp 2011 program chairs

引用

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2011年第PART 1期7016 LNCS卷 VII页

作者： Xiang, Yang Cuzzocrea, Alfredo Hobbs, Michael Deakin University School of Information Technology Melbourne Burwood Campus 221 Burwood Highway Burwood VIC 3125 Australia Italy Deakin University School of Information Technology GeelongWaurn Ponds Campus Pigdons Road Geelong VIC 3217 Australia

来源：评论

学校读者我要写书评

暂无评论

HardBio 2011 foreword

Lecture Notes in Computer Science (including subseries Lectu...

引用

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2011年第PART 2期7017 LNCS卷 XIII页

作者： Nedjah, Nadia Mourelle, Luiza De Macedo

来源：评论

学校读者我要写书评

暂无评论

Message from the IDCS 2011 chairs

Lecture Notes in Computer Science (including subseries Lectu...

引用

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2011年第PART 2期7017 LNCS卷 VII-VIII页

作者： Abawajy, Jemal Fortino, Giancarlo Hasan, Ragib Rahman, Mustafizur Deakin University Australia University of Calabria Italy Johns Hopkins University United States IBM Australia

来源：评论

学校读者我要写书评

暂无评论

Budget constrained resource allocation for non-deterministic workflows on an iaas cloud

Budget constrained resource allocation for non-deterministic...

引用

12th international conference on algorithms and architectures for parallel processing, ica3pp 2012

作者： Caron, Eddy Desprez, Frédéric Muresan, Adrian Suter, Frédéric UMR CNRS ENS Lyon UCB Lyon 1 46 allée d'Italie 69364 Lyon Cedex 7 France IN2P3 Computing Center CNRS IN2P3 43 bd du 11 novembre 1918 69622 Villeurbanne France

ISBN: (纸本)9783642330773

Many scientific applications are described through workflow structures. Due to the increasing level of parallelism offered by modern computing infrastructures, workflow applications now have to be composed not only of sequential programs, but also of parallel ones. Cloud platforms bring on-demand resource provisioning and pay-as-you-go billing model. then the execution of a workflow corresponds to a certain budget. the current work addresses the problem of resource allocation for non-deterministic workflows under budget constraints. We present a way of transforming the initial problem into sub-problems that have been studied before. We propose two new allocation algorithms that are capable of determining resource allocations under budget constraints and we present ways of using them to address the problem at hand. © 2012 Springer-Verlag.

关键词： Resource allocation

来源：评论

学校读者我要写书评

暂无评论

Deadline-oriented task scheduling for mapreduce environments 15th

Deadline-oriented task scheduling for mapreduce environments

引用

15th international conference on algorithms and architectures for parallel processing, ica3pp 2015

作者： Hu, Minghao Wang, Changjian You, Pengfei Huang, Zhen Peng, Yuxing National Laboratory for Parallel and Distributed Processing School of Computer Science National University of Defense Technology Changsha410072 China

ISBN: (纸本)9783319271217

To provide timely results for ‘Big Data Analytics’, it is crucial to satisfy deadline requirements for MapReduce jobs in production environments. In this paper, we propose a deadline-oriented task scheduling approach, named Dart, to meet the given deadline and maximize the input size if only part of the dataset can be processed before the time limit. Dart uses an iterative estimation method which is based on both historical data and job running status to precisely estimate the realtime job completion time. By comparing the estimated time with the deadline constraint, a YARN-based task scheduler dynamically decides whether continuing or terminating the map *** have validated our approach using workloads from OpenCloud and Facebook on a cluster of 60 virtual machines. the results show that Dart can not only effectively meet the deadline but also process near-maximal data volumes even when the deadline is set to be extremely small and limited resources are allocated. © Springer international Publishing Switzerland 2015.

关键词： MapReduce

来源：评论

学校读者我要写书评

暂无评论

Enhancing parallel data loading for large scale scientific database 15th

Enhancing parallel data loading for large scale scientific d...

引用

15th international conference on algorithms and architectures for parallel processing, ica3pp 2015

作者： Li, Hui Li, Hongyuan Chen, Mei Dai, Zhenyu Zhu, Ming Huang, Menglin Department of Computer Science Guizhou University Guiyang550025 China Guizhou Engineering Laboratory of ACMIS Guiyang550025 China National Astronomical Observatories Chinese Academy of Sciences Beijing100016 China

ISBN: (纸本)9783319271217

the rapidly increased data size make large scale scientific database often have a huge time delay between loading data into the system and ready for receiving query request. To solve this problem, we proposed an efficient parallel data loading approach named FASTLoad. It is designed to maximize the given resource (e.g., network bandwidth, main memory) utilization for optimizing the data loading in large scale array model based scientific database system. To verify the efficiency of FASTLoad, we implemented it in our Adaptable Data Loading System and evaluate its performance over various sizes of large scientific data sets. Our experimental results show that the performance of FASTLoad can be 4 to 6 times fast than the built-in loading techniques of states-of-the-arts array model based scientific database system. © Springer international Publishing Switzerland 2015.

关键词： Database systems

来源：评论

学校读者我要写书评

暂无评论

parallel aware hybrid Solid-State storage 15th

Parallel aware hybrid Solid-State storage

引用

15th international conference on algorithms and architectures for parallel processing, ica3pp 2015

作者： He, Dan Wang, Fang Feng, Dan Liu, Jingning Wu, Yunxiang He, Ying Hu, Yang Wuhan National Laboratory for Optoelectronics School of Computer Science and Technology Huazhong University of Science and Technology Wuhan430074 China Nanchang Hangkong University Nanchang330063 China China Ship Development and Design Center Wuhan430064 China

ISBN: (纸本)9783319271392

Compared with tradition disk, NAND Flash has advantages of higher performance and shock resistance. But before write, NAND Flash must erase the old messages. that why NAND Flash based Solid State Disks (SSDs) always use the log-based schemes to improve the performance. Compared with NAND Flash, Phase Change Memory (PCM) has higher write performance, longer lifetime, and can update in-place, but its cost is high and capacity is low. So, in PCM and NAND Flash hybrid SSD, PCM is always used as log region, such as In-Page Logging-based (hybrid-IPL) SSD. the log-based SSD incurs a large number of merge operations. the cost of merge operation is very high because it involves many read, write operations, as well as an erase operation. So, how to reduce the cost of merge operations is the critical challenge to log-based hybrid (PCM and flash) SSD. In this paper, we propose a new merge scheme in PCM and NAND Flash hybrid SSD, called parallel aware hybrid In-Page Logging-based (P-aware-IPL) SSD. this scheme can exploit the die-level and plane-level parallelism of flash. Leveraging these two levels of parallelism, the cost of full merge is significantly reduced compared with that of hybrid-IPL SSD scheme and there is no other additional overhead in our algorithm. Experiment results have shown that the proposed P-aware-IPL reduces the flash write and erase operations by up to 10% and average response time by up to 22% against the hybrid-IPL scheme. © Springer international Publishing Switzerland 2015.

关键词： Cost reduction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：