检索结果-内蒙古大学图书馆

Analyzing Performance of the parallel-based Fractal Image Compression Problem on Multicore Systems

AASRI Procedia 2013年 5卷 140-146页

作者： Roberto de Quadros Gomes Vladimir Guerreiro Rodrigo da Rosa Righi Luiz Gonzaga da Silveira Jinyoung Yang Applied Computing Graduate Program –PIPCA - UNISINOS–Unisinos Avenue 950 São Leopoldo RS Brazil Korean Advanced Institute of Science and Technology –KAIST – 291 Daehak-ro Yuseong-gu Daejeon 305-701South Korea

Both the size and the resolution of images always were key topics in the graphical computing area. Especially, they become more and more relevant in the big data era. We can observe that often a huge amount of data is exchanged by medium/low bandwidth networks or yet, they need to be stored on devices with limited space of memory. In this context, the present paper shows the use of the Fractal method for image compression. It is a lossy method known by providing higher indexes of file reduction through a highly time consuming phase. In this way, we developed a model of parallel application for exploiting the power of multiprocessor architectures in order to get the Fractal method advantages in a feasible time. The evaluation was done with different-sized images as well as by using two types of machines, one with two and another with four cores. The results demonstrated that both the speedup and efficiency are highly dependent of the number of cores. They emphasized that a large number of threads does not always represent a better performance.

关键词： parallel programing Fractal Coding Multicore architectures parallel modeling Threads Performance analysis

来源：评论

学校读者我要写书评

暂无评论

Analyzing Performance of the parallel-based Fractal Image Compression Problem on Multicore Systems

Analyzing Performance of the Parallel-based Fractal Image Co...

引用

The 2013 AASRI Conference on parallel and Distributed Computing and Systems(DCS 2013)

作者： Roberto de Quadros Gomes Vladimir Guerreiro Rodrigo da Rosa Righi Luiz Gonzaga da Silveira Jr. JinyoungYang Applied Computing Graduate Program-PIPCA-UNISINOS-Unisinos Avenue 950 Korean Advanced Institute of Science and Technology-KAIST-291 Daehak-ro

Both the size and the resolution of images always were key topics in the graphical computing ***,they become more and more relevant in the big data *** can observe that often a huge amount of data is exchanged by medium/low bandwidth networks or yet,they need to be stored on devices with limited space of *** this context,the present paper shows the use of the Fractal method for image *** is a lossy method known by providing higher indexes of file reduction through a highly time consuming *** this way,we developed a model of parallel application for exploiting the power of multiprocessor architectures in order to get the Fractal method advantages in a feasible *** evaluation was done with different-sized images as well as by using two types of machines,one with two and another with four *** results demonstrated that both the speedup and efficiency are highly dependent of the number of *** emphasized that a large number of threads does not always represent a better performance.

关键词： parallel programing Fractal Coding Multicore architectures parallel modeling Threads Performance analysis

来源：评论

学校读者我要写书评

暂无评论

Automated Generation of Polyhedral Process Networks from Affine Nested-Loop Programs with Dynamic Loop Bounds

引用

ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS 2013年第1-Sup期13卷 28-28页

作者： Nadezhkin, Dmitry Nikolov, Hristo Stefanov, Todor Leiden Univ Leiden Inst Adv Comp Sci NL-2333 CA Leiden Netherlands

The Process Networks (PNs) is a suitable parallel model of computation (MoC) used to specify embedded streaming applications in a parallel form facilitating the efficient mapping onto embedded parallel execution platforms. Unfortunately, specifying an application using a parallel MoC is a very difficult and highly error-prone task. To overcome the associated difficulties, we have developed the pn compiler, which derives specific Polyhedral Process Networks (PPN) parallel specifications from sequential static affine nested loop programs (SANLPs). However, there are many applications, for example, multimedia applications (MPEG coders/decoders, smart cameras, etc.) that have adaptive and dynamic behavior which cannot be expressed as SANLPs. Therefore, in order to handle dynamic multimedia applications, in this article we address the important question whether we can relax some of the restrictions of the SANLPs while keeping the ability to perform compile-time analysis and to derive PPNs. Achieving this would significantly extend the range of applications that can be parallelized in an automated way. The main contribution of this article is a first approach for automated translation of affine nested loop programs with dynamic loop bounds into input-output equivalent Polyhedral Process Networks. In addition, we present a method for analyzing the execution overhead introduced in the PPNs derived from programs with dynamic loop bounds. The presented automated translation approach has been evaluated by deriving a PPN parallel specification from a real-life application called Low Speed Obstacle Detection (LSOD) used in the smart cameras domain. By executing the derived PPN, we have obtained results which indicate that the approach we present in this article facilitates efficient parallel implementations of sequential nested loop programs with dynamic loop bounds. That is, our approach reveals the possible parallelism available in such applications, which allows for the utili

关键词： Design Theory Algorithms Performance Models of Computation polyhedral process networks compiler techniques for MPSoCs parallel programing

来源：评论

学校读者我要写书评

暂无评论

High-Performance Computing on a Supercomputer Based on New-Generation Processors

High-Performance Computing on a Supercomputer Based on New-G...

引用

5th Romania Tier 2 Federation Grid, Cloud and High Performance Computing Science (RO-LCG)

作者： Ungurean, Ioan Rusu, Ionela Pentiuc, Stefan-Gheorghe Stefan Cel Mare Univ Suceava Fac Elect Engn & Comp Sci Suceava Romania

ISBN: (纸本)9781467322423;9789786627113

The supercomputers built with processors based on CBEA architecture are relatively new and there are fewer applications optimized for this type of system. In this article, we propose to use computing resources provided by a CBEA-based cluster by parallelization and optimization of an algorithm for classification of a large dataset. In order to analyze the proposed methods of parallelization, the algorithm is executed on a CBEA-based cluster, which contains 96 PowerXCell 8i processors (with theoretical peak performance of 9.83TFlops). We analyze the execution time on a processor and on all 96 processors, with and without utilization of the SPE cores.

关键词： Cell Broadband Engine Architecture HPC PowerXCell supercomputing parallel programing

来源：评论

学校读者我要写书评

暂无评论

Contratos formais para derivação e verificação de componentes paralelos

Contratos formais para derivação e verificação de compon...

引用

作者： Marcilon, Thiago Braga Universidade de Lisboa

The use of cloud computing to offer High Performance Computing (HPC) services has been widely discussed in the academia and industry. In this respect, this dissertation is included in the context of designing a cloud computing platform for the development of component-based parallel computing applications, referred as cloud of components. Many important challenges about using the cloud of components relate to parallel programming, an error-prone task due to synchronization issues, which may lead to abortion and production of incorrect data during execution of applications, and the inefficient use of computational resources. These problems may be very relevant in the case of long running applications with tight timelines to obtain critical results, quite common in the context of HPC. One possible solution to these problems is the formal analysis of the behavior of the components of an application through the cloud services, before their execution. Thus, the users of the components may know if a component can be safely used in their application. In this scenario, formal methods becomes useful. In this dissertation, it is proposed a process for specification and derivation of parallel components implementation for the cloud of components. This process involves the formal specification of the components behavior through contracts described using the Circus formal specification language. Then, through a refinement and translation process, which takes the contract as a start point, one may produce an implementation of a component that may execute on a parallel computing platform. Through this process, it becomes possible to offer guarantees to developers about the components behavior in their applications. To validate the proposed idea, the process is applied to contracts that have been described based on two benchmarks belonging to the NAS parallel Benchmarks, widely adopted in HPC for evaluate the performance of parallel programming and computing platforms.

关键词： Ciência da computação Métodos formais Programação paralela Componentes de software Formal methods parallel programing Dissertação

来源：评论

学校读者我要写书评

暂无评论

Joining Forces: A RIPPL Effect? A Constraint-Oriented Perspective on a Pervasive Pattern Language

Joining Forces: A RIPPL Effect? A Constraint-Oriented Perspe...

引用

Computation World - Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns Conference

作者： Gibbs, Celina Coady, Yvonne Univ Victoria Dept Comp Sci Victoria BC Canada

ISBN: (纸本)9781424451661

Creating a unified catalogue of patterns is challenging in any domain. Difficultly lies in representing relationships between patterns, compounded by natural growth as new patterns are discovered. Existing pattern languages successfully describe relationships in small collections of patterns, but this approach lacks a systematic process that will scale to a growing catalogue of patterns. RIPPL (Relationship Initiated Pervasive Pattern Language) structures patterns and tensions in their tradeoffs and facilitates comparison and composition in terms of domain specific constraints. A case study applying the proposed methodology to two existing pervasive pattern languages reveals the ability to represent pattern relationships in a structured, systematic form that can scale across individual pattern languages.

关键词： patterns pervasive systems parallel programing

来源：评论

学校读者我要写书评

暂无评论

Implementação da biblioteca de comunicação DECK sobre o padrão de protocolo de comunicação em nível de usuário VIA

Implementação da biblioteca de comunicação DECK sobre o ...

引用

作者： Silva, Leonardo Alves de Paula e

Techniques like zero-copy and operating system bypass can decrease communication latency and increase bandwidth. Smaller latencies and greater bandwidths contribute for better performance in parallel applications and became them more scalables as well. Communication protocols using these techiniques are known as user-level communication protocols. Based on experiences from another research groups implementing communication libraries and parallel programming libraries over VIA and experience from GPPD implementing DECK, the text presents the implementation of DECK primitives over VIA standard, which is classified as an user-level protocol. The goal of this master’s thesis is implement DECK over VIA avoiding any intermediate copy between the data source and destination, reaching zero-copy. DECK/VIA is the unique library among all libriaries over VIA here studied totally free of intermediate copies, although a synchronous behavior was forced to keep this compromise. VI-GM, an implementation of VIA for Myrinet networks was used to implement DECK/VIA library. The implementation of DECK/VIA has shown a one-way latency of 86.85 μs and a maximum bandwidth of 205 Mbytes/s, 82% of nominal bandwidth of Myrinet network. To validate the library, the FT application from NPB was executed. Their results were compared with the results obtained with DECK/GM, for Myrinet networks and DECK/TCP, for Ethernet networks. Even with one additional software layer and doing all communication using a handshake, DECK/VIA reaches speedup values very closer of DECK/GMand DECK/TCP on Gigabit Ethernet and was better than DECK/TCP on Fast Ethernet. When implementing parallel programming libraries, we concluded the ideal solution is that meets the good balance between the quest for performance and the keeping of original library’s semantics. This work contibutes with a survey of communication libraries development, their problems and their solutions, which can guide others researchers performing the

关键词： parallel programing Processamento paralelo Protocolo : Comunicação : Dados Cluster computing DECK User-level communication protocols Zero-copy Operating systembypassing Virtual interface architecture Myrinet Dissertação

来源：评论

学校读者我要写书评

暂无评论

Paralelizace v jazyce Rust

Paralelizace v jazyce Rust

引用

作者： Šlampa, Ondřej Brno University of Technology

Tato práce se zabývá paralelizací v jazyce Rust. Cílem této práce je zhodnotit výkon a použitelnost jazyka Rust pro tvorbu paralelních aplikací ve srovnání s... 详细信息

Tato práce se zabývá paralelizací v jazyce Rust. Cílem této práce je zhodnotit výkon a použitelnost jazyka Rust pro tvorbu paralelních aplikací ve srovnání s již používanou alternativou - OpenMP. Toto porovnání bylo provedeno na výpočtu n-rozměrné konvoluce. V závěru se nachází zhodnocení výsledků a návrhy pro jejich další využití.

关键词： Rust OpenMP paralelizace paralelní programování konvoluce Rust OpenMP paralelization parallel programing convolution Text

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：