检索结果-内蒙古大学图书馆

Implementation of Sobel Edge Detection on DRRA and DiMArch architectures 26

Implementation of Sobel Edge Detection on DRRA and DiMArch A...

26th Euromicro Conference on Digital System Design (DSD) / 49th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)

作者： Pudi, Dhilleswararao Ryansh, Rajeev Goudu, Vamsi Boppu, Srinivas Hemani, Ahmed Indian Inst Technol Bhubaneswar Sch Elect Sci Bhubaneswar India KTH Royal Inst Technol Sch EECS Stockholm Sweden

ISBN: (纸本)9798350344196

Edge detection is a fundamental operation in image processing, serving as a crucial step in various applications such as object recognition, image segmentation, and scene understanding. The Sobel edge detection algorithm has emerged as a widely used method for detecting vertical and horizontal edges in digital images. However, performing edge detection on high-resolution images with large dimensions can be computationally intensive and time-consuming. Specialized hardware solutions such as Field Programmable Gate Arrays (FPGAs) and Coarse-Grained Reconfigurable Arrays (CGRAs) offer significant advantages over general-purpose processors for implementing edge detection algorithms. This paper proposes algorithms for implementing the Sobel edge detection algorithm using two CGRA fabrics: dynamically reconfigurable resource array and distributed memory architecture. Furthermore, we discuss the implementation of Sobel edge detection on the target architecture for an input matrix of arbitrary size. Finally, the proposed approaches were compared with other CGRA-based implementations in terms of latency. The experimental results show that the proposed approaches exhibit significantly lower latency compared to other CGRA-based implementations.

关键词： Sobel edge detection Coarse-grained reconfigurable array Field programmable gate array Dynamically reconfigurable resource array distributed memory architecture

来源：评论

学校读者我要写书评

暂无评论

Real-time image analysis using wavelets: The "a trous" algorithm on MIMD architectures

Real-time image analysis using wavelets: The "a trous" algor...

引用

Conference on Real-Time Imaging IV

作者： Feil, M Uhl, A Salzburg Univ Res Inst Softwaretechnol A-5020 Salzburg Austria

ISBN: (纸本)0819431168

The "a trous" algorithm(1) represents a discrete approach to the classical continous wavelet transform.(2) Similar to the fast wavelet transform(3) the input signal is analyzed by using the coefficients of a properly chosen low-pass filter, but in contradistinction to the latter there follows no concluding decimation step. Examples of practical applications can be found in the field of cosmology for studying the formation of Large Scale,Structures in the Universe.(4) In this paper we develop parallel algorithms on different MIMD architectures for the two-dimensional "a trous" decomposition. We implement the algorithm on several distributed memory architectures using the PVM (Parallel Virtual Machine) paradigm and on a SGI POWERChallenge using a parallel version of the C programming language (PowerC). Finally we investigate experimental results obtained on both of them.

关键词： wavelet transform a trous algorithm distributed memory architecture shared memory architecture

来源：评论

学校读者我要写书评

暂无评论

architecture and the software environment of parallel computer Cenju-4

引用

NEC RESEARCH & DEVELOPMENT 1998年第4期39卷 385-390页

作者： Nakata, T Kanoh, Y Tatsukawa, K Yanagida, S Nishi, N Takayama, H NEC Corp Ltd C&C Media Res Labs Tokyo Japan NEC Corp Ltd Informatec Syst Tokyo Japan NEC Corp Ltd Comp Div Tokyo Japan

This paper describes the architecture and operating system, and gives an evaluation of NEC's new parallel computer Cenju-4 Major features of Cenju-4 are: a) parallel memory architecture which encompasses distributed shared memory and user-level inter-processor communication. b) Scalable system from 8 nodes to 1,024 nodes. Using the powerful RISC processor VR10000 (200 MHz) from MIPS II Technologies, Inc., Cenju-4 system can be configured from 8 nodes to 1,024 nodes, flexibly extending the system as the demand arises. c) Utilization of a flexible micro-kernel operating system. Since the system adopts a micro-kernel based operating system (MACH), it can be configured into several software environments such as UNIX double dagger server systems and, single system image systems. The key components of the system are two 1 M gate arrays which implement memory control, inter processor communication control and network communication controls. The programming environment provided are de-facto standard libraries, high-level programming languages such as MPI (Message Passing Interface), PVM (Parallel Virtual Machine) and HPF (High Performance Fortran). The operating system and the inter-processor communication libraries fully exploit the functionality of the hardware to realize an inter-processor communication latency of 4.5 mu s and the throughput of 169 MB/s at user program level.

关键词： high performance computing (HPC) parallel processing distributed memory architecture distributed shared memory (DSM)

来源：评论

学校读者我要写书评

暂无评论

Differentiation of MPSoCs Message Classes Using Multiple NoCs

Differentiation of MPSoCs Message Classes Using Multiple NoC...

引用

IEEE International Conference on Electronics, Circuits, and Systems

作者： Douglas R. G. Silva Fernando G. Moraes FACIN - PUCRS

ISBN: (纸本)9781509002474

MPSoCs using a distributed memory architecture generates a large volume of messages that may be classified in application messages, as defined by the application developer, and management messages, used to ensure the correct operation of the platform. Both messages classes normally use the same communication infrastructure. Thus, the application traffic can be adversely impacted by the management traffic. Several works observe that different messages classes can be distributed into multiple NoCs, improving the performance and power consumption of the platform. However, these works mainly target shared memory systems. This work suggests the utilization of multiple NoCs in an MPSoC using distributed memory architecture, specializing each network for different message classes. An improvement of up to 40% in the application messages jitter and an average improvement of 5% in the application execution time can be achieved using this strategy.

关键词： MPSoC Network-on-chip (NoC) Multiple networks distributed memory Messages multiprocessor system on chip distributed memory architecture application developer distributed memory execution time Communications infrastructure Differentiation Individualized Instruction

来源：评论

学校读者我要写书评

暂无评论

A New Scalable Parallel DBSCAN Algorithm Using the Disjoint-Set Data Structure 12

A New Scalable Parallel DBSCAN Algorithm Using the Disjoint-...

引用

ACM/IEEE International Conference for High Performance Computing, Networking, Storage, and Analysis

作者： Mostofa Ali Patwary Diana Palsetia Ankit Agrawal Wei-keng Liao Fredrik Manne Alok Choudhary Northwestern University University of Bergen

ISBN: (纸本)9781467308052

DBSCAN is a well-known density based clustering algorithm capable of discovering arbitrary shaped clusters and eliminating noise data. However, parallelization of DBSCAN is challenging as it exhibits an inherent sequential data access order. Moreover, existing parallel implementations adopt a master-slave strategy which can easily cause an unbalanced workload and hence result in low parallel efficiency. We present a new parallel DBSCAN algorithm (PDSDBSCAN) using graph algorithmic concepts. More specifically, we employ the disjoint-set data structure to break the access sequentiality of DBSCAN. In addition, we use a tree-based bottom-up approach to construct the clusters. This yields a better-balanced workload distribution. We implement the algorithm both for shared and for distributed memory. Using data sets containing up to several hundred million high-dimensional points, we show that PDSDBSCAN significantly outperforms the master-slave approach, achieving speedups up to 25.97 using 40 cores on shared memory architecture, and speedups up to 5,765 using 8,192 cores on distributed memory architecture.

关键词： Density based clustering Union-Find algorithm Disjoint-set data structure distributed memory architecture shared memory systems Data structures memory architecture distributed memory Parallel Lines algorithms Master-slave Workload Scalability

来源：评论

学校读者我要写书评

暂无评论

Decentralized Generic Rigidity Evaluation in Interconnected Systems

Decentralized Generic Rigidity Evaluation in Interconnected ...

引用

IEEE/RSJ International Conference on Intelligent Robots and Systems

作者： Ryan K. Williams Andrea Gasparri Attilio Priolo Gaurav S. Sukhatme Departments of Electrical Engineering and Computer Science at the University of Southern California Los Angeles CA 90089 USA Department of Engineering Roma Tre University Via della Vasca Navale 79. Roma 00146 Italy

ISBN: (纸本)9781467363563

In this paper, we consider the problem of evaluating the generic rigidity of an interconnected system in the plane, without a priori knowledge of the network's topological properties. We propose the decentralization of the pebble game algorithm of Jacobs et. al., an O(n~2) method that determines the generic rigidity of a planar network. Our decentralization is based on asynchronous inter-agent message-passing and a distributed memory architecture, coupled with consensus-based auctions for electing leaders in the system. We provide analysis of the asynchronous messaging structure and its interaction with leader election, and Monte Carlo simulations demonstrating complexity and correctness. Finally, a novel rigidity evaluation and control scenario in the accompanying media illustrates the applicability of our proposed algorithm.

关键词： accompanying evaluation control scenario stiffness decentralization large-scale systems distributed memory architecture planar network Decentralized Asynchronous Monte Carlo technique

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：