ISBN: 9781424438686 (print)
This paper proposes high-performance, flexibly configurable extract instructions for stream cipher processing, based on an analysis of the structures and operating characteristics of more than forty public stream cipher algorithms. The extract instructions support four data widths, and ten parallel extract modes are provided by exploiting instruction-level parallelism on a VLIW architecture. Furthermore, the corresponding reconfigurable hardware circuit is implemented. By configuring the circuit, extraction at different data widths and in different parallel modes can be performed efficiently, so the circuit can serve as an important acceleration unit in specialized stream cipher processing.
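The abstract does not define the extract operation, so the following is only a software sketch under an assumption: that "extract" means gathering selected bits of a register (for example, tap positions of a shift register) into a packed result. The function names `extract_bits` and `extract_field` are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def extract_bits(word: int, positions: Sequence[int]) -> int:
    """Gather the bits of `word` at `positions` into a packed result.

    positions[0] becomes bit 0 of the result, positions[1] bit 1, and so on.
    (Assumed semantics; the paper's instruction encoding is not given.)
    """
    result = 0
    for i, pos in enumerate(positions):
        result |= ((word >> pos) & 1) << i
    return result


def extract_field(word: int, offset: int, width: int) -> int:
    """Extract a contiguous `width`-bit field starting at bit `offset`."""
    return (word >> offset) & ((1 << width) - 1)
```

In this sketch, a different data width or extract mode corresponds to a different `width` or `positions` argument, which is loosely analogous to reconfiguring the hardware unit.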
ISBN: 9781424446421 (print)
The concepts of Artifact-as-Organism and Creator-in-a-Box, together with their autonomy, adaptation and evolution, are proposed as purely engineering motivations for incorporating the cognitive attributes of consciousness and self-awareness into robots, automata, machines and artifacts. These ideas are then used to create computational models of cognitive robots and machine consciousness that can be executed on modern parallel, distributed, many-core, and massively multi-core computer architectures.
ISBN: 9783642030949 (print)
In this paper, based on the advantages of both optical transmission and electronic computation, we first provide an O(log log N) bus-cycle parallel algorithm for weighted distance transforms of an N × N binary image on a linear array with a reconfigurable pipelined bus system using N^2 processors. By increasing the number of processors, the proposed algorithm can be run in O(log log_q N) and O(1) bus cycles using qN^2 and N^(2+1/ε) processors respectively, where 2 ≤ q ≤ √N and ε is a constant with ε ≥ 1. These results improve on previously known algorithms developed on various parallel computation models.
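For orientation, a weighted (chamfer) distance transform can be computed sequentially with the classic two-pass scan. The sketch below is that sequential baseline, not the parallel bus-based algorithm of the abstract. The weights `a=3` (orthogonal step) and `b=4` (diagonal step) are a common choice assumed here, and pixels with value 1 are assumed to be the feature pixels.

```python
def chamfer_distance_transform(image, a=3, b=4):
    """Two-pass weighted distance transform of a binary image (list of lists).

    Returns, for every pixel, the weighted distance to the nearest pixel
    whose value is 1. Assumes the image contains at least one such pixel;
    otherwise all entries stay at infinity.
    """
    n_rows, n_cols = len(image), len(image[0])
    inf = float("inf")
    dist = [[0 if image[r][c] else inf for c in range(n_cols)]
            for r in range(n_rows)]

    # Forward pass: propagate distances from the top-left neighbours.
    for r in range(n_rows):
        for c in range(n_cols):
            d = dist[r][c]
            if c > 0:
                d = min(d, dist[r][c - 1] + a)
            if r > 0:
                d = min(d, dist[r - 1][c] + a)
                if c > 0:
                    d = min(d, dist[r - 1][c - 1] + b)
                if c < n_cols - 1:
                    d = min(d, dist[r - 1][c + 1] + b)
            dist[r][c] = d

    # Backward pass: propagate distances from the bottom-right neighbours.
    for r in range(n_rows - 1, -1, -1):
        for c in range(n_cols - 1, -1, -1):
            d = dist[r][c]
            if c < n_cols - 1:
                d = min(d, dist[r][c + 1] + a)
            if r < n_rows - 1:
                d = min(d, dist[r + 1][c] + a)
                if c < n_cols - 1:
                    d = min(d, dist[r + 1][c + 1] + b)
                if c > 0:
                    d = min(d, dist[r + 1][c - 1] + b)
            dist[r][c] = d
    return dist
```

This baseline visits each of the N^2 pixels twice, which is the sequential cost that the parallel bounds quoted above are measured against.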
ISBN: 9783642030949 (print)
The Markov decision process (MDP) provides the foundation for a number of problems in areas such as artificial intelligence, automated planning and reinforcement learning. MDPs can be solved efficiently in theory. However, for large scenarios, more investigation is needed to find practical algorithms. Algorithms for solving MDPs have a natural concurrency. In this paper, we present parallel algorithms based on dynamic programming. The computation cost and communication complexity of this method are analyzed. Moreover, experimental results demonstrate excellent speedups and scalability.
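The abstract does not say which dynamic programming method is parallelized. As an illustration only, the sketch below uses standard value iteration with a Jacobi-style sweep, in which every state's update reads only the previous iteration's values. That independence is one form of the natural concurrency mentioned above. The data layout (`transitions`, `rewards`) is a hypothetical choice for this example.

```python
def value_iteration(transitions, rewards, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Value iteration for a finite MDP (sequential sketch).

    transitions[s][a] is a list of (probability, next_state) pairs;
    rewards[s][a] is the immediate reward for action a in state s.
    Returns the list of state values.
    """
    n_states = len(transitions)
    values = [0.0] * n_states
    for _ in range(max_iter):
        # Each entry depends only on the old `values`, so the per-state
        # updates could be distributed across workers without conflicts.
        new_values = [
            max(
                rewards[s][a]
                + gamma * sum(p * values[s2] for p, s2 in transitions[s][a])
                for a in range(len(transitions[s]))
            )
            for s in range(n_states)
        ]
        delta = max(abs(nv - v) for nv, v in zip(new_values, values))
        values = new_values
        if delta < tol:
            break
    return values
```

In a parallel version of this sketch, the only data exchanged between sweeps would be the updated value vector, which is where the communication cost arises.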
ISBN: 9783642036446 (digital), 9783642036439 (print)
The existing solutions for programming parallel architectures range from parallelizing compilers to distributed concurrent programming. Intermediate approaches propose a more structured parallelism. Algorithmic skeletons are higher-order functions that capture the patterns of parallel algorithms. The user of the library only has to compose some of the skeletons to write her parallel application. When designing a parallel program, parallel performance is important, so it is valuable for the programmer to rely on a simple yet realistic parallel performance model such as the Bulk Synchronous Parallel (BSP) model. We present OSL, the Orleans Skeleton Library of BSP algorithmic skeletons in C++. It offers data-parallel skeletons on arrays as well as communication-oriented skeletons. The performance of OSL is demonstrated with two applications: the heat equation and FFT.
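To make the idea of composing skeletons concrete, here is a toy simulation in Python. It is not OSL's C++ API: the names `DistArray`, `map_skel` and `reduce_skel` are hypothetical, and the "processors" are just Python lists held in one process. It only illustrates how a map (local computation) and a reduce (local computation followed by one communication step) fit the BSP superstep structure.

```python
from functools import reduce as fold


class DistArray:
    """A toy block-distributed array: one list per simulated processor."""

    def __init__(self, data, n_procs):
        size = -(-len(data) // n_procs)  # ceiling division
        self.blocks = [list(data[i * size:(i + 1) * size])
                       for i in range(n_procs)]


def map_skel(f, arr):
    """Data-parallel map: purely local work, no communication."""
    out = DistArray([], len(arr.blocks))
    out.blocks = [[f(x) for x in block] for block in arr.blocks]
    return out


def reduce_skel(op, arr, identity):
    """Reduce: a local fold per block, then one combining step.

    `op` is assumed to be associative with `identity` as its neutral element.
    """
    partials = [fold(op, block, identity) for block in arr.blocks]
    # In a real BSP run the partial results would be exchanged here.
    return fold(op, partials, identity)


# Usage: sum of squares of 0..9 over four simulated processors.
squares = map_skel(lambda x: x * x, DistArray(range(10), 4))
total = reduce_skel(lambda x, y: x + y, squares, 0)
```

Here `total` is 285. In a BSP cost model, the map would contribute only local computation time, while the reduce would add one communication phase and one synchronization barrier.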
This paper analyzes energy characteristics of parallel algorithms executed on scalable multicore processors. Specifically, we provide a methodology for evaluating energy scalability of parallel algorithms while satisf...
ISBN: 9783642030949 (print)
Adleman first showed that deoxyribonucleic acid (DNA) strands could be used to compute a solution to an instance of the NP-complete Hamiltonian Path Problem (HPP). Lipton also demonstrated that Adleman's techniques could be used to solve the satisfiability (SAT) problem. In this paper, it is demonstrated how the DNA operations presented by Adleman and Lipton can be used to develop a DNA-based algorithm for solving the 0-1 Knapsack Problem.
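Adleman-Lipton style DNA computing is usually described as generating all candidate solutions at once and then filtering out the invalid ones. The sketch below is a software analogy of that generate-and-filter idea applied to the 0-1 knapsack problem. It is an assumption about the general approach, not a transcription of the paper's DNA operations, and the function name is hypothetical.

```python
from itertools import product


def knapsack_generate_and_filter(weights, values, capacity):
    """Brute-force 0-1 knapsack in three stages: generate, filter, select.

    Returns (best_bits, best_value), where best_bits[i] is 1 if item i
    is packed. Assumes capacity >= 0 so the empty selection is feasible.
    """
    n = len(weights)

    # Generate: every 0/1 assignment, analogous to the initial strand pool.
    candidates = product((0, 1), repeat=n)

    # Filter: discard assignments whose total weight exceeds the capacity.
    feasible = (
        bits for bits in candidates
        if sum(w * b for w, b in zip(weights, bits)) <= capacity
    )

    def total_value(bits):
        return sum(v * b for v, b in zip(values, bits))

    # Select: keep the feasible assignment with the largest total value.
    best = max(feasible, key=total_value)
    return best, total_value(best)
```

The enumeration takes 2^n steps in software. The appeal of the DNA setting is that the generate and filter stages act on all candidates in parallel, at the cost of a pool that grows exponentially with the number of items.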
ISBN: 9783642030949 (print)
In this paper, RGB-to-gray conversion, binarization and the morphological close operation are used to process fire flame images. The area, mean gray level and circularity of fire flame images and interference images are then analyzed. The experimental data show appreciable differences between fire flames and interference sources in four features: the autocorrelation function of the area sequence, the variance of the mean gray sequence, and the mean and autocorrelation function of the circularity sequence. On this basis, the use of these four dynamic video features as fire identification parameters is proposed. The experimental results show that the image processing method and fire dynamic features described in this paper can identify fire flames correctly, reducing the false alarm rate and the false dismissal rate.
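The abstract does not give formulas for its features. The sketch below assumes two common definitions: circularity as the isoperimetric ratio 4πA/P² (which requires a region perimeter P, not mentioned in the abstract) and a normalized autocorrelation of a per-frame feature sequence. Both function names are hypothetical.

```python
import math


def circularity(area, perimeter):
    """Isoperimetric ratio 4*pi*A / P^2 (assumed definition).

    Equals 1.0 for a perfect disc and decreases for ragged outlines.
    """
    return 4.0 * math.pi * area / (perimeter ** 2)


def autocorrelation(seq, lag):
    """Normalized autocorrelation of a sequence at the given lag.

    Returns 0.0 for a constant sequence, where the ratio is undefined
    (a convention chosen for this sketch).
    """
    n = len(seq)
    mean = sum(seq) / n
    dev = [x - mean for x in seq]
    denom = sum(d * d for d in dev)
    if denom == 0:
        return 0.0
    return sum(dev[i] * dev[i + lag] for i in range(n - lag)) / denom
```

Applied to the per-frame area or circularity of a segmented region, a flickering flame would be expected to give a different autocorrelation profile from a steady light source, which is the kind of difference the abstract reports.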
ISBN: 9780769539294 (print)
Motivated by the need to process large, high-resolution video data, this paper studies peer-to-peer (P2P) parallel computing for large data. The proposed P2P system is unstructured, completely self-organized and load balanced, which makes it very efficient at computing any kind of divisible large-data task. The system models and algorithms are described, and simulation results show that the proposed system performs better than an existing grid system.
ISBN: 9781424445523 (print)
This paper presents an FPGA-based parallel hardware architecture for real-time face detection. An image pyramid with twenty depth levels is generated from the input image. For these scaled-down images, a local binary pattern transform and feature evaluation are performed in parallel using the proposed block RAM-based window processing architecture. By sharing the feature look-up tables between two corresponding scaled-down images, the use of routing resources is reduced by half. For prototyping and evaluation purposes, the hardware architecture was integrated into a Virtex-5 FPGA. The experimental results show a processing speed of around 300 frames per second for standard VGA (640x480x8) images. In addition, the throughput of the implementation can be adjusted in proportion to the frame rate of the camera by synchronizing each individual module with the pixel sampling clock.
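As a software reference for the transform named above, the following sketch computes the basic 3x3 local binary pattern code for every interior pixel. The abstract does not specify the neighbour ordering or comparison rule, so the clockwise ordering from the top-left neighbour and the `>=` comparison used here are assumptions, and this is not the paper's hardware design.

```python
def lbp_3x3(image):
    """Basic 3x3 local binary pattern of a grayscale image (list of lists).

    Each interior pixel gets an 8-bit code: bit k is set when the k-th
    neighbour (clockwise from the top-left) is >= the centre pixel.
    The output is two rows and two columns smaller than the input.
    """
    # Neighbour offsets, clockwise starting at the top-left (assumed order).
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    rows, cols = len(image), len(image[0])
    codes = [[0] * (cols - 2) for _ in range(rows - 2)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            center = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                if image[r + dr][c + dc] >= center:
                    code |= 1 << bit
            codes[r - 1][c - 1] = code
    return codes
```

Each output code depends only on a 3x3 window, so all windows can be evaluated independently, which is the property a window-based parallel hardware pipeline can exploit.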