A high-speed stereovision system based on smart cameras is presented to track table tennis ball. A distributedparallelprocessing architecture is developed to improve the real-time performance of the system. A set of...
详细信息
Microscopic imaging is an important tool for characterizing tissue morphology and pathology. 3D reconstruction and visualization of large sample tissue structure requires registration of large sets of high-resolution ...
详细信息
Microscopic imaging is an important tool for characterizing tissue morphology and pathology. 3D reconstruction and visualization of large sample tissue structure requires registration of large sets of high-resolution images. However, the scale of this problem presents a challenge for automatic registration methods. In this paper we present a novel method for efficient automatic registration using graphics processing units (GPUs) and parallel programming. Comparing a C++ CPU implementation with Compute Unified Device Architecture (CUDA) libraries and pthreads running on GPU we achieve a speed-up factor of up to 4.11x with a single GPU and 6.68x with a GPU pair. We present execution times for a benchmark composed of two sets of large-scale images: mouse placenta (16K x16K pixels) and breast cancer tumors (23K x62K pixels). It takes more than 12 hours for the genetic case in C++ to register a typical sample composed of 500 consecutive slides, which was reduced to less than 2 hours using two GPUs, in addition to a very promising scalability for extending those gains easily on a large number of GPUs in a distributed system.
Analysis of biomedical images requires attention to image features that represent a small fraction of the total image size. A rapid method for eliminating unnecessary detail, analogous to pre-attentive processing in b...
详细信息
ISBN:
(纸本)9781424432950
Analysis of biomedical images requires attention to image features that represent a small fraction of the total image size. A rapid method for eliminating unnecessary detail, analogous to pre-attentive processing in biological vision, allows computational resources to be applied where most needed for higher-level analysis. In this report we describe a method for bottom up merging of pixels into larger units based on flexible saliency criteria using a method similar to structured adaptive grid methods used for solving differential equations on physical domains. While creating a multiscale quadtree representation of the image, a saliency test is applied to prune the tree to eliminate unneeded details, resulting in an image with adaptive resolution. This method may be used as a first step for image segmentation and analysis and is inherently parallel, enabling implementation on programmable hardware or distributed memory clusters.
Identifying peptides, which are short polymeric chains of amino acid residues in a protein sequence, is of fundamental importance in systems biology research. The most popular approach to identify peptides is through ...
详细信息
ISBN:
(纸本)9781424449231
Identifying peptides, which are short polymeric chains of amino acid residues in a protein sequence, is of fundamental importance in systems biology research. The most popular approach to identify peptides is through database search. In this approach, an experimental spectrum ("query") generated from fragments of a target peptide using mass spectrometry is computationally compared with a database of already known protein sequences. The goal is to detect database peptides that are most likely to have generated the target peptide. The exponential growth rates and overwhelming sizes of biomolecular databases make this an ideal application to benefit from parallel computing. However, the present generation of software tools is not expected to scale to the magnitudes and complexities of data that will be generated in the next few years. This is because they are all either serial algorithms or parallel strategies that have been designed over inherently serial methods, thereby requiring high space- and time-requirements. In this paper, we present an efficient parallel approach for peptide identification through database search. Three key factors distinguish our approach from that of existing solutions: i) (space) Given p processors and a database with N residues, we provide the first space-optimal algorithm (O(N/p)) under distributed memory machine model;ii) (time) Our algorithm uses a combination of parallel techniques such as one-sided communication and masking of communication with computation to ensure that the overhead introduced due to parallelism is minimal;and iii) (quality) The run-time savings achieved using parallelprocessing has allowed us to incorporate highly accurate statistical models that have previously been demonstrated to ensure high quality prediction albeit on smaller scale data. We present the design and evaluation of two different algorithms to implement our approach. Experimental results using 2.65 million microbial proteins show linear scaling up
Query optimization is the most critical phase in query processing. In this paper, we try to describe synthetically the evolution of query optimization methods from uniprocessor relational database systems to data Grid...
详细信息
Three-dimensional (3D) geophysical imaging is now receiving considerable attention for electrical conductivity mapping of potential offshore oil and gas reservoirs. The imaging technology employs controlled source ele...
详细信息
Three-dimensional (3D) geophysical imaging is now receiving considerable attention for electrical conductivity mapping of potential offshore oil and gas reservoirs. The imaging technology employs controlled source electromagnetic (CSEM) and magnetotelluric (MT) fields and treats geological media exhibiting transverse anisotropy. Moreover when combined with established seismic methods, direct imaging of reservoir fluids is possible. Because of the size of the 3D conductivity imaging problem, strategies are required exploiting computational parallelism and optimal meshing. The algorithm thus developed has been shown to scale to tens of thousands of processors. In one imaging experiment, 32,768 tasks/processors on the IBM Watson Research Blue Gene/L supercomputer were successfully utilized. Over a 24 hour period we were able to image a large scale field data set that previously required over four months of processing time on distributed clusters based on Intel or AMD processors utilizing 1024 tasks on an InfiniBand fabric. Electrical conductivity imaging using massively parallel computational resources produces results that cannot be obtained otherwise and are consistent with timeframes required for practical exploration problems.
Applying wireless sensor networks (WSNs) for bridge monitoring has received considerable interests over the past few years. In this paper two WSN-agility problems are identified based on the deployment experiences of ...
详细信息
This paper presents an overview of low level parallelimageprocessing algorithms and their implementation for active vision systems. Authors have demonstrated novel low level imageprocessing algorithms for point ope...
详细信息
This paper presents an overview of low level parallelimageprocessing algorithms and their implementation for active vision systems. Authors have demonstrated novel low level imageprocessing algorithms for point operators, local operators, dithering, smoothing, edge detection, morphological operators, image segmentation and image compression. The algorithms have been prepared & described as pseudo codes. These algorithms have been simulated using parallel Computing Toolboxtrade (PCT) of MATLAB. The PCT provides parallel constructs in the MATLAB language, such as parallel for loops, distributed arrays and message passing & enables rapid prototyping of parallel code through an interactive parallel MATLAB session.
This paper describes a new scheme for feature extraction from facial images on FPGA. The proposed method is comprised of two stages. The first stage uses the 5/3 DWT to decompose the original face image into LL, LH, H...
详细信息
This paper describes a new scheme for feature extraction from facial images on FPGA. The proposed method is comprised of two stages. The first stage uses the 5/3 DWT to decompose the original face image into LL, LH, HL, and HH wavelet coefficient to reduce the size of the image. In the second stage, PCA is employed to extract the face features from the wavelet coefficients. Here we use the power iteration algorithm to find the eigenvector of the covariance matrix. We present an efficient hardware architecture using combination of parallelprocessing module and serial processing module. This method can take the benefits of parallel usage advantage of FPGAs and can save hardware resources. Complete hardware implemented on a Stratix ii FPGA. The experimental results show that the system works with high processing rate and only 21% of the logic resources an FPGA are consumed by face recognition logic. Thus it is very suitable for the low cost implementation of the face recognition system.
Identifying peptides, which are short polymeric chains of amino acid residues in a protein sequence, is of fundamental importance in systems biology research. The most popular approach to identify peptides is through ...
详细信息
Identifying peptides, which are short polymeric chains of amino acid residues in a protein sequence, is of fundamental importance in systems biology research. The most popular approach to identify peptides is through database search. In this approach, an experimental spectrum ("query'') generated from fragments of a target peptide using mass spectrometry is computationally compared with a database of already known protein sequences. The goal is to detect database peptides that are most likely to have generated the target peptide. The exponential growth rates and overwhelming sizes of biomolecular databases make this an ideal application to benefit from parallel computing. However, the present generation of software tools is not expected to scale to the magnitudes and complexities of data that will be generated in the next few years. This is because they are all either serial algorithms or parallel strategies that have been designed over inherently serial methods, thereby requiring high space- and time- requirements. In this paper, we present an efficient parallel approach for peptide identification through database search. Three key factors distinguish our approach from that of existing solutions: (i) (space) Given p processors and a database with N residues, we provide the first space-optimal algorithm (O(N/p)) under distributed memory machine model; (ii) (time) Our algorithm uses a combination of parallel techniques such as one-sided communication and masking of communication with computation to ensure that the overhead introduced due to parallelism is minimal; and (iii) (quality) The run-time savings achieved using parallelprocessing has allowed us to incorporate highly accurate statistical models that have previously been demonstrated to ensure high quality prediction albeit on smaller scale data. We present the design and evaluation of two different algorithms to implement our approach. Experimental results using 2.65 million microbial proteins show linear sca
暂无评论