algorithms with operations on large regular data structures such as imageprocessing can be highly accelerated when executed as hardware tasks in an FPGA fabric. The Dynamic Partial Reconfiguration (DPR) feature of ne...
详细信息
ISBN:
(纸本)9783319776101;9783319776095
algorithms with operations on large regular data structures such as imageprocessing can be highly accelerated when executed as hardware tasks in an FPGA fabric. The Dynamic Partial Reconfiguration (DPR) feature of new SRAM-based FPGA families allows a dynamic swapping and replacement of hardware tasks during runtime. Particularly embedded systems with processing chains that change over time or that are too large to be implemented in an FPGA fabric in parallel, benefit from DPR. In this paper we present a complete framework for hardware acceleration using DPR in the microkernel based Genode OS. This makes the DPR feature available not only for the high-performance computing field, but also for safety-critical applications. The new framework is evaluated for an exemplary imaging application running on a Xilinx Zynq-7000 SoC.
The current trend in automotive systems is to integrate more software applications into fewer ECU's to decrease the cost and increase efficiency. This means more applications share the same resources which in turn...
详细信息
The current trend in automotive systems is to integrate more software applications into fewer ECU's to decrease the cost and increase efficiency. This means more applications share the same resources which in turn can cause congestion on resources such as such as caches. Shared resource congestion may cause problems for time critical applications due to unpredictable interference among applications. It is possible to reduce the effects of shared resource congestion using cache partitioning techniques, which assign dedicated cache lines to different applications. We propose a cache partition controller called LLC-PC that uses the Palloc page coloring framework to decrease the cache partition sizes for applications during runtime. LLC-PC creates cache partitioning directives for the Palloc tool by evaluating the performance gained from increasing the cache partition size. We have evaluated LLC-PC using 3 different applications, including the SIFT imageprocessing algorithm which is commonly used for feature detection in vision systems. We show that LLC-PC is able to decrease the amount of cache size allocated to applications while maintaining their performance allowing more cache space to be allocated for other applications.
This paper presents a novel technique of factorization for 2-D non-separable quaternionic paraunitary filter banks (2-D NSQ-PUFB). Two-dimensional factorization structures called "16in-16out" and "64in-...
详细信息
ISBN:
(纸本)9788362065332
This paper presents a novel technique of factorization for 2-D non-separable quaternionic paraunitary filter banks (2-D NSQ-PUFB). Two-dimensional factorization structures called "16in-16out" and "64in-64out" respectively for 4-channel and 8-channel Q-PUFB based on the proposed technique are shown. The given structures can be mapped to parallel-pipeline processor architecture with a minimum latency time 2(N + 1) quaternion multiplication operations, where N is transform order of the Q-PUFB. The latency of parallel-pipeline processing does not depend on the size of the original image in contrast to the conventional 2-D transform. The coding gains CG(MD) of 2-D non-separable Q-PUFBs for the isotropic auto-correlation function model with the correlation factor rho = 0.95 are the following: CG(MD) = 13.4 dB for "16in-16out" structure and CG(MD) = 15.6 dB for "64in-64out" structure.
We investigated the peculiarities of the terahertz pulse time-domain holography principle in the case of raster scanning with the balance detection system. The noise in this system represents a Skellam distribution mo...
详细信息
We investigated the peculiarities of the terahertz pulse time-domain holography principle in the case of raster scanning with the balance detection system. The noise in this system represents a Skellam distribution model, which differentiates it from systems based on a photoconductive antenna. We analyzed this Skellam model and provided both numerical and experimental investigations. We found that the variance of the noise in the balance detection system does not depend on the true signal. Complex-domain images obtained in this model are filtered by block-matching algorithms adapted for spatio-temporal and spatiospectral volumetric data. We presented a new cube complex-domain filter algorithm that uses block matching in all 3D data sets simultaneously in spatial and frequency coordinates. A combination of temporal and complex-domain filters allows us to expand the dynamic range of terahertz frequencies for which we can obtain amplitude/phase information. Experimental data demonstrate an improvement in the quality of the resultant images both in the time domain and complex-spectral domain. The simulation and experimental results are in good agreement. (C) 2019 Optical Society of America
To improve the performance of remote sensing images multiclass classification we propose two greedy algorithms of feature selection. The discriminant analysis criterion and regression coefficients are used as the meas...
详细信息
Interferometric Synthetic Aperture Radar (InSAR) is a remote sensing technology used for estimating the displacement of an object on the ground or the earth's surface itself. Persistent Scatterer-InSAR (PS-InSAR) ...
详细信息
Interferometric Synthetic Aperture Radar (InSAR) is a remote sensing technology used for estimating the displacement of an object on the ground or the earth's surface itself. Persistent Scatterer-InSAR (PS-InSAR) is a category of time series algorithms enabling high resolution monitoring. PS-InSAR relies on successful selection of points that appear stable across a set of satellite images taken over time. This paper presents PtSel, a new algorithm for selecting these points, a problem known as Persistent Scatterer Selection. The key advantage of PtSel over the key existing techniques is that it does not require model assumptions, yet preserves solution accuracy. Motivated by the abundance of parallelism the algorithm exposes, we have implemented it for GPUs. Our evaluation using real-world data shows that the GPU implementation not only offers superior performance but also scales linearly with GPU count and workload size. We compare the GPU implementation and a parallel CPU implementation: a consumer grade GPU offers 18x speedup over a 16-core Ivy Bridge Xeon System, while four GPUs offer 65x speedup. The GPU solution consumes 28x less energy than the CPU-only solution. Additionally, we present a comparison with the most widely used PS-interferometry software package StaMPS, in terms of point selection coverage and precision.
Automatic mouth detection can assist in controlling a robotic system with self-feeding of individuals with disability. To address this need we developed and evaluated algorithms that: 1) detect and track the mouth of ...
详细信息
ISBN:
(纸本)9781538659168
Automatic mouth detection can assist in controlling a robotic system with self-feeding of individuals with disability. To address this need we developed and evaluated algorithms that: 1) detect and track the mouth of an individual in real-time, and 2) classify if the mouth is open or closed. A k-nearest neighbors (KNN) clustering algorithm was used to classify and recognize the mouth’s posture. The KNN algorithm classified image frames using features extracted by four methods including a histogram of oriented gradients, Harris-Stephens algorithm, maximally stable extremal regions, and local binary patterns. The results of this study indicated a high classification accuracy (~87%) using 10-fold cross validation for three participants without disability. The study has shown that the algorithms can detect the mouth postures of a person in near real-time (<1s) while they have a robot-assisted meal in a social setting.
This paper focuses on solving a relevant and pressing safety issue on intercity roads. Two approaches were considered for solving the problem of traffic signs recognition;the approaches involved neural networks to ana...
详细信息
This paper focuses on solving a relevant and pressing safety issue on intercity roads. Two approaches were considered for solving the problem of traffic signs recognition;the approaches involved neural networks to analyze images obtained from a camera in the real-time mode. The first approach is based on a sequential imageprocessing. At the initial stage, with the help of color filters and morphological operations (dilatation and erosion), the area containing the traffic sign is located on the image, then the selected and scaled fragment of the image is analyzed using a feedforward neural network to determine the meaning of the found traffic sign. Learning of the neural network in this approach is carried out using a backpropagation method. The second approach involves convolution neural networks at both stages, i.e. when searching and selecting the area of the image containing the traffic sign, and when determining its meaning. Learning of the neural network in the second approach is carried out using the intersection over union function and a loss function. For neural networks to learn and the proposed algorithms to be tested, a series of videos from a dash cam were used that were shot under various weather and illumination conditions. As a result, the proposed approaches for traffic signs recognition were analyzed and compared by key indicators such as recognition rate percentage and the complexity of neural networks' learning process.
This paper outlines image domain material decomposition algorithms that have been routinely used in MARS spectral CT systems. These algorithms (known collectively as MARSMD) are based on a pragmatic heuristic for solv...
详细信息
This paper outlines image domain material decomposition algorithms that have been routinely used in MARS spectral CT systems. These algorithms (known collectively as MARSMD) are based on a pragmatic heuristic for solving the under-determined problem where there are more materials than energy bins. This heuristic contains three parts: (1) splitting the problem into a number of possible sub-problems, each containing fewer materials;(2) solving each sub-problem;and (3) applying rejection criteria to eliminate all but one sub-problem's solution. An advantage of this process is that different constraints can be applied to each sub-problem if necessary. In addition, the result of this process is that solutions will be sparse in the material domain, which reduces crossover of signal between material images. Two algorithms based on this process are presented: the Segmentation variant, which uses segmented material classes to define each subproblem;and the Angular Rejection variant, which defines the rejection criteria using the angle between reconstructed attenuation vectors.
Lane detection algorithms have been the key enablers for a fully-assistive and autonomous navigation systems. In this paper, a novel and pragmatic approach for lane detection is proposed using a convolutional neural n...
详细信息
Lane detection algorithms have been the key enablers for a fully-assistive and autonomous navigation systems. In this paper, a novel and pragmatic approach for lane detection is proposed using a convolutional neural network (CNN) model based on SegNet encoder-decoder architecture. The encoder block renders low-resolution feature maps of the input and the decoder block provides pixel-wise classification from the feature maps. The proposed model has been trained over 2000 image data-set and tested against their corresponding ground-truth provided in the data-set for evaluation. To enable real-time navigation, we extend our model's predictions interfacing it with the existing Google APIs evaluating the metrics of the model tuning the hyper-parameters. The novelty of this approach lies in the integration of existing segnet architecture with google APIs. This interface makes it handy for assistive robotic systems. The observed results show that the proposed method is robust under challenging occlusion conditions due to pre-processing involved and gives superior performance when compared to the existing methods.
暂无评论