Convolutional Neural Networks (CNN's) are known to perform well on computer vision tasks such as image classification, image segmentation, and object detection. However, one major drawback of CNN's is the huge...
Convolutional Neural Networks (CNN's) are known to perform well on computer vision tasks such as image classification, image segmentation, and object detection. However, one major drawback of CNN's is the huge amount of computing and memory resources needed to train them. In this paper, we propose an architectural unit which we call Upsampling-Based wavelet Residual Block (UBWRB), that utilizes the 2D discrete wavelet transform coupled with upsampling operators and a residual connection to extract features from image data while having relatively fewer trainable parameters as compared to traditional convolutional layers. The discrete wavelet transform is a family of transforms that find extensive applications in signalprocessing and time-frequency analysis. For this paper, we use the filter-bank implementation of the discrete wavelet transform, allowing it to act in a similar fashion to a convolutional layer with fixed kernel weights. We demonstrate the performance and parameter-efficiency of CNN's with UBWRB's in the task of image classification by training them on the MNIST, Fashion-MNIST, and CIFAR-10 datasets. Our best-performing models achieve a test accuracy of 99.34% on the MNIST dataset while having less than 120,000 trainable parameters, and 92.90% and 84.27% on the Fashion-MNIST and CIFAR-10 datasets respectively, with both having less than 180,000 trainable parameters.
imageprocessing has gained an increased usage and impact in modern pavement networks automatic distress severity classification (DSC). DSC defines priorities and maintenance resources optimum allocation in order to a...
详细信息
ISBN:
(纸本)9781728133775
imageprocessing has gained an increased usage and impact in modern pavement networks automatic distress severity classification (DSC). DSC defines priorities and maintenance resources optimum allocation in order to achieve a cost-effective rehabilitation process. This paper presents a novel computer vision algorithm having the ability to process, isolate and evaluate the distress severity level of a pavement. A pavement color image is converted to grayscale and then processed for image denoising of the granularity and complex texture that represent and artifact in cracks edge detection. The processing is achieved by a 2D dual-tree double density wavelet transform filter banks that significantly reduces the granularity noise while preserving the pavement cracks for edge detection. The 2D wavelet FIR filters perform analysis, soft thresholding then a synthesis of the image. The second step is then an edge detection process followed by morphological filtering and labeled components size-histogram filter to isolate false edges as residuals of denoising. A final step is performed by two Savitzky-Golay filters for the detection of longitudinal and transverse alligator cracks projections. A weighted score function with multiple parameters is used for DSC.
This paper introduces a novel framework for single-pixel imaging via compressive sensing (CS) in shift-invariant (SI) spaces by exploiting the sparsity property of a wavelet representation. We reinterpret the acquisit...
详细信息
Synthetic aperture radar (SAR) images are difficult to analyze due to the presence of speckle noise. Speckle noise must be filtered out before applying to other imageprocessingapplications. Three-layered feed forwar...
详细信息
Synthetic aperture radar (SAR) images are difficult to analyze due to the presence of speckle noise. Speckle noise must be filtered out before applying to other imageprocessingapplications. Three-layered feed forward back propagation neural network (TLFFBPNN) has been proposed to suppress the speckle noise. Gray-level co-occurrence matrix properties have been extracted, and back propagation training algorithm is used to train the neural network. The performance metrics such as peak signal to noise (PSNR), structural similarity index matrix (SSIM), edge preservation index (EPI), equivalent number of looks (ENL), and speckle suppression index (SSI) have been evaluated to find the efficiency of TLFFBPNN and compared with four recently developed de-speckling techniques. The exploratory outcomes show that the TLFFBPNN method has better de-speckling execution with great edge preservation. The comparative outcome reveals that the proposed TLFFBPNN de-speckled method outperformed in terms of PSNR of 0.98%, SSIM of 1.0%, SSI of 2.0%, EPI of 0.84%, and ENL of 0.5% when compared with the Wiener Filter Sparse Optimization in Contourlet transform domain de-speckling method.
In this paper, a new robust reversible data hiding method is proposed. The method is designed based on wavelet modifications which result in a scalable data hiding scheme. The well-known biorthogonal wavelets are modi...
详细信息
In this paper, a new robust reversible data hiding method is proposed. The method is designed based on wavelet modifications which result in a scalable data hiding scheme. The well-known biorthogonal wavelets are modified according to the watermarking bits. This is done in a way that the embedded bit can easily be interpreted based on the wavelet coefficients of the watermarked image and regardless of its resolution. Following such an algorithm would result in both reversibility and robustness. The proposed method is especially robust against wavelet resolution changing attacks and DWT based compressions. This can be of high value when dealing with low bandwidth communication situations. The practical results show high robustness against signalprocessing attacks and high PSNR and capacity in lossless scenarios.
Historical architecture is a primary element containing the identity values of a society. The wide diffusion of many ancient buildings gathering part of these values on painting walls over territories often characteri...
详细信息
ISBN:
(纸本)9783030588205;9783030588199
Historical architecture is a primary element containing the identity values of a society. The wide diffusion of many ancient buildings gathering part of these values on painting walls over territories often characterized by poor technological or economic resources brings to consider the development of low-cost protocols to inspect valued surfaces and to give the authorities in charge of preservation and restoration adequate technical information. Here we present the preliminary results of a recent application of remote sensing micro-geophysical techniques to typical architectural targets such as vaults. A modified commercial Digital Single-Lens Reflex (DSLR) camera was used to acquire multispectral datasets on portions of a painted vault. Multispectral datasets were used raw or after the application of a pre-processing step with a Multi images Stacking (MIS) algorithm. Multispectral images were then processed with spatial wavelet decomposition, histogram enhancing, thresholds application, image fusion, false colors compositing and Principal Component Analysis (PCA) techniques. Software used have been GNU image Manipulation Program (GIMP) and Mathworks MATLAB (which can be substituted for the processing steps proposed by the built-in functions of GNU OCTAVE open-source software). Processed images were able to highlight features on vault paintings revealing details of the surface or its very shallow layers which were impossible or very difficult to distinguish in raw data. In fact, they emphasized low-visible details, differences in apparently similar finishes or pigments, cracks and probably details of surface preparation.
Haar wavelet transform is an efficacious class of wavelet transform that satisfies both symmetry and orthogonality properties which are crucial in handling boundary distortion and energy preservation in image processi...
详细信息
Haar wavelet transform is an efficacious class of wavelet transform that satisfies both symmetry and orthogonality properties which are crucial in handling boundary distortion and energy preservation in imageprocessingapplications. Such applications demand power efficient design solutions that deliver high performance. Reversible logic has emerged as a solution that incorporates logical and physical reversibility to realise low power designs. This paper presents a reversible logic based design of Haar wavelet transform and lifting scheme for Haar wavelet transform, a first in literature of reversible logic. The designs are analysed to measure the efficiency of reversible logic implementations in terms of Quantum Cost (QC), Constant Inputs (CI), Garbage Outputs (GO) and Gate Count (GC). Furthermore, this paper proposes two architectures for Reversible Approximate Full Adder (RAFA) - RAFA-1 and RAFA-2;optimised explicitly for reversible logic based implementation. The proposed architectures have 25% Error Rate (ER) and optimised QC, CI, GC and GO when compared to existing exact and approximate full adder architectures implemented using reversible logic. Functional verification of the proposed architectures are performed on FPGA using 512 x 512 image. The efficiency of the imageprocessing application is projected in terms of Structural Similarity Index Measure (SSIM) and Peak signal to Noise Ratio (PSNR). Average SSIM and average PSNR are found to be 0.9679 and 31.81dB for RAFA-1 and 0.9696 and 32.15dB for RAFA-2 which are comparable with exact full adder based design.
The removal of atmospheric turbulence (AT) distortion in long range imaging is one of the most challenging areas of research in imaging processing with an immediate need for solutions in several applications such as i...
详细信息
The removal of atmospheric turbulence (AT) distortion in long range imaging is one of the most challenging areas of research in imaging processing with an immediate need for solutions in several applications such as in military and transportation systems. AT exacerbates distortion due to non-linear geometric blur and scintillations in long-distance images and videos, severely reducing image quality and information interpretation. AT negatively impacts both human and computer vision systems, compromising visibility essential for accurate object identification and tracking. In this dissertation, a novel sparse analysis framework is developed to address efficient AT blur and scintillation removal in video. Operating under the premise that distortion-free images should be sparse in a transform domain, the application of the dual-tree complex wavelet transform is utilized on frame bursts, allowing for a new near shift-invariant complex transform space that results in higher sparsity, higher object tracking accuracy, and better resilience against camera shake, geometric distortion, and imperfect frame registration encountered in real-world AT-distorted sequences. Using this new complex transform space the novel Frame-Burst Coefficient Shimmer Thresholding (FBST) algorithm is developed. FBST considers the complex coefficient shimmer across multiple frames to address threshold selection and moving object blur, issues still present in other methods which utilize techniques such as averaging and empirical threshold selection. In fact, by evaluating video sequences of moving vehicles with visible license plates, we show FBST produces up to an 85% sparse reconstruction with superior visual results compared to weighted and simple thresholding approaches while preserving object motion, reducing AT distortion, and enhancing object contrast and visibility. Moreover, compressed sensing (CS) methods to sparse AT distortion removal are also investigated through direct CS sampling of the coeffi
This paper introduces an optimal solution for wavelet-based medical image fusion using different wavelet families and Principal Component Ana1ysis (PCA) based on the Modified Central Force Optimization (MCFO) techniqu...
详细信息
This paper introduces an optimal solution for wavelet-based medical image fusion using different wavelet families and Principal Component Ana1ysis (PCA) based on the Modified Central Force Optimization (MCFO) technique. The main motivation of this work is to increase the quality of medical fused images in order to provide correct diagnosis of diseases for the objective of optimal therapy. This can be achieved by fusing medical images of different modalities using an optimization technique based on the MCFO. The MCFO technique gives the optimum gain parameters that achieve the best fused image quality. Histogram matching is applied to improve the overall values of the Peak signal-to-Noise Ratio (PSNR), entropy, local contrast, and quality of the fused image. A comparative study is performed between the proposed algorithm, the traditional Discrete wavelet Transform (DWT), and the PCA fusion using maximum fusion rule. The proposed algorithm is evaluated subjectively and objectively with different fusion quality metrics. Simulation results demonstrate that the proposed MCFO optimized wavelet-based fusion algorithm using Haar wavelet and histogram matching achieves a superior performance with the highest image quality and clearest image details in a very short processing time.
This book provides a practical guide, complete with accompanying Matlab software, to many different types of polynomial and discrete splines and spline-based wavelets, multiwavelets and wavelet frames in signal and im...
ISBN:
(纸本)9783319921228
This book provides a practical guide, complete with accompanying Matlab software, to many different types of polynomial and discrete splines and spline-based wavelets, multiwavelets and wavelet frames in signal and imageprocessingapplications. In self-contained form, it briefly outlines a broad range of polynomial and discrete splines with equidistant nodes and their signal-processing-relevant properties. In particular, interpolating, smoothing, and shift-orthogonal splines are presented.
暂无评论