Noise is one of the rare aspects of experimental work that crosses all boundaries. It is present from scientific fields like ultrafast optical signal detection to applied fields such as imageprocessing, or even in ou...
详细信息
Noise is one of the rare aspects of experimental work that crosses all boundaries. It is present from scientific fields like ultrafast optical signal detection to applied fields such as imageprocessing, or even in our day-to-day lives when we are simply trying to have a conversation in a loud room. In all these cases, incoherent, stochastic noise tends to drown a signal we aim to detect, and various techniques may need to be employed to improve the clarity of the waveform, which is characterized by the signal-to-noise ratio (SNR). Yet, considering the ubiquity of noise in scientific and technology fields, it may be surprising how few methods there exists for denoising a signal. Active amplification techniques alone cannot be employed for weak, noisy signals, since the SNR is inevitably degraded due to fundamental laws of physics, while bandpass filtering schemes necessarily lead to an attenuation of the signal. In this article, we review recent advances on the concept of passive amplification techniques based on the Talbot effect to enhance the noise properties of signals through coherent energy redistribution. We demonstrate the basic framework starting from pulse repetition rate multiplication with the Talbot effect. We then extend this theory to show the principle behind passive amplification of periodic waveforms, and then how this idea can be extended to arbitrary (generally, aperiodic) signals. methods for passive amplification of both the time-domain and the frequency-domain representations of the signal of interest are reviewed. While here we focus on the application of the technique for optical signals in the standard telecommunication band (near wavelengths of 1550 nm), the proposed denoising scheme relies on widely available wave manipulations, such that it may offer exciting opportunities for any kind of physical wave support, such as acoustics, plasmonics and other regimes of the electromagnetic spectrum, like microwaves or X-rays.
FPGAs have emerged as a promising platform for implementing neural networks due to their reconfigurability, parallelism, and low power consumption. Nonetheless, designing and optimizing FPGA-based neural network accel...
详细信息
FPGAs have emerged as a promising platform for implementing neural networks due to their reconfigurability, parallelism, and low power consumption. Nonetheless, designing and optimizing FPGA-based neural network accelerators is a complex and time-consuming task with register transfer level (RTL) languages. High-level synthesis (HLS) tools provide a higher level of abstraction for FPGA design, enabling designers to concentrate on top-level design aspects, such as algorithms, rather than low-level hardware implementation details. One of the state-of-the-art object detection networks is you look only once (YOLO) network series which is constructed using different neural network technologies using cross-stage connections and feature extraction techniques like pyramid networks. In this paper, we propose a method for the implementation of YOLOv7-tiny network on FPGAs using HLS tools. We present a comprehensive analysis of the performance and resource utilization of FPGA-based neural network accelerators. Our methods show excellent results for real-time application requirements such as latency. Specifically, our work reduces the usage of digital signalprocessing (DSP) units by 90% and it saves up to 60% of flip-flops compared to state-of-the-art designs, while achieving competitive usage of block RAM and look-up tables. Additionally, the achieved design latency of 15 ms is extremely suitable for real-time applications. Also we will propose a method for BRAM utilization method and off-chip memory access.
In this article, we investigate the spontaneity issue in facial expression sequence generation. Current leading methods in the field are commonly reliant on manually adjusted conditional variables to direct the model ...
详细信息
In this article, we investigate the spontaneity issue in facial expression sequence generation. Current leading methods in the field are commonly reliant on manually adjusted conditional variables to direct the model to generate a specific class of expression. We propose a neural network-based method which uses Gaussian noise to model spontaneity in the generation process, removing the need for manual control of conditional generation variables. Our model takes two sequential images as input, with additive noise, and produces the next image in the sequence. We trained two types of models: single-expression, and mixed-expression. With single-expression, unique facial movements of certain emotion class can be generated;with mixed expressions, fully spontaneous expression sequence generation can be achieved. We compared our method to current leading generation methods on a variety of publicly available datasets. Initial qualitative results show our method produces visually more realistic expressions and facial action unit (AU) trajectories;initial quantitative results using image quality metrics (SSIM and NIQE) show the quality of our generated images is higher. Our approach and results are novel in the field of facial expression generation, with potential wider applications to other sequence generation tasks.
image segmentation is a critical step in digital imageprocessing applications. One of the most preferred methods for image segmentation is multilevel thresholding, in which a set of threshold values is determined to ...
详细信息
image segmentation is a critical step in digital imageprocessing applications. One of the most preferred methods for image segmentation is multilevel thresholding, in which a set of threshold values is determined to divide an image into different classes. However, the computational complexity increases when the required thresholds are high. Therefore, this paper introduces a modified Coronavirus Optimization algorithm for image segmentation. In the proposed algorithm, the chaotic map concept is added to the initialization step of the naive algorithm to increase the diversity of solutions. A hybrid of the two commonly used methods, Otsu's and Kapur's entropy, is applied to form a new fitness function to determine the optimum threshold values. The proposed algorithm is evaluated using two different datasets, including six benchmarks and six satellite images. Various evaluation metrics are used to measure the quality of the segmented images using the proposed algorithm, such as mean square error, peak signal-to-noise ratio, Structural Similarity Index, Feature Similarity Index, and Normalized Correlation Coefficient. Additionally, the best fitness values are calculated to demonstrate the proposed method's ability to find the optimum solution. The obtained results are compared to eleven powerful and recent metaheuristics and prove the superiority of the proposed algorithm in the image segmentation problem.
End-to-end image compression has achieved satisfactory results in recent studies. However, existing methods suffer from high complexity of complicated neural network computation and cannot be directly deployed on mobi...
详细信息
ISBN:
(纸本)9798350387261;9798350387254
End-to-end image compression has achieved satisfactory results in recent studies. However, existing methods suffer from high complexity of complicated neural network computation and cannot be directly deployed on mobile devices due to the limitations of computing ability and storage. Therefore, considering the resource and computing ability constrains of the mobile devices, we make a trade-off in this paper between rate-distortion (R-D) performance, inference time, and model complexity. Then we design a novel lightweight perceptual image compression framework to alleviate the storage and complexity burden of mobile devices. Moreover, we design a hardware-friendly deployment scheme to apply the proposed compression framework on high-end mobile devices, which can achieve efficient image compression. Based on the above structures, we propose the first mobile system that achieves image compression on mobile devices. The supplementary material of our system demo is on https://***/documents/extreme-lowbitrate-image-compression-system-mobile-deployment.
Multi -center cervical cytology images have various image styles due to the differences in staining and imaging techniques, which pose a significant challenge to the performance of automated cervical cancer diagnosis ...
详细信息
Multi -center cervical cytology images have various image styles due to the differences in staining and imaging techniques, which pose a significant challenge to the performance of automated cervical cancer diagnosis tools. We propose a dual -head network architecture that explicitly disentangles image features into content and style features, and applies contrastive self -supervised learning to a large number of unlabeled images, achieving enhanced generalization across various styles. We pretrain our model on 1,024,855 images cropped from 3,561 whole slide images (WSIs), and visualize the features using t -distributed stochastic neighbor embedding (t-SNE) method, demonstrating the effectiveness of our method in distinguishing between content and style features. In the downstream task, we evaluate our model on 192,123 binary -classified images with 10 styles, and achieve the best accuracy among all methods for every style. Across the 10 different data sources, our method attained an average accuracy of 80.4%, outperforming all other comparative methods by 3% to 17%, demonstrating our method's potential to enhance the performance and robustness of automated cytology image analysis in multi -center settings.
Reconstructing magnetic resonance (MR) images from undersampled k-space data have always been a challenging problem. Compressed Sensing (CS) can reconstruct images from a small amount of sampled data when combined wit...
详细信息
Reconstructing magnetic resonance (MR) images from undersampled k-space data have always been a challenging problem. Compressed Sensing (CS) can reconstruct images from a small amount of sampled data when combined with the robust feature learning ability of deep learning, and it can further reduce the sampling time. Most previous deep learning methods using compressed sensing relied heavily on convolutional neural networks (CNNs) or swin transformer block(STB), even they reconstructed images through stacking or cross-domain structure. However, due to the limited size of their receptive fields, convolutional neural networks cannot explore the global features of images. Conversely, vast receptive fields would increase model complexity and make the entire network difficult to train. In this paper, we proposed a cascade dual-domain swin-conv unet for reconstruction(CDSCU-Net), which combines STB and CNNs to focus on both local and global features during reconstruction. By fusing these features through incorporating our designed residual modules in the skip connections, we can mine more refined feature representations. Compared with the best-performing deep learning reconstruction methods based on compressed sensing in recent years, CDSCU-Net can better preserve the structural details of images and achieve good reconstruction quality at lower acceleration factors, additionally, the reconstructed images can also serve as raw data for other tasks.
This paper deals with the state-space modelling of nonlinear stochastic dynamic systems. The emphasis is laid on the emerging area of data-augmented physics-based modelling of the state dynamics, which combines the be...
详细信息
ISBN:
(纸本)9798350373769;9798350373752
This paper deals with the state-space modelling of nonlinear stochastic dynamic systems. The emphasis is laid on the emerging area of data-augmented physics-based modelling of the state dynamics, which combines the benefits of the physics-driven and data-based identified models. As the augmented state-space models depend on the measured data, modelling the state noise properties becomes challenging. This paper proposes and validates a concept for the state noise identification of nonlinear data-augmented state equation using the maximum likelihood and correlation-based methods. The numerical simulation of a tracking scenario shows significant improvement of the state estimation accuracy and consistency when using the identified noise model.
RGB-D data including paired RGB color images and depth maps is widely used in downstream computer vision tasks. However, compared with the acquisition of high -resolution color images, the depth maps captured by consu...
详细信息
RGB-D data including paired RGB color images and depth maps is widely used in downstream computer vision tasks. However, compared with the acquisition of high -resolution color images, the depth maps captured by consumer-level sensors are always in low resolution. Within decades of research, the most state -of -the -art (SOTA) methods of depth map super -resolution cannot adaptively tune the guidance fusion for all feature positions by channel-wise feature concatenation with spatially sharing convolutional kernels. This paper proposes JTFNet to resolve this issue, which simulates the traditional Joint Trilateral Filter (JTF). Specifically, a novel JTF block is introduced to adaptively tune the fusion pattern between the color features and the depth features for all feature positions. Moreover, based on the variant of JTF block whose target features and guidance features are in the cross-scale shape, the fusion for depth features is performed in a bi-directional way. Therefore, the error accumulation along scales can be effectively mitigated by iteratively HR feature guidance. Compared with the SOTA methods, the sufficient experiment is conducted on the mainstream synthetic datasets and real datasets, i.e., Middlebury, NYU and ToF-Mark, which shows remarkable improvement of our JTFNet.
Underwater images suffer from color casts and low contrast degraded due to wavelength-dependent light scatter and abortion of the underwater environment. To effectively improve the quality of the underwater images, de...
详细信息
Underwater images suffer from color casts and low contrast degraded due to wavelength-dependent light scatter and abortion of the underwater environment. To effectively improve the quality of the underwater images, deep learning-based underwater image enhancement methods have been widely proposed. However, most deep learning-based underwater image enhancement methods rely heavily on paired datasets. Actually, obtaining distortion-free images as reference images is difficult in underwater imaging. To address this problem, a fully Unsupervised convolution neural network-based Underwater image Enhancement (UUIE) is proposed by pseudo-Retinex decomposition. The innovation of the proposed UUIE is to establish a relationship between the underwater imaging model and the Retinex model, then use terrestrial images to replace underwater images for training and estimate pseudo-illumination and pseudo-reflection maps through self-supervision using the pseudo-Retinex decomposition. The pseudo -reflection image and pseudo-illumination image are reconstructed by the pseudo-Retinex decomposition to obtain the enhanced image. Additionally, the proposed UUIE can also be extended to image dehazing and low-light enhancement with only one trained model. Experimental results on synthetic and real -world datasets demonstrate the effectiveness of the proposed UUIE quantitatively and qualitatively.(c) 2023 Elsevier Inc. All rights reserved.
暂无评论