检索结果-内蒙古大学图书馆

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Hoang, Trung McElvain, Jon Monga, Vishal Dolby Labs Burbank CA 91505 USA Penn State Univ University Pk PA USA

ISBN: (纸本)9798350344868;9798350344851

Joint low-light enhancement and deblurring is a challenging imaging inverse problem that estimates clean images from photography corrupted by both low-light and blurring artifacts. To address this task, we propose FELI, a Fast and physically Enriched deep neural network for joint Low-light enhancement and image deblurring. In a departure from recently proposed end-to-end networks, FELI employs a learnable Decomposer during training based on Retinex theory that helps with low-light scene recovery. FELI's encoded features are further enriched by an input reconstruction task cognizant of the blur model leading to effective deblurring. We introduce a new customized contrastive regularization (CCR) term that pulls the restored clean image closer to the ground truth while pushing it far away from both the input and reconstructed input. Experiments performed on challenging synthetic and real-world datasets demonstrate that FELI outperforms state-of-the-art methods at a lower computational cost.

关键词：

来源：评论

学校读者我要写书评

暂无评论

EFFICIENT CONTENT RECONSTRUCTION FOR HIGH DYNAMIC RANGE IMAGING 49

EFFICIENT CONTENT RECONSTRUCTION FOR HIGH DYNAMIC RANGE IMAG...

引用

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Zhang, Xiang Hu, Tao He, Jiashuang Yan, Qingsen Xian Univ Architecture & Technol Coll Informat & Control Engn Xian Peoples R China Northwestern Polytech Univ Sch Comp Sci Xian Peoples R China

ISBN: (纸本)9798350344868;9798350344851

High Dynamic Range (HDR) images can be reconstructed from multiple Low Dynamic Range (LDR) images using existing deep neural network (DNN) techniques. Despite notable advancements, DNN-based methods still exhibit ghosting artifacts when handling LDR images with saturation and significant motion. Recent Diffusion models (DMs) have been introduced in HDR imaging, showcasing promising performance, especially in achieving visually perceptible results. However, DMs typically require numerous inference iterations to recover the clean image from Gaussian noise, demanding substantial computational resources. Additionally, DM only learns a probability distribution of the added noise in each step but neglects image space constraints on HDR images, limiting distortion-based metrics. To tackle these challenges, we propose an efficient network that integrates DM modules into existing regression-based models, providing reliable content reconstruction for HDR while avoiding limitations in distortion-based metrics.

关键词： High dynamic range imaging multiexposed imaging diffusion models convolutional neural network

来源：评论

学校读者我要写书评

暂无评论

Attention Multilayer Perceptron Fusion Network

引用

IEEE ACCESS 2023年 11卷 83580-83588页

作者： Wang, Yuhua Lian, Yuhao Ying, Mingjun Wang, Shuyu Chongqing Univ Posts & Telecommun Sch Comp Sci & Technol Chongqing 400065 Peoples R China

With the rise of downstream image tasks, the requirements for the quality of images obtained upstream are becoming higher and higher. In view of the many structural features of remote sensing images, we propose a novel deep neural network architecture for hyperspectral image fusion that integrates attention mechanisms and multi-layer perceptron blocks. The proposed network can capture long-range spatial dependencies between image elements, which is critical for capturing multi-scale features in remote sensing applications. The attention mechanisms selectively focus on important image features while disregarding redundant information, and the multi-layer perceptron blocks can capture multi-scale features by processing image features at different scales. The experimental results demonstrate that the proposed network outperforms other state-of-the-art methods in terms of both objective evaluation metrics and visual quality. The proposed method achieves higher Peak signal to Noise Ratio and Spatial Consistency and Contrast values compared to other methods while preserving fine details and textures in the fused images. Overall, the proposed network provides an effective and efficient solution for hyperspectral image fusion that can contribute to the development of more accurate and reliable remote sensing applications.

关键词： image fusion pansharpening multilayer perceptron attention mechanism

来源：评论

学校读者我要写书评

暂无评论

Robust Indoor Positioning of Automated Guided Vehicles in Internet of Things Networks With Deep Convolution neural Network Considering Adversarial Attacks

引用

IEEE Transactions on Vehicular Technology 2024年第6期73卷 7748-7757页

作者： Elsisi, Mahmoud Rusidi, Akhmad Lutfi Tran, Minh-Quang Su, Chun-Lien Ali, Mahmoud N. National Kaohsiung University of Science and Technology Department of Electrical Engineering Kaohsiung807618 Taiwan Cairo11629 Egypt National Taiwan University of Science and Technology Department of Electronic and Computer Engineering Taipei106 Taiwan Tuetech University Department of Mechanical Engineering Thai Nguyen250000 Viet Nam

The effectiveness of positioning techniques that utilize the receiver signal strength (RSS) is highly dependent on the instability of the received signal strength indicator (RSSI). Up to now, there is no strategy that effectively lowers the influence of such instability on the accuracy of positioning. Moreover, recent studies showed that indoor positioning techniques are vulnerable to noise in RSSI data and cyber-attacks, which make them more expensive. In this study, a new Internet of Things (IoT) paradigm is proposed for the indoor positioning of automated guided vehicles (AGVs) using a deep convolution neural network (CNN). The proposed method handles signal processing by converting the RSSI signal into an image. In which, the 1-D RSSI signal is converted into 2-D image data in order to generate the new features based on continuous wavelet transform (CWT), and then the proposed deep CNN is implemented for the indoor positioning system. The test results show that the proposed model can outperform other state-of-the-art positioning techniques with small position errors. Furthermore, the robustness of the proposed model is validated against various adversarial attacks. In addition, the proposed method can have a lower impact on RSSI change compared with other methods. © 1967-2012 IEEE.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Towards Light-weight Transformer-based Quality Assessment Metric for Augmented Reality 26

Towards Light-weight Transformer-based Quality Assessment Me...

引用

26th International Workshop on Multimedia signal processing

作者： Sekhri, Aymen Amirshahi, Seyed Ali Larabi, Mohamed-Chaker Univ Poitiers XLIM CNRS Poitiers France Norwegian Univ Sci & Technol Gjovik Norway

ISBN: (纸本)9798350387261;9798350387254

With the rise of Augmented Reality (AR) technology, which enhances the real world by overlaying computer-generated content, immersive experiences are being offered in education, entertainment, healthcare, ... Assessing the quality of AR scenarios is crucial for understanding and improving user satisfaction and engagement. However, developing objective AR quality assessment methods is challenging due to the lack of data and the inherent complexity of technology, particularly in the presence of visual confusion. Existing convolution neural network-based approaches suffer from limited receptive fields and are not effective at capturing global information in visually confused AR scenarios. Additionally, to the best of our knowledge, exploring transformer capabilities for AR quality assessment is missing. Therefore, this study introduces transformAR, a lightweight transformer-based model for objective quality assessment in AR applications. This approach leverages pre-trained vision transformer-based encoders to capture image content information, computes distance vectors to quantify distortions, and employs cross-attention-based decoders to model perceptual quality features. The model also integrates adapted regularization techniques and label smoothing to mitigate overfitting. Experimental results demonstrate the effectiveness of transformAR, outperforming the few existing state-of-the-art methods.

关键词： Augmented Reality Vision Transformer image processing image Quality Assessment

来源：评论

学校读者我要写书评

暂无评论

Learning the degradation distribution for medical image superresolution via sparse swin transformer

引用

COMPUTERS & GRAPHICS-UK 2023年第1期114卷 168-178页

作者： Han, Xianjun Xie, Zhaoyang Chen, Qianqian Li, Xuejun Yang, Hongyu Anhui Univ Sch Comp Sci & Technol Hefei Peoples R China Sichuan Univ Coll Comp Sci Chengdu Peoples R China

High-resolution (HR) medical images can provide rich details, which are important for discovering subtle lesions to make diagnoses. Convolutional neural networks (CNNs) are widely used in this field, but struggle to model long-range dependencies. Although transformer-based methods have improved in this respect, this method requires large quantities of data. Unfortunately, large quantities of low -resolution (LR) and HR medical image pairs may not always be available. In addition, most medical image superresolution (SR) methods are deterministic, while the degradation in real scenarios is stochastic. To address these problems, we introduce a probabilistic degradation model that combines natural and medical images for training. This design alleviates the problem of insufficient medical image pairs and learns the degradation process of the natural scene. In addition, we propose a new medical image SR model that consists of CNNs and the Swin Transformer structure to excavate both local and global semantic features. Moreover, to reduce computational stress, the spherical locality -sensitive hashing (SLSH) module is employed in the nonlocal attention (NLA) mechanism to form the ENLA module. This design enables the proposed Sparse Swin Transformer (SSFormer) model to generate HR medical images without extensive training images. Experiments on diverse datasets (natural images and medical images) demonstrate that the proposed method is robust and effective, qualitatively and quantitatively outperforming other medical image SR methods. Code is available at https://***/codehxj/SSFormer.& COPY;2023 Elsevier Ltd. All rights reserved.

关键词： Medical image superresolution Swin Transformer Medical image processing image restoration Degradation distribution

来源：评论

学校读者我要写书评

暂无评论

MLATANet: a neural network based on multi-scale lead attention and temporal attention for the diagnosis of arrhythmia types

引用

signal image AND VIDEO processing 2025年第7期19卷 1-13页

作者： Zhao, Yufei Dou, Mengfei Yang, Xinwu Beijing Univ Technol Sch Comp Sci Beijing 100124 Peoples R China

Automatic diagnosis of the type of arrhythmia of a patient achieved by ECG plays an important role in the prevention and treatment of cardiovascular diseases. In recent years, convolutional neural network (CNN) and recurrent neural network (RNN) have been widely used in ECG diagnosis, however, using a simple convolutional network to capture the complex local changes in the signal is difficult. RNN is not effective enough in modeling the context of long-distance signals with dense time steps, and most of the methods are mostly modeling the lead space or the time domain individually, failing to combine the two features effectively. Therefore, we propose a network (MLATANet) based on convolution-transformer architecture with multi-scale lead attention and time domain attention. In the shallow layers of the network, parallel multi-scale convolution is used to extract features at different temporal resolutions. Small convolution kernels are used to capture local subtle features, while larger convolution kernels are used to obtain local coarse contour features. After convolution, the lead attention is used to automatically assign more weights to important lead channels based on the importance of different channel information. In the deep layers of the network, Transformer's multi-head self-attention is used to model the global temporal dependencies, enriching the feature expression in both temporal and spatial dimensions. In summary, first, spatial local features were captured through shallow multi-scale convolution and lead attention, then temporal global features were captured through deep Transformer multi-head self-attention, enabling the model to not only deeply explore the subtle aspects of the signal, but also analyze the signal from the overall trend, achieving an organic combination of local and global features. Experiments were conducted on the 2018 China Physiological signal Challenge (CPSC2018) dataset, 2021 PhysioNet/Computing in Cardiology Challenge (CinC202

关键词： Classification of arrhythmias Deep learning CNN-transformer Spatio-temporal attention

来源：评论

学校读者我要写书评

暂无评论

Analysis of signals Detection methods Using image processing

Analysis of Signals Detection Methods Using Image Processing

引用

2023 Seminar on signal processing, SoSP 2023

作者： Morozova, Kristina Y. Obukhova, Nataliia A. Saint Petersburg Electrotechnical University LETI Saint Petersburg Department of Television and Video Engineering Russia

ISBN: (纸本)9798350371086

Algorithms for multisignals detection using image processing are investigated. Approaches based on digital image processing, as well as on the use of neural networks and deep learning are considered. A comparative analysis of the listed methods for their further application in the detection of FHSS-signals from the spectrogram image is given. © 2023 IEEE.

关键词： signal detection

来源：评论

学校读者我要写书评

暂无评论

neural component search for single image super-resolution^?

引用

signal processing-image COMMUNICATION 2022年 106卷 1页

作者： Mo, Lingfei Guan, Xuchen Southeast Univ Sch Instrument Sci & Engn Nanjing 210037 Peoples R China

Deep learning has become the mainstream method in the field of single image super-resolution (SISR), and the neural architecture search has been gradually applied to build SISR networks in a non-hand-crafted way. However, the existing methods can only search the structure of models and the searching speed is slow. To solve this problem, a neural component search (NCS) method is proposed. When searching for SISR networks, the color space and the composition of loss functions during training are also parts of the search space. Under a specific computational constraint, the peak signal noise ratio (PSNR) or structural similarity (SSIM) can be used as the reward to search out an optimal super-resolution network. In addition, a super graph is designed with the idea of parameter sharing to sample adaptive residual dense networks (ARDNs), thus the NCS can complete the search of SISR networks at faster speed compared to existing methods. Experimental results indicate that ARDNs searched by the NCS is competitive with the hand-crafted state-of-the-art networks, and ARDNs achieve favorable performance against state-of-the-art methods with similar computational consumption.

关键词： neural component search Super-resolution Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Passive Amplification and Noise Mitigation of Optical signals Through Talbot processing

引用

JOURNAL OF LIGHTWAVE TECHNOLOGY 2023年第3期41卷 797-814页

作者： Crockett, Benjamin Cortes, Luis Romero Azana, Jose Inst Natl Rech Sci Ctr Energie Mat Telecommun INRS EMT Varennes PQ J3X 1P7 Canada Univ Politecn Valencia Photon Res Labs Valencia 46022 Spain

Noise is one of the rare aspects of experimental work that crosses all boundaries. It is present from scientific fields like ultrafast optical signal detection to applied fields such as image processing, or even in our day-to-day lives when we are simply trying to have a conversation in a loud room. In all these cases, incoherent, stochastic noise tends to drown a signal we aim to detect, and various techniques may need to be employed to improve the clarity of the waveform, which is characterized by the signal-to-noise ratio (SNR). Yet, considering the ubiquity of noise in scientific and technology fields, it may be surprising how few methods there exists for denoising a signal. Active amplification techniques alone cannot be employed for weak, noisy signals, since the SNR is inevitably degraded due to fundamental laws of physics, while bandpass filtering schemes necessarily lead to an attenuation of the signal. In this article, we review recent advances on the concept of passive amplification techniques based on the Talbot effect to enhance the noise properties of signals through coherent energy redistribution. We demonstrate the basic framework starting from pulse repetition rate multiplication with the Talbot effect. We then extend this theory to show the principle behind passive amplification of periodic waveforms, and then how this idea can be extended to arbitrary (generally, aperiodic) signals. methods for passive amplification of both the time-domain and the frequency-domain representations of the signal of interest are reviewed. While here we focus on the application of the technique for optical signals in the standard telecommunication band (near wavelengths of 1550 nm), the proposed denoising scheme relies on widely available wave manipulations, such that it may offer exciting opportunities for any kind of physical wave support, such as acoustics, plasmonics and other regimes of the electromagnetic spectrum, like microwaves or X-rays.

关键词： Narrowband Ultrafast optics Stimulated emission Optical noise Optical fiber amplifiers Filtering Optical signal processing Linear optics noise mitigation signal processing signal restoration Talbot effect

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：