检索结果-内蒙古大学图书馆

32nd European signal processing Conference (EUSIPCO)

作者： Gualdron-Hurtado, Romario Jacome, Roman Urrea, Sergio Arguello, Henry Gonzalez, Luis Univ Ind Santander Dept Comp Sci Bucaramanga Colombia Univ Ind Santander Dept Elect Engn Bucaramanga Colombia

ISBN: (纸本)9789464593617;9798331519773

Deep-learning (DL)-based image deconvolution (ID) has exhibited remarkable recovery performance, surpassing traditional linear methods. However, unlike traditional ID approaches that rely on analytical properties of the point spread function (PSF) to achieve high recovery performance-such as specific spectrum properties or small conditional numbers in the convolution matrix-DL techniques lack quantifiable metrics for evaluating PSF suitability for DL-assisted recovery. Aiming to enhance deconvolution quality, we propose a metric that employs a non-linear approach to learn the invertibility of an arbitrary PSF using a neural network by mapping it to a unit impulse. A lower discrepancy between the mapped PSF and a unit impulse indicates a higher likelihood of successful inversion by a DL network. Our findings reveal that this metric correlates with high recovery performance in DL and traditional methods, thereby serving as an effective regularizer in deconvolution tasks. This approach reduces the computational complexity over conventional condition number assessments and is a differentiable process. These useful properties allow its application in designing diffractive optical elements through end-to-end (E2E) optimization, achieving invertible PSFs, and outperforming the E2E baseline framework.

关键词： image deconvolution computational imaging diffractive optical element design

来源：评论

学校读者我要写书评

暂无评论

Learning the degradation distribution for medical image superresolution via sparse swin transformer

引用

COMPUTERS & GRAPHICS-UK 2023年 114卷 168-178页

作者： Han, Xianjun Xie, Zhaoyang Chen, Qianqian Li, Xuejun Yang, Hongyu Anhui Univ Sch Comp Sci & Technol Hefei Peoples R China Sichuan Univ Coll Comp Sci Chengdu Peoples R China

High-resolution (HR) medical images can provide rich details, which are important for discovering subtle lesions to make diagnoses. Convolutional neural networks (CNNs) are widely used in this field, but struggle to model long-range dependencies. Although transformer-based methods have improved in this respect, this method requires large quantities of data. Unfortunately, large quantities of low -resolution (LR) and HR medical image pairs may not always be available. In addition, most medical image superresolution (SR) methods are deterministic, while the degradation in real scenarios is stochastic. To address these problems, we introduce a probabilistic degradation model that combines natural and medical images for training. This design alleviates the problem of insufficient medical image pairs and learns the degradation process of the natural scene. In addition, we propose a new medical image SR model that consists of CNNs and the Swin Transformer structure to excavate both local and global semantic features. Moreover, to reduce computational stress, the spherical locality -sensitive hashing (SLSH) module is employed in the nonlocal attention (NLA) mechanism to form the ENLA module. This design enables the proposed Sparse Swin Transformer (SSFormer) model to generate HR medical images without extensive training images. Experiments on diverse datasets (natural images and medical images) demonstrate that the proposed method is robust and effective, qualitatively and quantitatively outperforming other medical image SR methods. Code is available at https://***/codehxj/SSFormer.& COPY;2023 Elsevier Ltd. All rights reserved.

关键词： Medical image superresolution Swin Transformer Medical image processing image restoration Degradation distribution

来源：评论

学校读者我要写书评

暂无评论

Robust Indoor Positioning of Automated Guided Vehicles in Internet of Things Networks With Deep Convolution neural Network Considering Adversarial Attacks

引用

IEEE Transactions on Vehicular Technology 2024年第6期73卷 7748-7757页

作者： Elsisi, Mahmoud Rusidi, Akhmad Lutfi Tran, Minh-Quang Su, Chun-Lien Ali, Mahmoud N. National Kaohsiung University of Science and Technology Department of Electrical Engineering Kaohsiung807618 Taiwan Cairo11629 Egypt National Taiwan University of Science and Technology Department of Electronic and Computer Engineering Taipei106 Taiwan Tuetech University Department of Mechanical Engineering Thai Nguyen250000 Viet Nam

The effectiveness of positioning techniques that utilize the receiver signal strength (RSS) is highly dependent on the instability of the received signal strength indicator (RSSI). Up to now, there is no strategy that effectively lowers the influence of such instability on the accuracy of positioning. Moreover, recent studies showed that indoor positioning techniques are vulnerable to noise in RSSI data and cyber-attacks, which make them more expensive. In this study, a new Internet of Things (IoT) paradigm is proposed for the indoor positioning of automated guided vehicles (AGVs) using a deep convolution neural network (CNN). The proposed method handles signal processing by converting the RSSI signal into an image. In which, the 1-D RSSI signal is converted into 2-D image data in order to generate the new features based on continuous wavelet transform (CWT), and then the proposed deep CNN is implemented for the indoor positioning system. The test results show that the proposed model can outperform other state-of-the-art positioning techniques with small position errors. Furthermore, the robustness of the proposed model is validated against various adversarial attacks. In addition, the proposed method can have a lower impact on RSSI change compared with other methods. © 1967-2012 IEEE.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

FAST AND PHYSICALLY ENRICHED DEEP NETWORK FOR JOINT LOW-LIGHT ENHANCEMENT AND image DEBLURRING 49

FAST AND PHYSICALLY ENRICHED DEEP NETWORK FOR JOINT LOW-LIGH...

引用

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Hoang, Trung McElvain, Jon Monga, Vishal Dolby Labs Burbank CA 91505 USA Penn State Univ University Pk PA USA

ISBN: (纸本)9798350344868;9798350344851

Joint low-light enhancement and deblurring is a challenging imaging inverse problem that estimates clean images from photography corrupted by both low-light and blurring artifacts. To address this task, we propose FELI, a Fast and physically Enriched deep neural network for joint Low-light enhancement and image deblurring. In a departure from recently proposed end-to-end networks, FELI employs a learnable Decomposer during training based on Retinex theory that helps with low-light scene recovery. FELI's encoded features are further enriched by an input reconstruction task cognizant of the blur model leading to effective deblurring. We introduce a new customized contrastive regularization (CCR) term that pulls the restored clean image closer to the ground truth while pushing it far away from both the input and reconstructed input. Experiments performed on challenging synthetic and real-world datasets demonstrate that FELI outperforms state-of-the-art methods at a lower computational cost.

关键词：

来源：评论

学校读者我要写书评

暂无评论

EFFICIENT CONTENT RECONSTRUCTION FOR HIGH DYNAMIC RANGE IMAGING 49

EFFICIENT CONTENT RECONSTRUCTION FOR HIGH DYNAMIC RANGE IMAG...

引用

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Zhang, Xiang Hu, Tao He, Jiashuang Yan, Qingsen Xian Univ Architecture & Technol Coll Informat & Control Engn Xian Peoples R China Northwestern Polytech Univ Sch Comp Sci Xian Peoples R China

ISBN: (纸本)9798350344868;9798350344851

High Dynamic Range (HDR) images can be reconstructed from multiple Low Dynamic Range (LDR) images using existing deep neural network (DNN) techniques. Despite notable advancements, DNN-based methods still exhibit ghosting artifacts when handling LDR images with saturation and significant motion. Recent Diffusion models (DMs) have been introduced in HDR imaging, showcasing promising performance, especially in achieving visually perceptible results. However, DMs typically require numerous inference iterations to recover the clean image from Gaussian noise, demanding substantial computational resources. Additionally, DM only learns a probability distribution of the added noise in each step but neglects image space constraints on HDR images, limiting distortion-based metrics. To tackle these challenges, we propose an efficient network that integrates DM modules into existing regression-based models, providing reliable content reconstruction for HDR while avoiding limitations in distortion-based metrics.

关键词： High dynamic range imaging multiexposed imaging diffusion models convolutional neural network

来源：评论

学校读者我要写书评

暂无评论

AMA: attention-based multi-feature aggregation module for action recognition

引用

signal image AND VIDEO processing 2023年第3期17卷 619-626页

作者： Yu, Mengyun Chen, Ying Jiangnan Univ Minist Educ Key Lab Adv Proc Control Light Ind Wuxi 214000 Jiangsu Peoples R China

Spatial information learning, temporal modeling and channel relationships capturing are important for action recognition in videos. In this work, an attention-based multi-feature aggregation (AMA) module that encodes the above features in a unified module is proposed, which contains a spatial-temporal aggregation (STA) structure and a channel excitation (CE) structure. STA mainly employs two convolutions to model spatial and temporal features, respectively. The matrix multiplication in STA has the ability of capturing long-range dependencies. The CE learns the importance of each channel, so as to bias the allocation of available resources toward the informative features. AMA module is simple yet efficient enough that can be inserted into a standard ResNet architecture without any modification. In this way, the representation of the network can be enhanced. We equip ResNet-50 with AMA module to build an effective AMA Net with limited extra computation cost, only 1.002 times that of ResNet-50. Extensive experiments indicate that AMA Net outperforms the state-of-the-art methods on UCF101 and HMDB51, which is 6.2% and 10.0% higher than the baseline. In short, AMA Net achieves the high accuracy of 3D convolutional neural networks and maintains the complexity of 2D convolutional neural networks simultaneously.

关键词： Action recognition Channel excitation Spatial-temporal aggregation Convolution neural network

来源：评论

学校读者我要写书评

暂无评论

Analysis of tennis games using TrackNet-based neural network and applying morphological operations to the match videos

引用

signal image AND VIDEO processing 2023年第4期17卷 1133-1141页

作者： Rocha, Nayara M. S. Pinto, Milena F. Biundini, Iago Z. Melo, Aurelio G. Marcato, Andre L. M. Fed Univ Juiz de Fora UFJF Juiz De Fora Brazil Fed Ctr Technol Educ Rio de Janeiro Rio De Janeiro Brazil

Computer vision plays a crucial role in current technological development, understanding a scene from the properties of 2D images. This research line becomes valuable in sports applications, where the scenario can be challenging to take technical decisions only from the observation. This work aims to develop a system based on computer vision for analyzing tennis games. The implemented method captures videos during the game through cameras installed on the court. Machine learning methods and morphological operations will be used over the images to locate the ball position, the court lines and the players location. In addition, the algorithm determines the moment the ball bounces during the game and analyzes whether it occurred in or out of the field. These data are available to players and judges through an Android application, allowing all processed data to be accessed from mobile devices, providing the results quickly and accessible to the user. From the results obtained, the system demonstrated robustness and reliability.

关键词： Tennis match analysis image processing Computer vision Line Detection Ball point inflection

来源：评论

学校读者我要写书评

暂无评论

Detection and localization of anomalous objects in video sequences using vision transformers and U-Net model

引用

signal image AND VIDEO processing 2024年第8-9期18卷 6379-6390页

作者： Berroukham, Abdelhafid Housni, Khalid Lahraichi, Mohammed Ibn Tofail Univ Fac Sci L RI Lab MISC Team Kenitra 14000 Morocco

The detection and localization of anomalous objects in video sequences remain a challenging task in video analysis. Recent years have witnessed a surge in deep learning approaches, especially with recurrent neural networks (RNNs). However, RNNs have limitations that vision transformers (ViTs) can address. We propose a novel solution that leverages ViTs, which have recently achieved remarkable success in various computer vision tasks. Our approach involves a two-step process. First, we utilize a pre-trained ViT model to generate an intermediate representation containing an attention map, highlighting areas critical for anomaly detection. In the second step, this attention map is concatenated with the original video frame, creating a richer representation that guides the U-Net model towards anomaly-prone regions. This enriched data is then fed into a U-Net model for precise localization of the anomalous objects. The model achieved a mean Intersection over Union (IoU) of 0.70, indicating a strong overlap between the predicted bounding boxes and the ground truth annotations. In the field of anomaly detection, a higher IoU score signifies better performance. Moreover, the pixel accuracy of 0.99 demonstrates a high level of precision in classifying individual pixels. Concerning localization accuracy, we conducted a comparison of our method with other approaches. The results obtained show that our method outperforms most of the previous methods and achieves a very competitive performance in terms of localization accuracy.

关键词： Vision transformers Deep learning Video processing Anomaly localization

来源：评论

学校读者我要写书评

暂无评论

ITERATIVELY PRECONDITIONED GUIDANCE OF DENOISING (DIFFUSION) MODELS FOR image RESTORATION 49

ITERATIVELY PRECONDITIONED GUIDANCE OF DENOISING (DIFFUSION)...

引用

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Tirer, Tom Bar Ilan Univ Fac Engn Ramat Gan Israel

ISBN: (纸本)9798350344868;9798350344851

Training deep neural networks has become a common approach for addressing image restoration problems. An alternative for training a "task-specific" network for each observation model is to use pretrained deep denoisers for imposing only the signal's prior within iterative algorithms, without additional training. Recently, this approach has become increasingly popular with the rise of diffusion/score-based generative models, whose core is iterative denoising. Using denoisers for general purpose restoration requires guiding the iterations to ensure agreement of the signal with the observations. In low-noise settings, guidance that is based on back-projection (BP) has been shown to be a promising strategy (used recently in the context of diffusion models also under the names "pseudoinverse" or "range/null-space" guidance). However, the presence of noise in the observations hinders the gains from this approach. In this paper, we propose a novel guidance technique, based on preconditioning that allows traversing from BP-based guidance to least squares based guidance along the restoration scheme. The proposed approach is robust to noise while still having much simpler implementation than alternative methods (e.g., no SVD is required). We demonstrate its advantages for image deblurring and superresolution.

关键词： image restoration iterative denoising plug-and-play denoisers diffusion models back-projection

来源：评论

学校读者我要写书评

暂无评论

Analysis of signals Detection methods Using image processing

Analysis of Signals Detection Methods Using Image Processing

引用

2023 Seminar on signal processing, SoSP 2023

作者： Morozova, Kristina Y. Obukhova, Nataliia A. Saint Petersburg Electrotechnical University LETI Saint Petersburg Department of Television and Video Engineering Russia

ISBN: (纸本)9798350371086

Algorithms for multisignals detection using image processing are investigated. Approaches based on digital image processing, as well as on the use of neural networks and deep learning are considered. A comparative analysis of the listed methods for their further application in the detection of FHSS-signals from the spectrogram image is given. © 2023 IEEE.

关键词： signal detection

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：