Search Results - Inner Mongolia University Library

MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multiscale Dilated Convolution for Image Compressive Sensing (CS)

IET Signal Processing, 2024, Vol. 2024, No. 1

Authors: Yu, Youhao; Dansereau, Richard M. (Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON, Canada)

Compressive sensing (CS) is a technique that enables the recovery of sparse signals using fewer measurements than traditional sampling methods. To address the computational challenges of CS reconstruction, our objective is to develop an interpretable and concise neural network model for reconstructing natural images using CS. We achieve this by mapping one step of the iterative shrinkage thresholding algorithm (ISTA) to a deep network block, representing one iteration of ISTA. To enhance learning ability and incorporate structural diversity, we integrate aggregated residual transformations (ResNeXt) and squeeze-and-excitation mechanisms into the ISTA block. This block serves as a deep equilibrium layer connected to a semi-tensor product network for convenient sampling and providing an initial reconstruction. The resulting model, called MsDC-DEQ-Net, exhibits competitive performance compared to state-of-the-art network-based methods. It significantly reduces storage requirements compared to deep unrolling methods, using only one iteration block instead of multiple iterations. Unlike deep unrolling models, MsDC-DEQ-Net can be iteratively used, gradually improving reconstruction accuracy while considering computation tradeoffs. Additionally, the model benefits from multiscale dilated convolutions, further enhancing performance.

Keywords: Compressed sensing
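
The abstract above maps one step of the iterative shrinkage thresholding algorithm (ISTA) to a network block. As a rough illustration of what a single classical ISTA iteration computes (not the learned MsDC-DEQ-Net block), the following minimal sketch solves a small l1-regularized least-squares problem. The function names, the plain-list matrix layout, and the Frobenius-norm step size are assumptions made for this example only.

```python
def soft_threshold(v, tau):
    """Soft-thresholding: the proximal operator of tau * |v|."""
    if v > tau:
        return v - tau
    if v < -tau:
        return v + tau
    return 0.0


def ista_step(x, A, y, step, lam):
    """One ISTA iteration for min_x 0.5*||A x - y||^2 + lam*||x||_1.

    A is a list of rows; x and y are lists of floats.
    """
    # Residual r = A x - y.
    r = [sum(a * b for a, b in zip(row, x)) - y_i for row, y_i in zip(A, y)]
    # Gradient of the data-fidelity term: A^T r.
    grad = [sum(A[i][j] * r[i] for i in range(len(A))) for j in range(len(x))]
    # Gradient step followed by shrinkage.
    return [soft_threshold(x_j - step * g_j, step * lam)
            for x_j, g_j in zip(x, grad)]


def objective(x, A, y, lam):
    """Value of 0.5*||A x - y||^2 + lam*||x||_1."""
    r = [sum(a * b for a, b in zip(row, x)) - y_i for row, y_i in zip(A, y)]
    return 0.5 * sum(v * v for v in r) + lam * sum(abs(v) for v in x)


def ista(A, y, lam, n_iter=100):
    """Repeat the same step n_iter times, starting from zero."""
    # 1 / ||A||_F^2 is a safe step size because ||A||_F >= ||A||_2.
    step = 1.0 / sum(a * a for row in A for a in row)
    x = [0.0] * len(A[0])
    for _ in range(n_iter):
        x = ista_step(x, A, y, step, lam)
    return x
```

Because the same `ista_step` is applied repeatedly, more iterations trade computation for a lower objective value, which is loosely the behaviour the abstract attributes to reusing a single block.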

Hyperspectral image denoising via self-modulating convolutional neural networks

Signal Processing, 2024, Vol. 214

Authors: Torun, Orhan; Yuksel, Seniha Esen; Erdem, Erkut; Imamoglu, Nevrez; Erdem, Aykut (Hacettepe Univ, Inst Sci, TR-06800 Ankara, Turkiye; Hacettepe Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkiye; Hacettepe Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye; Natl Inst Adv Ind Sci & Technol, Digital Architecture Res Ctr, Tokyo 1350064, Japan; Koc Univ, Dept Comp Engn, TR-34450 Istanbul, Turkiye; Koc Univ, Is Bank AI Ctr, TR-34450 Istanbul, Turkiye)

Compared to natural images, hyperspectral images (HSIs) consist of a large number of bands, with each band capturing different spectral information from a certain wavelength, some even beyond the visible spectrum. These characteristics make HSIs highly effective for remote sensing applications. That said, existing hyperspectral imaging devices introduce severe degradation in HSIs. Hence, hyperspectral image denoising has recently attracted considerable attention from the community. While recent deep HSI denoising methods have provided effective solutions, their performance under real-life complex noise remains suboptimal, as they lack adaptability to new data. To overcome these limitations, we introduce a self-modulating convolutional neural network, referred to as SM-CNN, which utilizes correlated spectral and spatial information. At the core of the model lies a novel block, which we call the spectral self-modulating residual block (SSMRB), that allows the network to transform the features in an adaptive manner based on the adjacent spectral data, enhancing the network's ability to handle complex noise. In particular, the introduction of SSMRB transforms our denoising network into a dynamic network that adapts its predicted features while denoising every input HSI with respect to its spatio-spectral characteristics. Experimental analysis on both synthetic and real data shows that the proposed SM-CNN outperforms other state-of-the-art HSI denoising methods both quantitatively and qualitatively on public benchmark datasets. Our code will be available at https://***/orhan-t/SM-CNN.

Keywords: HSIs; Denoising; Spectral self-modulation; SM-CNN

A Comprehensive Survey of Animal Identification: Exploring Data Sources, AI Advances, Classification Obstacles and the Role of Taxonomy

International Journal of Intelligent Systems, 2024, Vol. 2024, No. 1

Authors: Zhang, Qianqian; Ahmed, Khandakar; Sharda, Nalin; Wang, Hua (Victoria Univ, Inst Sustainable Ind & Liveable Cities (ISILC), Footscray, Vic 3011, Australia)

With the rapid development of entity recognition technology, animal recognition has gradually become essential in modern society, supporting labour-intensive agriculture and animal husbandry tasks. Pressing challenges such as maintaining biodiversity can also benefit from animal identification technology. However, certain invasive recognition systems have caused permanent harm to animals, while noninvasive identification methods also have drawbacks. This paper conducts a systematic literature review (SLR), presenting a comprehensive overview of various animal recognition technologies and their applications. Specifically, it examines methodologies such as deep learning, image processing and acoustic analysis used for different animal characteristics and identification purposes. The contribution of machine learning to animal feature extraction is highlighted, emphasising its significance for animal taxonomy and wild species monitoring. Additionally, this review addresses the challenges and limitations of current technologies, including data scarcity, model accuracy and computational requirements, and suggests opportunities for future research to overcome these obstacles.

Keywords: animal identification; image processing; machine learning; neural network; signal processing

Convolutional Neural Network Algorithm and Application Method for Real-Time Beam Steering in RF System

IEEE Access, 2024, Vol. 12, pp. 134498-134509

Authors: Byun, Sung-June; Ann, Da-Yeong; Jo, Jong-Wan; Lee, Heejeong Jasmine; Jung, Yeon-Jae; Kim, Seok-Kee; Pu, Young-Gun; Lee, Kang-Yoon (Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, South Korea; SKAIChips, Suwon 16571, South Korea; Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon 16419, South Korea)

This paper presents a novel artificial intelligence (AI)-based phase shift system in a beamforming system implemented with field programmable gate array (FPGA)-based hardware by integrating a conventional convolutional neural network (CNN) algorithm. The position of the target can be determined through a phase shifter in a beamforming system using artificial intelligence. In a system that emits a beam from a radio frequency (RF) transmitter and receives a beam from an RF receiver, artificial intelligence can control the phase. It controls the phase of the transmitter for beam scanning and the phase to optimize the signal-to-noise ratio (SNR) of the receiver. The position of the target was detected by learning the signal input data from the receiver. Targets were detected through two beam-scanning processes in a 3D space. The first is a coarse process of detecting the approximate position of the target in the entire space, and the second is a fine process of detecting the area in detail after detecting the first approximate position. The phases of the individual antennae should be controlled for optimal beamforming based on the 5x5 antenna, and the phase is detected at high speed by holding the phase large in the first coarse tuning. The second scan is a narrow-range scan with a small phase to detect the target quickly and accurately. This study shows that with FPGA, AI beamforming can be implemented through two scanning methods without image sensors. Based on the receiver's 5x5 antenna, the CNN, whose input feature has size 35x35, classifies the class with high accuracy.

Keywords: Artificial intelligence; Array signal processing; Field programmable gate arrays; Antennas; Radio frequency; Receiving antennas; Transceivers; Convolutional neural networks; Convolutional neural network; artificial intelligence; beamforming; beamforming algorithm; RF system
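
The two-stage scan described in the abstract above (a coarse pass over the whole space, then a fine pass around the coarse estimate) can be sketched in one dimension as a plain grid search. This is only an illustration of the coarse-to-fine idea under simplifying assumptions: the paper drives the process with a CNN on FPGA hardware, whereas here `score` is a hypothetical stand-in for a measured quality such as receiver SNR, and all function names and step sizes are invented for the example.

```python
def scan(score, lo, hi, step):
    """Return the grid point in [lo, hi] with the highest score."""
    n = int((hi - lo) / step + 1e-9)          # number of whole steps that fit
    grid = [lo + k * step for k in range(n + 1)]
    return max(grid, key=score)


def coarse_to_fine_scan(score, lo, hi, coarse_step, fine_step):
    """Coarse scan over the full range, then a fine scan near the best point."""
    coarse_best = scan(score, lo, hi, coarse_step)
    # Restrict the fine scan to one coarse step on either side of the estimate.
    fine_lo = max(lo, coarse_best - coarse_step)
    fine_hi = min(hi, coarse_best + coarse_step)
    return scan(score, fine_lo, fine_hi, fine_step)
```

For a range of -90 to 90 with a coarse step of 15 and a fine step of 1, the two passes evaluate 13 + 31 = 44 candidates instead of the 181 needed for a full fine scan.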

Test Automation for Symbol Recognition on the Map

31st IEEE Conference on Signal Processing and Communications Applications (SIU)

Authors: Turhan, Fatmanur; Carkacioglu, Levent; Toreyin, Behcet Ugur (Aselsan AS, Ankara, Turkiye; Istanbul Tech Univ, Bilisim Enstitusu, Istanbul, Turkiye)

ISBN: (print) 9798350343557

In this study, various machine learning and image analysis approaches such as Template Matching, HOG, SVM, Faster RCNN and YOLO are examined and compared for the symbol recognition problem in color maps. Some difficulties were identified regarding the forms of the symbols, the complexity of the maps and the placement of the symbols on the map. Based on the experiments, observations on how each method succeeds or fails against these difficulties are presented. It has been observed that methods involving artificial neural networks are more successful when performing symbol recognition on color maps. The best result, 91%, was obtained with Faster RCNN.

Keywords: Symbol Recognition; Feature Extraction; Support Vector Machines; Template Matching; Convolutional Neural Network; Object Detection; Software Testing
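
Of the approaches the abstract above compares, template matching is the simplest to show in a few lines. The sketch below slides a template over a grayscale image and scores each position with zero-mean normalized cross-correlation. It is a generic textbook formulation, not the study's implementation, and the function names and the list-of-lists image layout are assumptions for this example.

```python
import math


def ncc(window, template):
    """Zero-mean normalized cross-correlation of two equal-size 2D lists."""
    w = [v for row in window for v in row]
    t = [v for row in template for v in row]
    w_mean, t_mean = sum(w) / len(w), sum(t) / len(t)
    num = sum((a - w_mean) * (b - t_mean) for a, b in zip(w, t))
    den = math.sqrt(sum((a - w_mean) ** 2 for a in w)
                    * sum((b - t_mean) ** 2 for b in t))
    # A flat window or template has no structure to correlate with.
    return num / den if den > 0 else 0.0


def match_template(image, template):
    """Return ((row, col), score) of the best-matching top-left position."""
    th, tw = len(template), len(template[0])
    best_score, best_pos = -2.0, (0, 0)
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            window = [row[c:c + tw] for row in image[r:r + th]]
            s = ncc(window, template)
            if s > best_score:
                best_score, best_pos = s, (r, c)
    return best_pos, best_score
```

A matcher like this is sensitive to changes in symbol scale, rotation and background clutter, which is consistent with the abstract's observation that neural-network methods cope better with complex maps.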

NERF-GAZE: A HEAD-EYE REDIRECTION PARAMETRIC MODEL FOR GAZE ESTIMATION

49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Yin, Pengwei; Wang, Jingjing; Dai, Jiawu; Wu, Xiaojun (Hikvis Res Inst, Hangzhou, Peoples R China; Harbin Inst Technol Shenzhen, Shenzhen, Peoples R China)

ISBN: (print) 9798350344868; 9798350344851

Gaze estimation is a fundamental aspect of many visual tasks. However, the high cost of acquiring gaze datasets with 3D annotations hinders the optimization and application of gaze estimation models. In this work, we propose a novel Head-Eye redirection parametric model based on Neural Radiance Field. This model allows for dense gaze data generation with view consistency and accurate gaze direction. Furthermore, our head-eye redirection parametric model can decouple the face and eyes for separate neural rendering, which enables us to separately control the attributes of the face, identity, illumination, and eye gaze direction. As a result, diverse 3D-aware gaze datasets can be obtained by manipulating the latent code belonging to different face attributes in an unsupervised manner. Our method achieves state-of-the-art performance in image quality and gaze annotation accuracy compared with existing gaze data synthesis methods. Extensive experiments on several benchmarks demonstrate that our method can effectively improve domain generalization and domain adaptation in the gaze estimation task.

Keywords: gaze estimation; neural radiance field; continuous image generation

CORRELATION-AWARE JOINT PRUNING-QUANTIZATION USING GRAPH NEURAL NETWORKS

2024 International Conference on Image Processing

Authors: Nor-Azman, Muhammad Nor Azzafri; Sheikh, Usman Ullah; Mohammed, Mohammed Sultan; Sirkunan, Jeevan; Marsono, Muhammad Nadzir (Univ Teknol Malaysia, Dept Elect & Comp Engn, Fac Elect Engn, Johor Baharu 81310, Malaysia)

ISBN: (print) 9798350349405; 9798350349399

Deep learning in image classification has achieved remarkable success but at the cost of high resource demands. Model compression through automatic joint pruning-quantization addresses this issue, yet most existing techniques overlook a critical aspect: layer correlations. These correlations are essential as they expose redundant computations across layers, and leveraging them facilitates efficient design space exploration. This study employs Graph Neural Networks (GNNs) to learn these inter-layer relationships, thereby optimizing the pruning-quantization strategy for the targeted model. This approach has yielded a 99.36% reduction in complexity for ResNet20 on CIFAR-10, with only a minimal 0.11% drop in accuracy. Furthermore, the integration of the GNN sped up convergence, reducing the number of iterations by a factor of 2.46 on average compared to methods without a GNN.

Keywords: Model Compression; Pruning; Quantization; CNN; GNN
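
To make the two operations named in the abstract above concrete, the following minimal sketch applies magnitude pruning and then symmetric uniform quantization to a flat list of weights. It does not reproduce the GNN-based strategy search the paper proposes. It only illustrates the primitives such a search would configure per layer, and the function names, the `sparsity` and `bits` parameters, and the specific pruning and quantization rules are assumptions for this example.

```python
def prune_by_magnitude(weights, sparsity):
    """Zero out roughly the `sparsity` fraction of smallest-magnitude weights."""
    k = int(round(sparsity * len(weights)))
    if k == 0:
        return list(weights)
    # Magnitude of the k-th smallest weight; ties at the threshold are pruned too.
    threshold = sorted(abs(w) for w in weights)[k - 1]
    return [0.0 if abs(w) <= threshold else w for w in weights]


def quantize_uniform(weights, bits):
    """Symmetric uniform quantization onto 2**(bits-1) - 1 levels per sign."""
    scale = max(abs(w) for w in weights)
    levels = 2 ** (bits - 1) - 1
    if scale == 0 or levels == 0:
        return list(weights)
    # Map to integer levels, round, then map back to the original range.
    return [round(w / scale * levels) / levels * scale for w in weights]
```

Applying the two steps in sequence, for example `quantize_uniform(prune_by_magnitude(w, 0.5), 2)`, gives a sparse, low-precision copy of `w`. Choosing `sparsity` and `bits` for every layer jointly is the design-space search the abstract refers to.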

NERD: NEURAL FIELD-BASED DEMOSAICKING

30th IEEE International Conference on Image Processing (ICIP)

Authors: Kerepecky, Tomas; Sroubek, Filip; Novozamsky, Adam; Flusser, Jan (Czech Acad Sci, Inst Informat Theory & Automat, Prague, Czech Republic; Czech Tech Univ, Fac Nucl Sci & Phys Engn, Prague, Czech Republic)

ISBN: (print) 9781728198354

We introduce NeRD, a new demosaicking method for generating full-color images from Bayer patterns. Our approach leverages advancements in neural fields to perform demosaicking by representing an image as a coordinate-based neural network with sine activation functions. The inputs to the network are spatial coordinates and a low-resolution Bayer pattern, while the outputs are the corresponding RGB values. An encoder network, which is a blend of ResNet and U-Net, enhances the implicit neural representation of the image to improve its quality and ensure spatial consistency through prior learning. Our experimental results demonstrate that NeRD outperforms traditional and state-of-the-art CNN-based methods and significantly closes the gap to transformer-based methods.

Keywords: Demosaicking; neural field; implicit neural representation
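
Demosaicking, as described in the abstract above, starts from a Bayer pattern in which each pixel records only one color channel. The sketch below produces such a single-channel mosaic from a full RGB image, which is the forward operation any demosaicking method has to invert. The RGGB layout, the function name and the tuple-based image representation are assumptions for this example, since the abstract does not state which Bayer arrangement is used.

```python
def bayer_mosaic_rggb(rgb):
    """Sample an H x W image of (r, g, b) tuples into a single-channel mosaic.

    Assumed RGGB layout: red at (even row, even col), blue at (odd row, odd col),
    green at the two remaining positions of every 2 x 2 cell.
    """
    mosaic = []
    for r, row in enumerate(rgb):
        out_row = []
        for c, (red, green, blue) in enumerate(row):
            if r % 2 == 0 and c % 2 == 0:
                out_row.append(red)
            elif r % 2 == 1 and c % 2 == 1:
                out_row.append(blue)
            else:
                out_row.append(green)
        mosaic.append(out_row)
    return mosaic
```

Two thirds of the color samples are discarded by this sampling, which is why reconstruction quality depends so strongly on the prior a method brings to the problem.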

Deep Residual and Classified Neural Networks for Inverse Halftoning

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Authors: Guo, Jing-Ming; Sankarasrinivasan, S.; Let Viet Hung; Liu, Wei (Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 10607, Taiwan; Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China)

ISBN: (print) 9798350300673

Inverse halftoning is an ill-posed problem in which a continuous-tone image is restored from a halftone image. Many conventional inverse halftoning methods have tried to solve this problem, yet the recovered images still suffer from unwanted artifacts and loss of fine details. Recent deep neural network-based approaches have shown advantages in restoring high-quality images with rich textures and detailed information. However, it is challenging for these deep learning methods to reconstruct a variety of different halftone patterns. For instance, a model trained on halftone patterns with a homogeneous distribution does not perform well on patterns with high structural information. To solve this problem, an inverse halftoning method based on a deep residual neural network (DRNN) and variance classification is proposed. The proposed method exploits the progressive learning concept in two main stages: first, the DRNN extracts numerous intrinsic features of an image and significantly removes the halftone patterns. Subsequently, consecutive deep residual blocks are integrated into the network to restore the fine details with good accuracy. The proposed model thus comprises several DRNNs, each trained over a different statistical range with respect to the statistics of halftone patches. Comprehensive experimental results demonstrate that the proposed deep learning-based technique significantly outperforms not only the conventional methods but also other deep learning approaches.

Keywords: Inverse halftoning; progressive learning; residual neural networks; variance classification; convolutional neural network
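
The abstract above describes training separate networks over different statistical ranges of halftone patches. One plausible reading of the variance classification step, sketched here under that assumption, is to compute each patch's pixel variance and use it to pick which network handles the patch. The threshold values, function names and the three-way split are hypothetical and are not taken from the paper.

```python
from bisect import bisect_right
from statistics import pvariance


def patch_variance(patch):
    """Population variance of all pixel values in a 2D patch."""
    return pvariance([v for row in patch for v in row])


def classify_patch(patch, thresholds):
    """Return the index of the variance range the patch falls into.

    `thresholds` is an ascending list of cut points, so the result is in
    0..len(thresholds) and can index a list of per-range models.
    """
    return bisect_right(thresholds, patch_variance(patch))
```

With hypothetical thresholds `[0.05, 0.15]`, a uniform patch lands in class 0 and a 0/1 checkerboard patch (variance 0.25) lands in class 2, so each would be routed to a different restoration network.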

Hazy Removal via Graph Convolutional with Attention Network

Journal of Signal Processing Systems for Signal, Image, and Video Technology, 2023, Vol. 95, No. 4, pp. 517-527

Authors: Hu, Bin; Yue, Zhuangzhuang; Gu, Mingcen; Zhang, Yan; Xu, Zhen; Li, Jinhang (Nantong Univ, Sch Informat Sci & Technol, Nantong, Jiangsu, Peoples R China)

Most deep learning-based single image dehazing methods use convolutional neural networks (CNNs) to extract features; however, CNNs can only capture local features. To address this limitation, we propose a basic module that combines a CNN and a graph convolutional network (GCN) to capture both local and non-local features. The basic module consists of a CNN with triple attention modules (CAM) and a dual GCN module (DGM). CAM, which combines channel attention, spatial attention and pixel attention, is designed to assign more weight to important local features. DGM combines spatial coherence computing and channel correlation computing to extract non-local information. The architecture of the network is similar to U-Net, and the skip connections used in the symmetrical network pass image details from shallow layers to deep layers. Experimental results on several datasets indicate that the proposed method outperforms state-of-the-art methods both quantitatively and qualitatively.

Keywords: Graph convolutional network; Attention; Image dehazing; Deep learning
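
Channel attention is one of the three attention types the abstract above combines in its CAM module. The abstract does not give the module's exact form, so the sketch below shows only a generic squeeze-and-excitation style channel gate as an assumed stand-in: each channel is averaged, passed through a small bottleneck, and rescaled by a sigmoid weight. The weight matrices `w1` and `w2` are hypothetical placeholders for parameters that would normally be learned.

```python
import math


def channel_attention(feat, w1, w2):
    """Rescale each channel of `feat` by a learned-style gate in (0, 1).

    feat: list of C channels, each an H x W list of lists.
    w1:   bottleneck weights, one row of length C per hidden unit.
    w2:   output weights, one row per channel, each as long as the hidden layer.
    """
    # Squeeze: global average pooling gives one number per channel.
    squeeze = [sum(v for row in ch for v in row) / (len(ch) * len(ch[0]))
               for ch in feat]
    # Excitation: bottleneck with ReLU, then a sigmoid gate per channel.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeeze))) for row in w1]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
             for row in w2]
    # Rescale: multiply every value in a channel by that channel's gate.
    scaled = [[[v * g for v in row] for row in ch] for ch, g in zip(feat, gates)]
    return scaled, gates
```

Spatial and pixel attention follow the same gate-and-rescale pattern but compute one weight per location or per element rather than per channel.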
