检索结果-内蒙古大学图书馆

26th IEEE International Conference on image processing (ICIP)

作者： Valsesia, Diego Fracastoro, Giulia Magli, Enrico Politecn Torino Dept Elect & Telecommun Turin Italy

ISBN: (纸本)9781538662496

Recovering an image from a noisy observation is a key problem in signal processing. Recently, it has been shown that data-driven approaches employing convolutional neural networks can outperform classical model-based techniques, because they can capture more powerful and discriminative features. However, since these methods are based on convolutional operations, they are only capable of exploiting local similarities without taking into account non-local self-similarities. In this paper we propose a convolutional neural network that employs graph-convolutional layers in order to exploit both local and non-local similarities. The graph-convolutional layers dynamically construct neighborhoods in the feature space to detect latent correlations in the feature maps produced by the hidden layers. The experimental results show that the proposed architecture outperforms classical convolutional neural networks for the denoising task.

关键词： image denoising Convolutional neural Networks Graph convolution

来源：评论

学校读者我要写书评

暂无评论

Efficient transformation of ECG signals from 1-D to 2-D for atrial fibrillation detection using deep learning

引用

signal, image and Video processing 2025年第9期19卷

作者： Gao, Jiahui Li, Yongjian Chen, Meng Zhang, Xiuxin Sun, Yiheng Jiang, Xinge Wei, Shoushui School of Control Science and Engineering Shandong University Jinan China School of Information Science and Electrical Engineering Shandong Jiaotong University Jinan China

With the widespread use of wearable electrocardiographic (ECG) devices, there’s a growing need for efficient processing of large-scale real-time data to detect cardiovascular diseases. Deep learning, known for its accuracy in ECG signal analysis, has emerged as a crucial tool in computer-aided diagnosis. Leveraging two-dimensional (2-D) representations like time–frequency diagrams, Poincaré plots, and Gramian Angular Fields can enhance deep learning’s capability in capturing edge and texture features. However, the computational complexity of high-resolution images poses challenges for clinical application. methods: This study proposes a Z-shaped reconstruction method to transform 1-D time series into 2-D modalities. Additionally, this study introduces a multiscale Squeeze-and-Excitation based convolutional neural network (SE-ConvNet) that integrates multiscale convolutional kernels and attention mechanisms. This facilitates rapid localization of key channel information while simultaneously reducing parameter count and computational costs. Our method achieves a significantly lower FLOPs (185 M) compared to inputting 2-D images (2529 M). The accuracies of the proposed method on the public database and clinical dataset were 99.30 and 99.04%, respectively, with F1 scores of 99.09 and 99.03%. Moreover, we verify its generalization ability, demonstrating its potential for practical clinical use. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

GW-DC: A Deep Clustering Model Leveraging Two-Dimensional image Transformation and Enhancement

引用

ALGORITHMS 2021年第12期14卷 349页

作者： Li, Xutong Li, Taoying Wang, Yan Dalian Maritime Univ Sch Maritime Econ & Management Dalian 116026 Peoples R China

Traditional time-series clustering methods usually perform poorly on high-dimensional data. However, image clustering using deep learning methods can complete image annotation and searches in large image databases well. Therefore, this study aimed to propose a deep clustering model named GW_DC to convert one-dimensional time-series into two-dimensional images and improve cluster performance for algorithm users. The proposed GW_DC consisted of three processing stages: the image conversion stage, image enhancement stage, and image clustering stage. In the image conversion stage, the time series were converted into four kinds of two-dimensional images by different algorithms, including grayscale images, recurrence plot images, Markov transition field images, and Gramian Angular Difference Field images;this last one was considered to be the best by comparison. In the image enhancement stage, the signal components of two-dimensional images were extracted and processed by wavelet transform to denoise and enhance texture features. Meanwhile, a deep clustering network, combining convolutional neural networks with K-Means, was designed for well-learning characteristics and clustering according to the aforementioned enhanced images. Finally, six UCR datasets were adopted to assess the performance of models. The results showed that the proposed GW_DC model provided better results.

关键词： two-dimensional image Gramian Angular Difference Field wavelet transform deep embedded clustering

来源：评论

学校读者我要写书评

暂无评论

A Deep Learning-Based Pipeline for Multi-Class Motor imagery Problems with Small Portion of Labeled Datasets

A Deep Learning-Based Pipeline for Multi-Class Motor Imagery...

引用

Iranian Conference of Biomedical Engineering (ICBME)

作者： Neda Abdollahpour Mohammadreza Yazdchi Zahra Baharlouei Department of Biomedical Engineering Ragheb Isfahani Institute of Higher Education Isfahan Iran Medical Image and Signal Processing Research Center School of Advanced Technologies in Medicine Isfahan University of Medical Sciences Isfahan Iran

In this article, a new framework is proposed to address multi-class Motor imagery Brain-Computer Interface (MIBCI) problems containing a small portion of labeled datasets. In this framework, the combination of Independent Component Analysis (ICA), multi-class Common Spatial Pattern (CSP), and a functional Application Programming Interface (API) model assumes a pivotal role. In the feature extraction stage of the work, a concatenated altered signal affected by spatial weights is proposed for each trial in three frequency ranges. This distribution of features can both provide suitable feature maps for augmentation, preparing data for the deep learning analysis, and underscore distinguishable features of MI classes. In the classification stage, spatial and temporal features are dominated by using the effective combination of a one-dimensional Convolutional neural Network (CNN) and a two-staged Bidirectional Long Short-Term Memory (BLSTM) in three branches containing different distributions of frequency. Given that, the model simultaneously learns past-to-future and future-to-past patterns in two stages. The experimental result on datasets 2a BCI-Competition IV illustrates that the proposed method can be liable, practical and more competitive than the other popular methods pointed out in this paper. All in all, the proposed framework can alleviate the issue of small portions of labeled datasets in MI problems.

关键词： Deep learning Biological system modeling Pipelines Independent component analysis Feature extraction Brain modeling Brain-computer interfaces

来源：评论

学校读者我要写书评

暂无评论

VARIATIONAL AND HIERARCHICAL RECURRENT AUTOENCODER 44

VARIATIONAL AND HIERARCHICAL RECURRENT AUTOENCODER

引用

44th IEEE International Conference on Acoustics, Speech and signal processing (ICASSP)

作者： Chien, Jen-Tzung Wang, Chun-Wei Natl Chiao Tung Univ Dept Elect & Comp Engn Hsinchu Taiwan

ISBN: (纸本)9781479981311

Despite a great success in learning representation for image data, it is challenging to learn the stochastic latent features from natural language based on variational inference. The difficulty in stochastic sequential learning is due to the posterior collapse caused by an autoregressive decoder which is prone to be too strong to learn sufficient latent information during optimization. To compensate this weakness in learning procedure, a sophisticated latent structure is required to assure good convergence so that random features are sufficiently captured for sequential decoding. This study presents a new variational recurrent autoencoder (VRAE) for sequence reconstruction. There are two complementary encoders consisting of a long short-term memory (LSTM) and a pyramid bidirectional LSTM which are merged to discover the global and local dependencies in a hierarchical latent variable model, respectively. Experiments on Penn Treebank and Yelp 2013 demonstrate that the proposed hierarchical VRAE is able to learn the complementary representation as well as tackle the posterior collapse in stochastic sequential learning. The performance of recurrent autoencoder is substantially improved in terms of perplexity.

关键词： Sequence generation recurrent neural network variational autoencoder hierarchical model

来源：评论

学校读者我要写书评

暂无评论

Full RGB Just Noticeable Difference (JND) Modelling

arXiv

引用

arXiv 2022年

作者： Jin, Jian Yu, Dong Lin, Weisi Meng, Lili Wang, Hao Zhang, Huaxiang The School of Computer Science and Engineering Nanyang Technological University 639798 Singapore AlibabaNTU Singapore Joint Research Institute Nanyang Technological University 639798 Singapore The School of Information Science and Engineering Shandong Normal University Jinan250014 China The Alibaba cloud business group Alibaba Hangzhou310052 China

Just Noticeable Difference (JND) has many applica-tions in multimedia signal processing, especially for visual data processing up to date. It's generally defined as the minimum visual content changes that the human can perspective, which has been studied for decades. However, most of the existing methods only focus on the luminance component of JND modelling and simply regard chrominance components as scaled versions of luminance. In this paper, we propose a JND model to generate the JND by taking the characteristics of full RGB channels into account, termed as the RGB-JND. To this end, an RGB-JND-NET is proposed, where the visual content in full RGB channels is used to extract features for JND generation. To supervise the JND generation, an adaptive image quality assessment combination (AIC) is developed. Besides, the RDB-JND-NET also takes the visual attention into account by automatically mining the underlying relationship between visual attention and the JND, which is further used to constrain the JND spatial distribution. To the best of our knowledge, this is the first work on careful investigation of JND modelling for full-color space. Experimental results demonstrate that the RGB-JND-NET model outperforms the relevant state-of-the-art JND models. Besides, the JND of the red and blue channels are larger than that of the green one according to the experimental results of the proposed model, which demonstrates that more changes can be tolerated in the red and blue channels, in line with the well-known fact that the human visual system is more sensitive to the green channel in comparison with the red and blue ones. Copyright © 2022, The Authors. All rights reserved.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

Multimodal Graph Coarsening for Interpretable, MRI-Based Brain Graph neural Network

Multimodal Graph Coarsening for Interpretable, MRI-Based Bra...

引用

IEEE Workshop on Machine Learning for signal processing

作者： Isaac Sebenius Alexander Campbell Sarah E. Morgan Edward T. Bullmore Pietro Liò Department of Computer Science and Technology University of Cambridge United Kingdom Departmcnt of Psychiatry University of Cambridge United Kingdom Alan Turing Institute London United Kingdom

ISBN: (数字)9781728163383

ISBN: (纸本)9781665411844

Graph neural networks (GNN s) are a powerful class of model for representation learning on relational data and graph-structured signal, such as brain connectivity graphs derived from neuroimaging. To date, existing work applying graph learning methods to brain connectivity is limited to a single neuroimaging modality such as structural or functional MRI. In practice, the brain is best represented by multiple networks arising from different imaging modalities. We develop a gen-eral framework for jointly pooling multimodal graphs which share the same set of underlying nodes whilst differing in edge connectivity. Building on this approach, we propose a multimodal GNN (MM-GNN) model that incorporates mul-tiple types of neuroimaging-based brain connectivity. When applied to the task of classifying brain images from patients with schizophrenia and healthy control subjects, we observe that incorporating multimodal pooling dramatically improves performance over non-pooled networks and that MM-GNN matches or improves performance over multiple single-modal and non-GNN baselines. Finally, we demonstrate how our approach uses multimodal data to learn a unified, interpretable measure of the salience of individual brain regions of interest. In this way, MM-GNN represents a new method for leveraging diverse brain connectivity data to enhance the detection of mental health disorders and to understand their biological underpinnings.

关键词： Neuroimaging Representation learning Learning systems Magnetic resonance imaging image edge detection Mental health signal processing

来源：评论

学校读者我要写书评

暂无评论

Blur image identification with ensemble convolution neural networks

引用

signal processing 2019年 155卷 73-82页

作者： Wang, Rui Li, Wei Zhang, Liang Beihang Univ Sch Instrumentat Sci & Optoelect Engn Key Lab Precis Optomechatron Technol Minist Educ 37 Xueyuan Rd Beijing 100191 Haidian Peoples R China Univ Connecticut Dept Elect & Comp Engn 371 Fairfield WayU-4157 Storrs CT 06269 USA

Blur image classification is a key step to image recovery in image processing. In this article, an ensemble convolution neural network (CNN) is designed to identify and classify four types of blur images: defocus blur, Gaussian blur, haze blur, and motion blur. To achieve this, a two-stage pipeline, comprised of deep compression and ensemble technique, is proposed to enhance model discriminability without incurring additional computing burden. Specifically, our method first prunes the well-known networks, Alexnet and GoogleNet, by an appropriate compression ratio. The pruned networks are denoted as Simplified-Fast-Alexnet (SFA) and Simplified-Fast-GoogleNet (SFGN). Next, we employ an ensemble policy to integrate the SFA with SFGN as SFA+SFGN by assigning their respective weights based on a voting mechanism. In addition, to provide a benchmark set of blur image samples for training and testing blur classification models, we create a new public blur image dataset (available online at http://***/info/1092/***) containing 80,000+ patch-level, naturally blurred photographs, constructed using the improved super-pixel segmentation method, and 200,000+ artificially blurred images. Numerical experiments demonstrate the superior performance of the proposed approach in comparison with the original Alexnet and GoogleNet, as well as other state-of-the-art methods. (C) 2018 Elsevier B.V. All rights reserved.

关键词： Blur image classification image blur modeling SFA plus SFGN model Batch normalization Ensemble deep convolution neural network

来源：评论

学校读者我要写书评

暂无评论

Metaphase finding with deep convolutional neural networks

引用

BIOMEDICAL signal processing AND CONTROL 2019年第0期52卷 353-361页

作者： Moazzen, Yaser Capar, Abdulkerim Albayrak, Abdulkadir Calik, Nurullah Toreyin, Behcet Ugur Istanbul Tech Univ Informat Inst Ayazaga Campus Istanbul Turkey Yildiz Tech Univ Fac Elect & Elect Engn Davutpasa Campus Istanbul Turkey Dicle Univ Dept Comp Engn Diyarbakir Turkey

Background: Finding analyzable metaphase chromosome images is an essential step in karyotyping which is a common task for clinicians to diagnose cancers and genetic disorders precisely. This step is tedious and time-consuming. Hence developing automated fast and reliable methods to assist clinical technicians becomes indispensable. Previous approaches include methods with feature extraction followed by rule or quality based classifiers, component analysis, and neural networks. methods: A two-stage automated metaphase-finding scheme, consisting of an image processing based metaphase detection stage, and a deep convolutional neural network based selection stage is proposed. The first stage detects metaphase images from 10x scan of specimen slides. The selection stage, on the other hand, selects the analyzable ones among them. Results: The proposed scheme has a 99.33% true positive rate and 0.34% of the false positive rate of metaphase finding. Conclusion: This study demonstrates an effective scheme for the automated finding of analyzable metaphase images with high True positive and low False positive rates. (C) 2019 Elsevier Ltd. All rights reserved.

关键词： Metaphase detection Karyotyping Deep convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Tensor rank learning in CP decomposition via convolutional neural network

引用

signal processing-image COMMUNICATION 2019年 73卷 12-21页

作者： Zhou, Mingyi Liu, Yipeng Long, Zhen Chen, Longxi Zhu, Ce Univ Elect Sci & Technol China Ctr Informat Med Ctr Robot Sch Informat & Commun Engn Xiyuan Ave 2006 Chengdu Sichuan Peoples R China

Tensor factorization is a useful technique for capturing the high-order interactions in data analysis. One assumption of tensor decompositions is that a predefined rank should be known in advance. However, the tensor rank prediction is an NP-hard problem. The CANDECOMP/PARAFAC (CP) decomposition is a typical one. In this paper, we propose two methods based on convolutional neural network (CNN) to estimate CP tensor rank from noisy measurements. One applies CNN to the CP rank estimation directly. The other one adds a pre-decomposition for feature acquisition, which inputs rank-one components to CNN. Experimental results on synthetic and real-world datasets show the proposed methods outperforms state-of-the-art methods in terms of rank estimation accuracy.

关键词： CANDECOMP/PARAFAC decomposition Convolutional neural network Deep learning Low rank tensor approximation Tensor rank estimation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：