检索结果-内蒙古大学图书馆

End-to-end trained encoder-decoder convolutional neural network for fetal electrocardiogram signal denoising

PHYSIOLOGICAL MEASUREMENT 2020年第1期41卷 015005-015005页

作者： Fotiadou, Eleni Konopczynski, Tomasz Hesser, Juergen Vullings, Rik Eindhoven Univ Technol Dept Elect Engn NL-5612 AP Eindhoven Netherlands Heidelberg Univ Med Fac Mannheim Cent Inst Sci Comp IWR Dept Radiat Oncol Heidelberg Germany Heidelberg Univ Cent Inst Comp Engn ZITI Med Fac Mannheim Heidelberg Germany

Objective: Non-invasive fetal electrocardiography has the potential to provide vital information for evaluating the health status of the fetus. However, the low signal-to-noise ratio of the fetal electrocardiogram (ECG) impedes the applicability of the method in clinical practice. Quality improvement of the fetal ECG is of great importance for providing accurate information to enable support in medical decision-making. In this paper we propose the use of artificial intelligence for the task of one-channel fetal ECG enhancement as a post-processing step after maternal ECG suppression. Approach: We propose a deep fully convolutional encoder-decoder framework, learning end-to-end mappings from noise-contaminated fetal ECGs to clean ones. Symmetric skip-layer connections are used between corresponding convolutional and transposed convolutional layers to help recover the signal details. Main results: Experiments on synthetic data show an average improvement of 7.5 dB in the signal-to-noise ratio (SNR) for input SNRs in the range of -15 to 15 dB. Application of the method with real signals and subsequent ECG interval analysis demonstrates a root mean square error of 9.9 and 14 ms for the PR and QT intervals, respectively, when compared with simultaneous scalp measurements. The proposed network can achieve substantial noise removal on both synthetic and real data. In cases of highly noise-contaminated signals some morphological features might be unreliably reconstructed. Significance: The presented method has the advantage of preserving individual variations in pulse shape and beat-to-beat intervals. Moreover, no prior knowledge on the power spectra of the noise or the pulse locations is required.

关键词： convolutional neural networks encoder-decoder network fetal ECG denoising fetal ECG enhancement fetal electrocardiography

来源：评论

学校读者我要写书评

暂无评论

SSDBN: A Single-Side Dual-Branch network with encoder-decoder for Building Extraction

引用

REMOTE SENSING 2022年第3期14卷 768-768页

作者： Li, Yang Lu, Hui Liu, Qi Zhang, Yonghong Liu, Xiaodong Nanjing Univ Informat Sci & Technol Minist Educ Sch Comp & Software Engn Res Ctr Digital Forens Nanjing 210044 Peoples R China Nanjing Univ Informat Sci & Technol Sch Automat Nanjing 210044 Peoples R China Edinburgh Napier Univ Sch Comp Edinburgh EH10 5DT Midlothian Scotland

In the field of building detection research, an accurate, state-of-the-art semantic segmentation model must be constructed to classify each pixel of the image, which has an important reference value for the statistical work of a building area. Recent research efforts have been devoted to semantic segmentation using deep learning approaches, which can be further divided into two aspects. In this paper, we propose a single-side dual-branch network (SSDBN) based on an encoder-decoder structure, where an improved Res2Net model is used at the encoder stage to extract the basic feature information of prepared images while a dual-branch module is deployed at the decoder stage. An intermediate framework was designed using a new feature information fusion methods to capture more semantic information in a small area. The dual-branch decoding module contains a deconvolution branch and a feature enhancement branch, which are responsible for capturing multi-scale information and enhancing high-level semantic details, respectively. All experiments were conducted using the Massachusetts Buildings Dataset and WHU Satellite Dataset I (global cities). The proposed model showed better performance than other recent approaches, achieving an F1-score of 87.69% and an IoU of 75.83% with a low network size volume (5.11 M), internal parameters (19.8 MB), and GFLOPs (22.54), on the Massachusetts Buildings Dataset.

关键词： building extraction dual-branch semantic segmentation encoder-decoder network

来源：评论

学校读者我要写书评

暂无评论

MSANet: Mixed Spectral and Attention network for Robust 3D Human Pose Estimation

MSANet: Mixed Spectral and Attention Network for Robust 3D H...

引用

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

作者： Wang, Bing Wu, Suping Zhang, Xitie Shi, Liyuan Yang, Sheng Duan, Zhijian Xiong, Tuo Ningxia University School of Information Engineering Ningxia China

ISBN: (纸本)9798350368741

Despite significant advances in 3D human pose estimation from a single-view video, existing methods often struggle to produce reasonable human poses when the human is heavily occluded or blurred. To address this issue, we propose a Mixed Spectral and Attention network (MSANet) that stacks spectral and attention blocks alternately. The attention block captures visual cues before and after occlusions or blurs, while the spectral block perceives subtle localized occlusions or blurs for robust 3D human pose estimation. Specifically, our attention block captures the global information of intra-frame joints and enhances the coherent representation of inter-frame joints, the spectral block complements the intra- and inter-frame local occlusions information which is difficult to capture by the attention block. In addition, we improve the regression head (IRH) to narrow the grained gap between joint-level feature extraction and frame-level pose regression for smooth regression. With better temporal consistency and subtle localized occlusion awareness, our MSANet outperforms previous state-of-the-art methods on the commonly used benchmarks Human3.6M and MPI-INF-3DHP. Moreover, MSANet demonstrates broad real world applicability, realizing occlusions and blurs robust and accurate 3D pose estimation. The Code will be made public. © 2025 IEEE.

关键词： 3D Human Pose Estimation Attention Blurs encoder-decoder network Occlusions Spectral

来源：评论

学校读者我要写书评

暂无评论

Are all shortcuts in encoder-decoder networks beneficial for CT denoising?

引用

COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION 2023年第1期11卷 59-66页

作者： Chen, Junhua Zhang, Chong Wee, Leonard Dekker, Andre Bermejo, Inigo Maastricht Univ Grow Sch Oncol & Dev Biol Dept Radiat Oncol MAASTRO Med Ctr NL-6229 ET Maastricht Netherlands

Denoising of CT scans has attracted the attention of many researchers in the medical image analysis domain. encoder-decoder networks are deep learning neural networks that have become common for image denoising in recent years. Shortcuts between the encoder and decoder layers are crucial for some image-to-image translation tasks. However, are all shortcuts necessary for CT denoising? To answer this question, we set up two encoder-decoder networks representing two popular architectures and then progressively removed shortcuts from the networks from shallow to deep (forward removal) and from deep to shallow (backward removal). We used two unrelated datasets with different noise levels to test the denoising performance of these networks using two metrics, namely root mean square error and content loss. The results show that while more than half of the shortcuts are still indispensable for CT scan denoising, removing certain shortcuts leads to performance improvement for denoising. Both shallow and deep shortcuts might be removed, thus retaining sparse connections, especially when the noise level is high. Backward removal seems to have a better performance than forward removal, which means deep shortcuts have priority to be removed. Finally, we propose a hypothesis to explain this phenomenon and validate it in the experiments.

关键词： Deep learning encoder-decoder network medical image denoising shortcuts comparative analysis

来源：评论

学校读者我要写书评

暂无评论

Multimodal 3D medical image registration guided by shape encoder-decoder networks

引用

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY 2020年第2期15卷 269-276页

作者： Blendowski, Max Bouteldja, Nassim Heinrich, Mattias P. Univ Lubeck Inst Med Informat Ratzeburger Allee 160 D-23562 Lubeck Germany Rhein Westfal TH Aachen Inst Imaging & Comp Vis Templergraben 55 D-52056 Aachen Germany

Purpose Nonlinear multimodal image registration, for example, the fusion of computed tomography (CT) and magnetic resonance imaging (MRI), fundamentally depends on a definition of image similarity. Previous methods that derived modality-invariant representations focused on either global statistical grayscale relations or local structural similarity, both of which are prone to local optima. In contrast to most learning-based methods that rely on strong supervision of aligned multimodal image pairs, we aim to overcome this limitation for further practical use cases. Methods We propose a new concept that exploits anatomical shape information and requires only segmentation labels for both modalities individually. First, a shape-constrained encoder-decoder segmentation network without skip connections is jointly trained on labeled CT and MRI inputs. Second, an iterative energy-based minimization scheme is introduced that relies on the capability of the network to generate intermediate nonlinear shape representations. This further eases the multimodal alignment in the case of large deformations. Results Our novel approach robustly and accurately aligns 3D scans from the multimodal whole-heart segmentation dataset, outperforming classical unsupervised frameworks. Since both parts of our method rely on (stochastic) gradient optimization, it can be easily integrated in deep learning frameworks and executed on GPUs. Conclusions We present an integrated approach for weakly supervised multimodal image registration. Achieving promising results due to the exploration of intermediate shape features as registration guidance encourages further research in this direction.

关键词： Multimodal fusion Guided image registration encoder-decoder network Nonlinear shape interpolation

来源：评论

学校读者我要写书评

暂无评论

Deep convolutional encoder-decoder networks based on ensemble learning for semantic segmentation of high-resolution aerial imagery

引用

CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING 2024年第4期6卷 408-424页

作者： Zhu, Huming Liu, Chendi Li, Qiuming Zhang, Lingyun Wang, Libing Li, Sifan Jiao, Licheng Hou, Biao Xidian Univ Sch Artificial Intelligence Key Lab Intelligent Percept & Image Understanding Minist Educ Xian 710071 Shaanxi Peoples R China AVIC Xian Aeronaut Comp Tech Res Inst Xian Peoples R China China Star Network Applicat Co Ltd Chongqing Peoples R China

Due to the complexity of object information and optical conditions of high-resolution aerial imagery, it is difficult to obtain fine semantic segmentation performance. Although various deep neural network structures have been proposed to improve segmentation accuracy, there is still room for improving accuracy by making full use of multiscale features and integrating these single weak classifiers into a strong classifier. In this paper, we use a reduced SegNet network to realize the end-to-end classification of high-resolution aerial images. In addition, to use multiscale information, we present the R-SegUnet which combines the feature information of each convolution block in the reduced SegNet encoding network with the feature information of the corresponding convolution block in the decoding network. Furthermore, considering that the surface features in high-resolution aerial images are very complex, we investigate a 6to2_Net that converts the six-classification model into six binary-classification models for the recognition effect on small objects. Finally, we ensemble the above three different models to get the segmentation results. Experiment results on ISPRS Potsdam benchmark dataset show that our algorithm is state-of-the-art method. We also analyze the inference performance of our models on a variety of parallel computing devices.

关键词： Aerial imagery Semantic segmentation encoder-decoder network Ensemble ISPRS FCN

来源：评论

学校读者我要写书评

暂无评论

Turning brain MRI into diagnostic PET: ¹⁵O-water PET CBF synthesis from multi-contrast MRI via attention-based encoder-decoder networks

引用

MEDICAL IMAGE ANALYSIS 2024年 93卷 103072页

作者： Hussein, Ramy Shin, David Zhao, Moss Y. Guo, Jia Davidzon, Guido Steinberg, Gary Moseley, Michael Zaharchuk, Greg Stanford Univ Dept Radiol Radiol Sci Lab Stanford CA 94305 USA GE Healthcare Global MR Applicat & Workflow Menlo Pk CA 94025 USA Stanford Univ Stanford Cardiovasc Inst Stanford CA 94305 USA Univ Calif Riverside Dept Bioengn Riverside CA 92521 USA Stanford Univ Dept Radiol Div Nucl Med Stanford CA 94305 USA Stanford Univ Dept Neurosurg Stanford CA 94304 USA

Accurate quantification of cerebral blood flow (CBF) is essential for the diagnosis and assessment of a wide range of neurological diseases. Positron emission tomography (PET) with radiolabeled water (O-15-water) is the gold-standard for the measurement of CBF in humans, however, it is not widely available due to its prohibitive costs and the use of short-lived radiopharmaceutical tracers that require onsite cyclotron production. Magnetic resonance imaging (MRI), in contrast, is more accessible and does not involve ionizing radiation. This study presents a convolutional encoder-decoder network with attention mechanisms to predict the gold-standard O-15-water PET CBF from multi-contrast MRI scans, thus eliminating the need for radioactive tracers. The model was trained and validated using 5-fold cross-validation in a group of 126 subjects consisting of healthy controls and cerebrovascular disease patients, all of whom underwent simultaneous O-15-water PET/MRI. The results demonstrate that the model can successfully synthesize high-quality PET CBF measurements (with an average SSIM of 0.924 and PSNR of 38.8 dB) and is more accurate compared to concurrent and previous PET synthesis methods. We also demonstrate the clinical significance of the proposed algorithm by evaluating the agreement for identifying the vascular territories with impaired CBF. Such methods may enable more widespread and accurate CBF evaluation in larger cohorts who cannot undergo PET imaging due to radiation concerns, lack of access, or logistic challenges.

关键词： PET multi-contrast MRI cerebrovascular disease Attention mechanisms encoder-decoder network

来源：评论

学校读者我要写书评

暂无评论

Compressive MRI quantification using convex spatiotemporal priors and deep encoder-decoder networks

引用

MEDICAL IMAGE ANALYSIS 2021年 69卷 101945-101945页

作者： Golbabaee, Mohammad Buonincontri, Guido Pirkl, Carolin M. Menzel, Marion, I Menze, Bjoern H. Davies, Mike Gomez, Pedro A. Univ Bath Comp Sci Dept Bath Avon England Imago7 Fdn Pisa Italy Tech Univ Munich Comp Sci Dept Munich Germany GE Healthcare Munich Germany Univ Edinburgh Sch Engn Edinburgh Midlothian Scotland

We propose a dictionary-matching-free pipeline for multi-parametric quantitative MRI image computing. Our approach has two stages based on compressed sensing reconstruction and deep learned quantitative inference. The reconstruction phase is convex and incorporates efficient spatiotemporal regularisations within an accelerated iterative shrinkage algorithm. This minimises the under-sampling (aliasing) artefacts from aggressively short scan times. The learned quantitative inference phase is purely trained on physical simulations (Bloch equations) that are flexible for producing rich training samples. We propose a deep and compact encoder-decoder network with residual blocks in order to embed Bloch manifold projections through multi-scale piecewise affine approximations, and to replace the non-scalable dictionary matching baseline. Tested on a number of datasets we demonstrate effectiveness of the proposed scheme for recovering accurate and consistent quantitative information from novel and aggressively subsampled 2D/3D quantitative MRI acquisition protocols. (c) 2020 Elsevier B.V. All rights reserved.

关键词： Magnetic resonance fingerprinting Compressed sensing Convex model-based reconstruction Residual network encoder-decoder network

来源：评论

学校读者我要写书评

暂无评论

An Encoding-Decoding Framework Based on CNN for circ RNA-RBP Binding Sites Prediction

引用

Chinese Journal of Electronics 2024年第1期33卷 256-263页

作者： Yajing GUO Xiujuan LEI Yi PAN School of Computer Science Shaanxi Normal University Faculty of Computer Science and Control Engineering Shenzhen Institute of Advanced TechnologyChinese Academy of Sciences Department of Computer Science Georgia State University

Predicting RNA binding protein(RBP) binding sites on circular RNAs(circ RNAs) is a fundamental step to understand their interaction mechanism. Numerous computational methods are developed to solve this problem, but they cannot fully learn the features. Therefore, we propose circ-CNNED, a convolutional neural network(CNN)-based encoding and decoding framework. We first adopt two encoding methods to obtain two original matrices. We preprocess them using CNN before fusion. To capture the feature dependencies, we utilize temporal convolutional network(TCN) and CNN to construct encoding and decoding blocks, respectively. Then we introduce global expectation pooling to learn latent information and enhance the robustness of circ-CNNED. We perform circ-CNNED across 37 datasets to evaluate its effect. The comparison and ablation experiments demonstrate that our method is superior. In addition, motif enrichment analysis on four datasets helps us to explore the reason for performance improvement of circ-CNNED.

关键词： Circular RNAs (circRNAs) RNA binding proteins Convolutional neural network Temporal convolutional network encoder-decoder network

来源：评论

学校读者我要写书评

暂无评论

Global River Monitoring Using Semantic Fusion networks

引用

WATER 2020年第8期12卷 2258页

作者： Wei, Zhihao Jia, Kebin Jia, Xiaowei Khandelwal, Ankush Kumar, Vipin Beijing Univ Technol Fac Informat Technol Dept Informat & Commun Engn Beijing 100124 Peoples R China Univ Minnesota Dept Comp Sci & Engn Minneapolis MN 55455 USA

Global river monitoring is an important mission within the remote sensing society. One of the main challenges faced by this mission is generating an accurate water mask from remote sensing images (RSI) of rivers (RSIR), especially on a global scale with various river features. Aiming at better water area classification using semantic information, this paper presents a segmentation method for global river monitoring based on semantic clustering and semantic fusion. Firstly, an encoder-decoder network (AEN)-based architecture is proposed to obtain the semantic features from RSIR. Secondly, a clustering-based semantic fusion method is proposed to divide semantic features of RSIR into groups and train convolutional neural networks (CNN) models corresponding to each group using data augmentation and semi-supervised learning. Thirdly, a semantic distance-based segmentation fusion method is proposed for fusing the CNN models result into final segmentation mask. We built a global river dataset that contains multiple river segments from each continent of the world based on Sentinel-2 satellite imagery. The result shows that the F1-score of the proposed segmentation method is 93.32%, which outperforms several state-of-the-art algorithms, and demonstrates that grouping semantic information helps better segment the RSIR in global scale.

关键词： convolution encoder-decoder network feature extraction remote sensing image of river semantic fusion semi-supervised learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：