In recent years, deep learning (DL)-based registration technology has significantly improved the calculation speed of medical image registration. Existing DL-based registration methods generally use raw data features ...
详细信息
In recent years, deep learning (DL)-based registration technology has significantly improved the calculation speed of medical image registration. Existing DL-based registration methods generally use raw data features to predict the deformation field. However, this strategy may not be very effective for difficult registration tasks. Hence, in this study, we propose a similarity attention-based convolutional neural network (CNN) for accurate and robust three-dimensional medical image registration. We first introduce a similarity-based local attention model as an auxiliary module for building a displacement searching space, instead of a direct displacement prediction based on raw data. The proposed model can help the network focus on spatial correspondences with high similarities and ignore those with low similarities. A multi-scale CNN is then integrated with the similaritybased local attention for providing non-local attention, lightweight network, and coarse-to-fine registration. We evaluated the proposed method for various applications, such as the registration of large-scope abdominal computerized tomography (CT) images and chest CT images acquired at different respiratory phases, and atlas registration in magnetic resonance imaging. The experimental results demonstrate that the proposed method can provide a more accurate and robust registration performance than state-of-the-art registration methods.
Fault diagnosis based on vibration signals is widely used in various industrial production processes to prevent catastrophic accidents and ensure timely repairs. However, traditional fault diagnosis methods provide li...
详细信息
Fault diagnosis based on vibration signals is widely used in various industrial production processes to prevent catastrophic accidents and ensure timely repairs. However, traditional fault diagnosis methods provide limited accuracy due to their inability to visually represent fault characteristics and heavy reliance on expert involvement. In this study, we propose a time series image generation scheme that incorporates a convolutional neural network (CNN) to perform data-driven analysis and open-set classification on fault data to represent faults intuitively and diagnose them more accurately. In this study, based on the basic G image, we proposed two series of extended coding methods, gray-transform (G-TF) and transform-gray (TF-G) to convert the 1-D time series signal into a grayscale image from different perspectives for figurative expression, so as to adapt to different fault diagnosis scenarios. Then, a CNN is designed to maximize recognition accuracy based on faster calculation. To demonstrate the effectiveness of the proposed method, three types of faults are tested in this study, i.e., faults with motor bearings, self-priming centrifugal pumps, and hydraulic pumps. The experimental results demonstrate that, when the most suitable method is employed, the proposed approach achieves better recognition performance in diagnosing these three types of faults. The proposed method provides a new solution for future fault diagnosis tasks.
Compared to natural images, hyperspectral images (HSIs) consist of a large number of bands, with each band capturing different spectral information from a certain wavelength, even some beyond the visible spectrum. The...
详细信息
Compared to natural images, hyperspectral images (HSIs) consist of a large number of bands, with each band capturing different spectral information from a certain wavelength, even some beyond the visible spectrum. These characteristics of HSIs make them highly effective for remote sensing applications. That said, the existing hyperspectral imaging devices introduce severe degradation in HSIs. Hence, hyperspectral image denoising has attracted lots of attention by the community lately. While recent deep HSI denoising methods have provided effective solutions, their performance under real-life complex noise remains suboptimal, as they lack adaptability to new data. To overcome these limitations, in our work, we introduce a self-modulating convolutional neural network which we refer to as SM-CNN, which utilizes correlated spectral and spatial information. At the core of the model lies a novel block, which we call spectral self-modulating residual block (SSMRB), that allows the network to transform the features in an adaptive manner based on the adjacent spectral data, enhancing the network's ability to handle complex noise. In particular, the introduction of SSMRB transforms our denoising network into a dynamic network that adapts its predicted features while denoising every input HSI with respect to its spatio-spectral characteristics. Experimental analysis on both synthetic and real data shows that the proposed SM-CNN outperforms other state-of-the-art HSI denoising methods both quantitatively and qualitatively on public benchmark datasets. Our code will be available at https://***/orhan-t/SM-CNN.
A stochastic binary-ternary (SBT) quantization approach is introduced for communication efficient federated computation;form of collaborative computing where locally trained models are exchanged between institutes. Co...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
A stochastic binary-ternary (SBT) quantization approach is introduced for communication efficient federated computation;form of collaborative computing where locally trained models are exchanged between institutes. Communication of deep neural network models could be highly inefficient due to their large size. This motivates model compression in which quantization is an important step. Two well-known quantization algorithms are binary and ternary quantization. The first leads into good compression, sacrificing accuracy. The second provides good accuracy with less compression. To better benefit from trade-off between accuracy and compression, we propose an algorithm to stochastically switch between binary and ternary quantization. By combining with uniform quantization, we further extend the proposed algorithm to a hierarchical method which results in even better compression without sacrificing the accuracy. We tested the proposed algorithm using neural network Compression Test Model (NCTM) provided by MPEG community. Our results demonstrate that the hierarchical variant of the proposed algorithm outperforms other quantization algorithms in term of compression, while maintaining the accuracy competitive to that provided by other methods.
Calcium imaging is susceptible to motion distortions and background noises, particularly for monitoring active animals under low-dose laser irradiation, and hence unavoidably hinder the critical analysis of neural fun...
详细信息
Calcium imaging is susceptible to motion distortions and background noises, particularly for monitoring active animals under low-dose laser irradiation, and hence unavoidably hinder the critical analysis of neural functions. Current research efforts tend to focus on either denoising or dewar ping and do not provide effective methods for videos distorted by both noises and motion artifacts simultaneously. We found that when a self-supervised denoising model of DeepCAD [Nat. methods 18 , 1359 (2021)] is used on the calcium imaging contaminated by noise and motion warping, it can remove the motion artifacts effectively but with regenerated noises. To address this issue, we develop a two-level deep-learning (DL) pipeline to dewarp and denoise the calcium imaging video sequentially. The pipeline consists of two 3D self-supervised DL models that do not require warp-free and high signal-to-noise ratio (SNR) observations for network optimization. Specifically, a high-frequency enhancement block is presented in the denoising network to restore more structure information in the denoising process;a hierarchical perception module and a multi-scale attention module are designed in the dewar ping network to tackle distortions of various sizes. Experiments conducted on seven videos from two-photon and confocal imaging systems demonstrate that our two-level DL pipeline can restore high-clarity neuron images distorted by both motion warping and background noises. Compared to typical DeepCAD, our denoising model achieves a significant improvement of approximately 30% in image resolution and up to 28% in signal-to-noise ratio;compared to traditional dewar ping and denoising methods, our proposed pipeline network recovers more neurons, enhancing signal fidelity and improving data correlation among frames by 35% and 60% respectively. This work may provide an attractive method for long-term neural activity monitoring in awake animals and also facilitate functional analysis of neural circ
Classic and deep learning-based generalized canonical correlation analysis (GCCA) algorithms seek low-dimensional common representations of data entities from multiple "views" (e.g., audio and image) using l...
详细信息
Classic and deep learning-based generalized canonical correlation analysis (GCCA) algorithms seek low-dimensional common representations of data entities from multiple "views" (e.g., audio and image) using linear transformations and neural networks, respectively. When the views are acquired and stored at different computing agents (e.g., organizations and edge devices) and data sharing is undesired due to privacy or communication cost considerations, federated learning-based GCCA is well-motivated. In federated learning, the views are kept locally at the agents and only derived, limited information exchange with a central server is allowed. However, applying existing GCCA algorithms onto such setting may incur prohibitively high communication overhead. This work puts forth a communication-efficient federated learning framework for both linear and deep GCCA under the maximum variance (MAX-VAR) formulation. The overhead issue is addressed by aggressively compressing (via quantization) the exchanging information between the computing agents and a central controller. Our synthetic and real-data experiments shows that the proposed algorithm enjoys a substantial reduction of communication overheads with virtually no loss in accuracy and convergence speed compared to the unquantized version. Rigorous convergence analyses are also presented, which is a nontrivial effort since generic federated optimization results do not cover the special problem structure of GCCA. Our result shows that the proposed algorithms for both linear and deep GCCA converge to critical points at a sublinear rate, even under heavy quantization and stochastic approximations. In addition, in the linear MAX-VAR case, the quantized algorithm approaches a global optimum in a geometric rate under reasonable conditions.
COVID-19 infection detection through initial lesion classification provides early diagnosis and prevents breathing difficulties. Detecting the infectious part of the lungs using computerized tomography (CT) images has...
详细信息
COVID-19 infection detection through initial lesion classification provides early diagnosis and prevents breathing difficulties. Detecting the infectious part of the lungs using computerized tomography (CT) images has become an instantaneous detection method. In this article, a Hybrid Classification Optimization (HCO) using Recurrent Leaning and Fuzzy (RLF) is proposed. The neural network classifies infected and non-infected regions using pixel distributions and their variations. This is performed by identifying missing features and training the recurrent network using regional differences. Based on the feature availability, recurrent learning classifies the region through the input dataset and recurrent training correlation. The fuzzy predicts missing features through substituted derivatives obtained between wide ranges of variations. In this process, the maximum fuzzy derivatives for feature substitution are used for infected region prediction. The least fuzzy derivatives are prevented from the training layer to reduce false rates in region classification. This joint process improves the training consistency to leverage the detection and region classification accuracies. The proposed HCO-RLF improves detection and classification accuracy, and precision by 11.96 %, 9.98 %, and 13.42 % for the varying classification rates. Besides, the results are obtained in comparison with the existing DR-MIL, DSAE, and BS-FSA methods discussed later in the article.
In goal-oriented communications, the objective of the receiver is often to apply a Deep-Learning model, rather than reconstructing the original data. In this context, direct learning over compressed data, without any ...
详细信息
ISBN:
(纸本)9789464593617;9798331519773
In goal-oriented communications, the objective of the receiver is often to apply a Deep-Learning model, rather than reconstructing the original data. In this context, direct learning over compressed data, without any prior decoding, holds promise for enhancing the time-efficient execution of inference models at the receiver. However, conventional entropic-coding methods like Huffman and Arithmetic break data structure, rendering them unsuitable for learning without decoding. In this paper, we propose an alternative approach in which entropic coding is realized with Low-Density Parity Check (LDPC) codes. We hypothesize that Deep Learning models can more effectively exploit the internal code structure of LDPC codes. At the receiver, we leverage a specific class of Recurrent neural Networks (RNNs), specifically Gated Recurrent Unit (GRU), trained for image classification. Our numerical results indicate that classification based on LDPC-coded bit-planes surpasses Huffman and Arithmetic coding, while necessitating a significantly smaller learning model. This demonstrates the efficiency of classification directly from LDPC-coded data, eliminating the need for any form of decompression, even partial, prior to applying the learning model.
In this paper, we develop methods for efficient and accurate information extraction from calcium-imaging-based neuralsignals. The particular form of information extraction we investigate involves predicting behavior ...
详细信息
In this paper, we develop methods for efficient and accurate information extraction from calcium-imaging-based neuralsignals. The particular form of information extraction we investigate involves predicting behavior variables linked to animals from which the calcium imaging signals are acquired. More specifically, we develop algorithms to systematically generate compact deep neural network (DNN) models for accurate and efficient calcium-imaging-based predictive modeling. We also develop a software tool, called NeuroGRS, to apply the proposed methods for compact DNN derivation with a high degree of automation. GRS stands for Greedy inter-layer order with Random Selection of intra-layer units, which describes the central algorithm developed in this work for deriving compact DNN structures. Through extensive experiments using NeuroGRS and calcium imaging data, we demonstrate that our methods enable highly streamlined information extraction from calcium images of the brain with minimal loss in accuracy compared to much more computationally expensive approaches.
This study presents a versatile approach integrating various neural network architectures with a focus on classifying architectural works. To address the lack of suitable datasets in the literature, a custom dataset h...
详细信息
ISBN:
(纸本)9798350388978;9798350388961
This study presents a versatile approach integrating various neural network architectures with a focus on classifying architectural works. To address the lack of suitable datasets in the literature, a custom dataset has been created and made publicly available. The study aims to determine the most effective model, taking into account fine details in architectural styles. In this context, a comparative analysis has been conducted on four different convolutional neural network (CNN) architectures, including a baseline model trained from scratch and models using transfer learning methods with VGG, ResNet, and EfficientNet architectures. Through experiments, the EfficientNet architecture was fine-tuned, achieving an accuracy of %84.65 for 3 architects and %74.08 for 16 architects. Additionally, the two obtained models were used as feature extractors to visualize relationships among architects in a 2D space using t-SNE dimension reduction technique. These promising results indicate that these techniques can significantly contribute to architectural style analysis and serve as valuable tools for creating innovative designs through the use of generative artificial intelligence.
暂无评论