Convolutional neural Networks (CNNs) have exhibited considerable success in the realm of cervical cytopathology image classification, owing to their efficient design. We find that existing CNN-based cervical cytopatho...
详细信息
Convolutional neural Networks (CNNs) have exhibited considerable success in the realm of cervical cytopathology image classification, owing to their efficient design. We find that existing CNN-based cervical cytopathology classification methods fail to fully exploit the cell morphology and nucleus information. To address the above problems, we propose an efficient network called Pyramid Convolutional Mixer. We capture multi-scale subtle morphology features at the cellular level and convey nuclear neighborhood spatial information by integrating convolutional operations within the transformer structure. PCMixer contains two key modules, i.e. pyramid morphology module (PMM) and nuclear spatial mixing block (NSMB) to retrieve cervical cytopathology information. PMM is characterized by a multi-scale pyramid architecture employing a convolutional layer and a local encoder to generate local morphology information at each scale. In addition, NSMB operates on the input patches to separate the mixing of spatial and channel dimensions to encode nuclear neighborhood spatial information. We intend to unveil a more intricate cervical cytopathology dataset: Cervical Cytopathology image Dataset (CCID). We achieve a classification accuracy of 89.62% along with precision, recall and F1 score of 82.76%, 85.97% and 84.15% respectively on the CCID dataset. Also, we use cervical cytopathology images from the publicly available SIPaKMeD dataset. We obtain 96.21%, 95.70% 95.60% and 95.30% respectively for the four metrics. Through comprehensive experiments conducted on two real-world datasets, our proposed model demonstrates superior performance compared to state-of-the-art cervical cytopathology classification models. The results demonstrate that our method can significantly assist cytopathologists in appropriately evaluating cervical smears.
This paper deals with the state prediction of nonlinear stochastic dynamic systems. The emphasis is laid on a solution to the integral Chapman-Kolmogorov equation by a deterministic-integration-rule-based point-mass m...
详细信息
This paper deals with the state prediction of nonlinear stochastic dynamic systems. The emphasis is laid on a solution to the integral Chapman-Kolmogorov equation by a deterministic-integration-rule-based point-mass method. A novel concept of reliable data-augmented, i.e., mathematics- and data-informed, integration rule is developed to enhance the point-mass state predictor, where the trained neural network (representing data contribution) is used for the selection of the best integration rule from a set of available rules (representing mathematics contribution). The proposed approach combining the best properties of the standard mathematics-informed and novel data-informed rules is thoroughly discussed. Copyright (c) 2024 The Authors.
In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self- atte...
详细信息
In recent years, convolutional neural networks and vision transformers have emerged as predominant models for hyperspectral remote sensing image classification task, leveraging staked convolution layers and self- attention mechanisms with high computation costs, respectively. Recent studies, such as the Mamba model, have showcased the ability of state space model (SSM) with efficient hardware-aware designs in efficiently modeling sequences and extracting implicit features along tokens, which is precisely needed for accurate hyperspectral image (HSI) classification. Thus making SSM-based model potentially a new architecture for remote sensing HSI classification task. However, SSM encounters challenges in modeling HSI due to the insensitivity of spatial information and redundant spectral characteristics. Given SSM-based methods rarely explored in HSI classification, in this work, we present the first exploration of SSM-based models for HSI classification task. Our proposed method MamTrans effectively leverages the capacity of transformer for capturing spatial tokens relationships and Mamba for extracting implicit features along tokens. Besides, we propose a Bidirectional Mamba Module to enhance SSM's spatial perception ability of extracting spatial features in HSI. Our proposed MamTrans obtains a new state-of-the-art performance across five commonly employed HSI classification benchmarks, demonstrating the robust generalization of MamTrans and effectiveness of SSM-based structure for HSI classification task. Our codes could be found at https://***/PPPPPsanG/MamTrans.
We present a neural network able to fully linearise an OCT image without any a priori knowledge about the spectrometer characteristics or the extent of dispersion in the interferometer and the object. Unlike the earli...
详细信息
ISBN:
(纸本)9781510669208;9781510669192
We present a neural network able to fully linearise an OCT image without any a priori knowledge about the spectrometer characteristics or the extent of dispersion in the interferometer and the object. Unlike the earlier solutions, this blind linearisation is not biased towards a specific object, nor its dispersion characteristics, and in the future can be made independent of the light source parameters.
We propose a method that exploits the feedback provided by visual explanation methods combined with pattern mining techniques to identify the relevant class-specific and class-shared internal units. In addition, we pu...
详细信息
ISBN:
(纸本)9781728198354
We propose a method that exploits the feedback provided by visual explanation methods combined with pattern mining techniques to identify the relevant class-specific and class-shared internal units. In addition, we put forward a patch extraction approach to find faithfully class-specific and class-shared visual patterns. Contrary to the common practice in literature, our approach does not require pushing augmented visual patches through the model. Experiments on two CNN architectures show the effectiveness of the proposed method.
We introduce a new method based on nonnegative matrix factorization, neural NMF, for detecting latent hierarchical structure in data. Datasets with hierarchical structure arise in a wide variety of fields, such as doc...
详细信息
We introduce a new method based on nonnegative matrix factorization, neural NMF, for detecting latent hierarchical structure in data. Datasets with hierarchical structure arise in a wide variety of fields, such as document classification, imageprocessing, and bioinformatics. neural NMF recursively applies NMF in layers to discover overarching topics encompassing the lower-level features. We derive a backpropagation optimization scheme that allows us to frame hierarchical NMF as a neural network. We test neural NMF on a synthetic hierarchical dataset, the 20 Newsgroups dataset, and the MyLymeData symptoms dataset. Numerical results demonstrate that neural NMF outperforms other hierarchical NMF methods on these data sets and offers better learned hierarchical structure and interpretability of topics.
Motivation: Hemodynamic analysis is crucial for diagnosing and predicting cardiovascular diseases. However, methods relying on fluid flow simulations or blood flow imaging are complex, time-consuming, and require spec...
详细信息
Motivation: Hemodynamic analysis is crucial for diagnosing and predicting cardiovascular diseases. However, methods relying on fluid flow simulations or blood flow imaging are complex, time-consuming, and require specialized expertise, limiting their clinical use. Goal: This research aims to automate the enhancement of blood flow images, providing clinicians with a fast, accurate tool for hemodynamic analysis without requiring advanced expertise. Objectives: A software tool based on physics-constrained neural networks was developed to enable clinicians to easily select and process regions of interest (ROIs) in time-resolved three-dimensional phase contrast magnetic resonance imaging (4D-Flow MRI) blood flow images for quick, accurate analysis. methods: The Input Parameterized Physics-Informed neural Network (IP-PINN) was introduced to improve the spatio-temporal resolution of 4D-Flow MRI. IP-PINN mitigates noise, velocity aliasing, and phase errors. A convolutional neural network processes ROI data into latent vectors, which are then used to predict velocity, pressure, and spin density via a multi-layer perceptron. The method is trained with synthetic blood flow data using an innovative loss function that addresses noise and artifacts. Results: IP-PINN successfully enhanced image resolution, reducing noise and artifacts when tested on synthetic 4D-Flow MRI data derived from blood flow simulations of intracranial aneurysms. For data with 20 decibels (dB) signal-to-noise ratio, results closely matched the ground truth with less than 5.5% relative error. processing took under two minutes. The method also has the potential to reduce data acquisition time by 25%. Conclusions: IP-PINN could significantly enhance the clinical use of 4D-Flow MRI for personalized hemodynamic analysis in cardiovascular diseases.
Breast cancer is a significant health concern that remains one of the leading causes of mortality in women worldwide. Convolutional neural Networks (CNNs) have been shown to be effective in ultrasound breast image seg...
详细信息
Breast cancer is a significant health concern that remains one of the leading causes of mortality in women worldwide. Convolutional neural Networks (CNNs) have been shown to be effective in ultrasound breast image segmentation. Yet, because of the lack of long-distance dependence, the segmentation performance of CNNs is limited in addressing challenges typical of segmentation of ultrasound breast lesions, such as similar intensity distributions, the presence of irregular objects, and blurred boundaries. In order to overcome these issues, several studies have combined transformers and CNNs, to compensate for the shortcomings of CNNs with the ability of transformers to exploit long-distance dependence. Most of these studies limited themselves to rigidly plug transformer blocks into the CNN, lacking consistency in the process of feature extraction and therefore leading to poor performances in segmenting challenging medical images. In this paper, we propose HAU-Net(hierarchical attention-guided U-Net), a hybrid CNN-transformer framework that benefits from both the long-range dependency of transformers and the local detail representation of CNNs. To incorporate global context information, we introduce a L-G transformer block nested into the skip connections of the U shape architecture network. In addition, to further improve the segmentation performance, we added a cross attention block (CAB) module on the decoder side to allow different layers to interact. Extensive experimental results on three public datasets indicate that the proposed HAU-Net can achieve better performance than other state-of-the-art methods for breast lesions segmentation, with Dice coefficient of 83.11% for BUSI, 88.73% for UDIAT, and 89.48% for BLUI respectively.
Parkinson's disease (PD) is a neurodegenerative disorder affecting millions globally. Current diagnostic methods lack precision. In this research, an approach combining Histogram of Oriented Gradients (HOG) with a...
详细信息
Existing methods for Deep neural Networks (DNN) watermarking either require accessing the internal parameters of the DNN models (white-box watermarking), or rely on backdooring to enforce a desired behavior of the mod...
详细信息
Existing methods for Deep neural Networks (DNN) watermarking either require accessing the internal parameters of the DNN models (white-box watermarking), or rely on backdooring to enforce a desired behavior of the model when the DNN is fed with a specific set of key input images (black-box watermarking). In this letter, we propose a black-box multi-bit DNN watermarking algorithm, suitable for multiclass classification networks, whereby the presence of the watermark can be retrieved from the output of the network in correspondence to any input. To read the watermark, we first apply a power function to the softmax output of the DNN model to map it from an impulse-like to a smooth distibution. Then, we extract the watermark bits by projecting the output of the DNN onto a pseudorandom key vector. Watermark embedding is achieved by adding a proper regularizer term to the training loss. The effectiveness of the proposed method is demonstrated by applying it to various network architectures working on different datasets. The experimental results demonstrate the possibility to embed a robust watermark into the output of the host DNN with a negligible impact on the accuracy of the original task.
暂无评论