Active contour models (ACM) have achieved remarkable results in image segmentation. However, existing ACMs have some shortcomings, such as over-dependence on the initial contour, complex parameter adjustments, and dif...
详细信息
Active contour models (ACM) have achieved remarkable results in image segmentation. However, existing ACMs have some shortcomings, such as over-dependence on the initial contour, complex parameter adjustments, and difficulty in balancing segmentation accuracy and speed. To further improve the performance of ACM, this paper proposes an active contour model based on fuzzy c-means and local pre-fitting function (FLPF). Firstly, a linear weighted image is defined as a sample for the fuzzy c-means (FCM) clustering algorithm to pre-fit the image intensity. A pre-processing operation is proposed to improve the FCM clustering algorithm and increase the computation speed. Then, second-order differential data-driven terms based on the local pre-fitting energy are designed to guide the curve evolution rapidly and adaptively toward the target boundary. In addition, adaptive regularization functions are constructed to optimize and normalize the data-driven terms and level set functions, which improves the robustness of the proposed model. Finally, an improved parameter tuning framework based on the deep learning algorithm YOLOv5 is proposed for the FLPF model to achieve automated parameter adjustments. Compared to the other six models, our model has advantages in segmentation speed and accuracy, reducing the average segmentation time by 77.5% and improving the average segmentation accuracy by more than 7.6% of 10 images.
Semi-supervised learning (SSL) has been proven beneficial for mitigating the issue of limited labeled data, especially on volumetric medical image segmentation. Unlike previous SSL methods which focus on exploring hig...
详细信息
Semi-supervised learning (SSL) has been proven beneficial for mitigating the issue of limited labeled data, especially on volumetric medical image segmentation. Unlike previous SSL methods which focus on exploring highly confident pseudo-labels or developing consistency regularization schemes, our empirical findings suggest that differential decoder features emerge naturally when two decoders strive to generate consistent predictions. Based on the observation, we first analyze the treasure of discrepancy in learning towards consistency, under both pseudo-labeling and consistency regularization settings, and subsequently propose a novel SSL method called LeFeD, which learns the feature-level discrepancies obtained from two decoders, by feeding such information as feedback signals to the encoder. The core design of LeFeD is to enlarge the discrepancies by training differential decoders, and then learn from the differential features iteratively. We evaluate LeFeD against eight state-of-the-art (SOTA) methods on three public datasets. Experiments show LeFeD surpasses competitors without any bells and whistles, such as uncertainty estimation and strong constraints, as well as setting a new state of the art for semi-supervised medical image segmentation. Code has been released at https://***/maxwell0027/LeFeD.
In the actual production process of shale shakers, detecting the solid-liquid separation state of the screen surface faces numerous challenges, such as difficulty in recognizing the mud boundary, insufficient anti-int...
详细信息
In the actual production process of shale shakers, detecting the solid-liquid separation state of the screen surface faces numerous challenges, such as difficulty in recognizing the mud boundary, insufficient anti-interference ability, and misjudgment caused by background interference. To address these issues, this paper proposes a screen surface mud image segmentation method based on U2Net, namely CBAM-U2Net. By introducing the Convolutional Block Attention Module (CBAM) and combining it with Multi-layer Recursive Residual Blocks (RSU), a network structure is designed that can efficiently fuse global and local features, significantly improving segmentation accuracy and robustness. The network includes encoder and decoder parts, employing convolution, batch normalization, ReLU activation, and multi-scale feature fusion strategies. Experimental results show that the CBAM-U2Net method demonstrates excellent segmentation performance under various working conditions, achieving outstanding results with mIoU, F1-score, Precision, and Recall at 83.38%, 89.75%, 89.38%, and 92.64%, respectively, with significantly enhanced anti-interference capability. The CBAM-U2Net method provides an efficient and reliable solution for the intelligent monitoring of the solid-liquid separation state in shale shakers, offering significant practical application value.
The Segment Anything Model (SAM) excels in general segmentation but encounters difficulties in medical imaging due to few-shot learning challenges, particularly with extremely limited annotated data. Existing approach...
详细信息
The Segment Anything Model (SAM) excels in general segmentation but encounters difficulties in medical imaging due to few-shot learning challenges, particularly with extremely limited annotated data. Existing approaches often suffer from insufficient feature extraction and inadequate loss function balancing, resulting in decreased accuracy and poor generalization. To address these issues, we propose BiASAM, which uniquely incorporates two bidirectional attention mechanisms into SAM for medical image segmentation. Firstly, BiASAM integrates a spatial-frequency attention module to improve feature extraction, enhancing the model's ability to capture both fine and coarse details. Secondly, we employ an attention-based gradient update mechanism that dynamically adjusts loss weights, boosting the model's learning efficiency and adaptability in data-scarce scenarios. Additionally, BiASAM utilizes the point and box fusion prompt to enhance segmentation precision at both global and local levels. Experiments across various medical datasets show BiASAM achieves performance comparable to fully supervised methods with just two labeled samples.
Despite the evident advantages of variants of UNet in medical image segmentation, these methods still exhibit limitations in the extraction of foreground, background, and boundary features. Based on feature guidance, ...
详细信息
Despite the evident advantages of variants of UNet in medical image segmentation, these methods still exhibit limitations in the extraction of foreground, background, and boundary features. Based on feature guidance, we propose a new network (FG-UNet). Specifically, adjacent high-level and low-level features are used to gradually guide the network to perceive lesion features. To accommodate lesion features of different scales, the multi-order gated aggregation (MGA) block is designed based on multi-order feature interactions. Furthermore, a novel feature-guided context-aware (FGCA) block is devised to enhance the capability of FG-UNet to segment lesions by fusing boundary-enhancing features, object-enhancing features, and uncertain areas. Eventually, a bi-dimensional interaction attention (BIA) block is designed to enable the network to highlight crucial features effectively. To appraise the effectiveness of FG-UNet, experiments were conducted on Kvasir-seg, ISIC2018, and COVID-19 datasets. The experimental results illustrate that FG-UNet achieves a DSC score of 92.70% on the Kvasir-seg dataset, which is 1.15% higher than that of the latest SCUNet++, 4.70% higher than that of ACC-UNet, and 5.17% higher than that of UNet.
In this letter, we propose a novel lightweight X-ray image contraband segmentation network, XSNet, which integrates State Space Models (SSM) with Convolutional Neural Networks (CNNs) to achieve a significant trade-off...
详细信息
In this letter, we propose a novel lightweight X-ray image contraband segmentation network, XSNet, which integrates State Space Models (SSM) with Convolutional Neural Networks (CNNs) to achieve a significant trade-off between segmentation accuracy and lightweight design for computer-aided X-ray security check. The model is built based on the encoder-decoder framework. Specifically, we design an Multi-scale Convolution Fusion (MCF) block for multi-scale information extraction and a Dual-branch State Space Model (DSSM) block to relieve the bias caused by the imbalance of single branch structure in feature extraction and maintain the capabilities of SSM in modeling long range pixel dependencies. In addition, we present two versions of the model in two different sizes called XSNet-s and XSNet-l respectively. The quantitative and qualitative evaluations on the public PIDray and PIXray datasets both show the superiority of two models in terms of mean Intersection over Union (mIoU) and FLOPs.
Medical image segmentation demands the aggregation of global and local feature representations, posing a challenge for current methodologies in handling both long-range and short-range feature interactions. Recently, ...
详细信息
Manual labeling of lesions in medical image analysis presents a significant challenge due to its labor-intensive and inefficient nature, which ultimately strains essential medical resources and impedes the advancement...
详细信息
Manual labeling of lesions in medical image analysis presents a significant challenge due to its labor-intensive and inefficient nature, which ultimately strains essential medical resources and impedes the advancement of computer-aided diagnosis. This paper introduces a novel medical image-segmentation framework named Efficient Generative-Adversarial U-Net (EGAUNet), designed to facilitate rapid and accurate multi-organ labeling. To enhance the model's capability to comprehend spatial information, we propose the Global Spatial-Channel Attention Mechanism (GSCA). This mechanism enables the model to concentrate more effectively on regions of interest. Additionally, we have integrated Efficient Mapping Convolutional Blocks (EMCB) into the feature-learning process, allowing for the extraction of multi-scale spatial information and the adjustment of feature map channels through optimized weight values. Moreover, the proposed framework progressively enhances its performance by utilizing a generative-adversarial learning strategy, which contributes to improvements in segmentation accuracy. Consequently, EGAUNet demonstrates exemplary segmentation performance on public multi-organ datasets while maintaining high efficiency. For instance, in evaluations on the CHAOS T2SPIR dataset, EGAUNet achieves approximately 2% higher performance on the Jaccard metric, 1% higher on the Dice metric, and nearly 3% higher on the precision metric in comparison to advanced networks such as Swin-Unet and TransUnet.
Medical image segmentation is very important for the diagnosis of related diseases. To reduce the labeling work of related medical images, numerous models based on U-Net have been proposed to achieve automatic segment...
详细信息
By the end of 2023, Athere were approximately 220 million elderly patients aged 60 and above in China, with the incidence of stroke increasing significantly with age. The incidence rate for those over 75 is 5 to 8 tim...
详细信息
By the end of 2023, Athere were approximately 220 million elderly patients aged 60 and above in China, with the incidence of stroke increasing significantly with age. The incidence rate for those over 75 is 5 to 8 times higher than that of individuals aged 45-55. AElderly strokes typically have an acute onset and rapid progression, making early detection critical for prognosis. Medical research has shown that left ventricular hypertrophy (LVH) on an electrocardiogram (ECG) is an independent risk factor for stroke inApatients. Therefore, this study aims to develop an intelligent diagnostic model for stroke in elderly patients. First, we analyze 12-lead ECG data from health check-ups of elderly patients over 60 years old to construct a LVH classification model. This model, based on convolutional neural networks (CNN) Aand Transformer networks, extracts ECG features from both local waveform characteristics and global long-range dependencies. The fusion of abnormal ECG features improves the model's ability to identify specific LVH rhythm types associated with certain leads, while the inclusion of global context information optimizes model performance. Experiments demonstrate that the model, tested on a self-built dataset, achieves sensitivity, specificity, accuracy, and F1 score of 0.81, 0.92, 0.87, Aand 0.91, Arespectively, with an AUC of 0.91. ASubsequently, we integrate MRI image segmentation technology to assist doctors in diagnosing lesion areas. We propose an MRI image segmentation model based on an improved UNet network with an attention mechanism. Experimental results show that the stroke image segmentation algorithm proposed in this study achieves an accuracy of 98.78%, Asensitivity of 92.03%, Aand specificity of 96.58%. AThe research in this paper can assist doctors in clinical decision-making by first detecting potential elderly LVH patients through ECG data and then using MRI image segmentation algorithms to assist in the precise diagnosis of stroke lesions,
暂无评论