Biomedical image segmentation has been widely studied, and lots of methods have been proposed. Among these methods, attention U-Net has achieved a promising performance. However, it has drawbacks of extracting the mul...
ISBN:
(数字)9781509066315
ISBN:
(纸本)9781509066322
Biomedical image segmentation has been widely studied, and lots of methods have been proposed. Among these methods, attention U-Net has achieved a promising performance. However, it has drawbacks of extracting the multi-scaled receptive field features at the high-level feature maps, resulting in the degeneration when dealing with the lesions with apparent scale variations. To solve this problem, this paper integrates an atrous spatial pyramid pooling (ASPP) module in the contracting path of attention U-Net. This module employs multiple dilation rates for the purpose of obtaining several multi-scale receptive fields, which significantly improves the networks' ability to handle both large and small lesions. Evaluation experimental result shows that our approach significantly improves the performance of medical image segmentation and substantially outperforms the representative deep learning models on public datasets.
High-dimensional quantum systems can offer extended possibilities and multiple advantages while developing advanced quantum technologies. In this paper, we propose a class of quantum-walk ar-chitecture networks that a...
详细信息
It is very challenging for speech enhancement methods to achieves robust performance under both high signal-to-noise ratio (SNR) and low SNR simultaneously. In this paper, we propose a method that integrates an SNR-ba...
详细信息
As for an automatic text classification approach, a large body of research on latent-topic based Dataless Text Classification (DTC) has been emerged in recent years. Perusing the candidate seed words or guaranteeing t...
详细信息
ISBN:
(数字)9781728192284
ISBN:
(纸本)9781728185361
As for an automatic text classification approach, a large body of research on latent-topic based Dataless Text Classification (DTC) has been emerged in recent years. Perusing the candidate seed words or guaranteeing the quality of the category-topics is the core mission of this approach. However, few previous studies consider the quality of specific category-topics at the collection level instead at the document level, because not all topics are equally coherent or category sparsity. In this paper, we focus on alleviating the dilemma for the seed words selection problem in DTC by using pseudo text understanding. Differently from the existing latent-topic based DTC approach, we propose an unsupervised method named Pseudo Document Labeled Classification (PDLC). It extracts the most representative word list to capture the best latent semantic category-topic description. Experimental results indicate that our PDLC scheme achieves better classification accuracy without any labeled data or external resource.
In the image inpainting task, the ability to repair both high-frequency and low-frequency information in the missing regions has a substantial influence on the quality of the restored image. However, existing inpainti...
详细信息
Scene text image super-resolution aims to enhance the resolution of images containing text in various scenes, which amplifies the prominence of the text and improves its recognizability. Existing methods struggle to a...
详细信息
Scene text image super-resolution aims to enhance the resolution of images containing text in various scenes, which amplifies the prominence of the text and improves its recognizability. Existing methods struggle to accurately localize text regions in high-noise environments, which hampers their ability to effectively implement targeted super-resolution. To address these issues, we propose a Text image Super-resolution Semantic-aware Interaction Network (TSSIN) by embedding a text region segmentation network. First, we propose to use a pre-trained text region segmentation network (TRSN) to extract text region information. This approach semantically guides our model to address the challenge of the model not performing targeted super-resolution processing of text in high-noise environments. Secondly, we propose a multi-modal semantic information interaction module (MSIIM) to mitigate the issue of insufficient global information exchange. Comprehensive experiments conducted on the TextZoom dataset demonstrate that our TSSIN significantly enhances image quality. Furthermore, it shows a clear superiority over state-of-the-art methods on TextZoom, achieving an average text recognition accuracy improvement of +1.0% over Transformer-Based Super-Resolution Network (TBSRN) (49.6%, 56.2%, 60.1% vs. 48.1%, 55.9%, 58.9%), +0.83% over Parallelly Contextual Attention Network (PCAN), and +1.46% over Text Prior Guided Super-Resolution (TPGSR), as evaluated by three pre-trained text recognition models. Code is available at https://***/ads2d/TSSIN .
Crowd counting is an important task that shown great application value in public safety-related fields, which has attracted increasing attention in recent years. In the current research, the accuracy of counting numbe...
详细信息
Numerical analysis were performed to study venting explosion load of methane-air in vented chamber. The effects of venting pressure and ignition location on indoor gas explosion overpressure are discussed. The simulat...
详细信息
With the growing awareness of privacy protection, the security of image data during network transmission has become a focal point of concern. Reducing computational complexity and improving transmission efficiency whi...
With the growing awareness of privacy protection, the security of image data during network transmission has become a focal point of concern. Reducing computational complexity and improving transmission efficiency while meeting high-security requirements has become a primary focus of current research. To address this, we propose a novel image privacy protection scheme that combines compressive sensing with chaotic encryption, aiming to ensure image privacy and security while enhancing transmission and storage efficiency. We employ compressive sensing technology to achieve efficient compression of image data. Unlike traditional compression and encryption schemes, the proposed method does not require explicit sparsification preprocessing, thereby avoiding the complex operations introduced by signal transformations and simplifying the signal recovery process. To enhance encryption security, a four-dimensional hyperchaotic system with stronger chaotic properties is designed to generate highly random and unpredictable key streams, ensuring the security of the encrypted data. Furthermore, this paper combines disjoint Latin squares with fractal generation strategies to design a new fractal index matrix, based on which a novel image permutation scheme is proposed. This scheme effectively eliminates the linear relationships and correlations between adjacent pixels, achieving global pixel permutation. Coupled with the proposed dual-channel bidirectional diffusion structure, the algorithm effectively diffuses pixel information across the entire image, increasing the complexity and unpredictability of the image encryption process. Experimental results indicate that the proposed algorithm exhibits excellent performance in terms of compression efficiency, encryption effectiveness, and resistance to attacks, providing a highly efficient and reliable solution in the field of image compression and encryption.
暂无评论