Deep learning basedimage deraining has been widely researched. However, rain streaks are hard to differentiate with similar textures of background without context knowledge. In this paper, a novel Context-Aware Trans...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Deep learning basedimage deraining has been widely researched. However, rain streaks are hard to differentiate with similar textures of background without context knowledge. In this paper, a novel Context-Aware Transformer (CAT) is proposed for single image deraining where both local and global context within the input rainy image are utilized for better background reconstruction performance. The proposed CAT perceives a comprehensive context view through efficient self-attention mechanism and dilated convolutions in the Context-Aware Transformer Block (CATB). The Rain-Aware Feature Selection module (RAFS) generates feature blending coefficients adaptively to filter out rain streaks components and preserves clear background in hierarchical features of CAT. Meanwhile, a High-Frequency Preserved Loss (HFPL) provides further supervision on training and promotes reserving clearer structures and sharper details. Experiments on synthesized and real-world benchmarks illustrate the outstanding performance over state-of-the-art methods and pleasing visual results in various scenes.
To deploy neural networks on clinical edge devices, quantization is the most commonly used method to compress the models, which requires a calibration set of hundreds of real images. However, due to privacy concerns, ...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
To deploy neural networks on clinical edge devices, quantization is the most commonly used method to compress the models, which requires a calibration set of hundreds of real images. However, due to privacy concerns, the scarcity of private histopathological images hinders the application of quantization. To address this issue, we develop HIQ, a novel one-shot quantization framework for histopathological image classification networks, which requires only one real image per class for calibration. To compensate for data scarcity, sample BNS alignment is introduced to generate synthetic images with similar distribution to the real ones. To improve the diversity of synthetic images, fine-grained diversity enhancement that provides fine-grained enhancement intensity for different classes and network layers is proposed, based on the observation of the class-wise and layer-wise fine-grained data. Finally, the asymptotic enhancement strategy is highlighted to achieve a trade-off between inter-class distance and intra-class diversity of synthetic images, based on the insight of the smaller inter-class distance of histopathological images than that of natural ones. Extensive experiments on the BRACS dataset show that our method achieves an extremely low accuracy loss even compared to the full precision model in low-bit cases and maintains robustness when missing classes of real images.
This paper designs and implements a deep vision indoor positioning system with individual movement recognition, called DeepEyes, based on internet of Things localization. DeepEyes integrates deep learning with pedestr...
详细信息
ISBN:
(纸本)9798350304367;9798350304374
This paper designs and implements a deep vision indoor positioning system with individual movement recognition, called DeepEyes, based on internet of Things localization. DeepEyes integrates deep learning with pedestrian dead reckoning, enabling the recognition of individual walking distances and moving angles. DeepEyes addresses the accuracy challenges in step length and heading direction estimation encountered by pedestrian dead reckoning methods, which often result in positioning errors that accumulate over time, impacting subsequent localization. In DeepEyes, we design deep vision indoor positioning using existing surveillance cameras for high-precision real-time localization and correction. To the best of our knowledge, this is the first centimeter-level positioning solution to combine deep neural networks with pedestrian dead reckoning methods to recognize individual movement distances and heading angles. In particular, an Android-based prototype with web cameras is implemented to verify the feasibility and performance of our DeepEyes system.
Tracking image sources and verifying copyright information is crucial in digital media communication. Digital image watermarking technology, widely used for copyright protection and source tracking, faces challenges i...
详细信息
Segmenting the proximal femur from a fluoroscopy image is crucial for precise surgical treatment of proximal femur fracture. Accurate segmentation of the proximal femur helps the surgeon to insert screws and metal pin...
详细信息
Multimodal image fusion aims to merge features from different modalities to create a comprehensively representative image. However, existing medical image fusion methods often struggle to handle noise generated during...
详细信息
As the internet becomes increasingly image-centric, users face substantial challenges in efficiently locating desired images due to the inherent limitations of image search engines in interpreting visual content and k...
详细信息
In recent years, the widespread application of Transformers in multi-modal computer vision tasks has brought ncreasing attention to indoor scene image segmentation based on RGB-D images. Due to the significant differe...
详细信息
Information security is currently a major problem for data communication. Cryptography has emerged as a treatment and is essential to information security systems. Techniques for encryption can effectively prevent att...
详细信息
In recent years, there has been an increasing fascination in constructing intelligent agricultural systems. Implementing intelligent agricultural techniques has the potential to enhance crop yield while minimizing res...
详细信息
暂无评论