Face recognition systems are essential in practically every industry in our digital age. One biometric that is frequently utilized is face recognition. It is helpful for security and has a ton of other advantages, ide...
详细信息
Scene text detection methods based on segmentation have been extensively employed in the area of scene text detection. However, for tasks involving dense text, multi-directional texts and extreme aspect ratios, existi...
详细信息
Extracting local features for accurate correspondences between image pairs is an essential basis for various computer vision tasks. Recent works have shown that deep neural networks (DNNs) have demonstrated promising ...
详细信息
ISBN:
(纸本)9798350342734
Extracting local features for accurate correspondences between image pairs is an essential basis for various computer vision tasks. Recent works have shown that deep neural networks (DNNs) have demonstrated promising performance in challenging environments. However, these state-of-the-art DNN-based approaches are not well suited for the scenario with geometry rotations due to their intrinsic deficiencies of square kernel structure. That is, square kernel structures in standard DNNs cannot fully identify the essentials of the rotations in geometry. To address this problem, we present RICNN, a novel deep learning framework that encodes invariance against the rotations in geometry explicitly into convolutional neural networks. Rather than using the square-shaped kernel structure, RICNN adopts sector-shaped convolutional kernels to achieve encoding invariance in all rotations. With the explicitness of such rotation encoding, RICNN enables the transfer of perspective DNN models to obtain rotation-invariant descriptions. Furthermore, we propose a novel multi-level hinge triplet loss function to strengthen the matching constraints against geometry rotations. Comprehensive experiments demonstrate the strong generalization ability of the RICNN descriptor on the HPatches dataset. Toward the rotation invariance evaluation, our method shows state-of-the-art results.
Polyp segmentation in colonoscopy is crucial for detecting colorectal cancer. However, it is challenging due to variations in the structure, color, and size of polyps, as well as the lack of clear boundaries with surr...
详细信息
Feature selection is an important step in data mining and pattern recognition tasks. In unsupervised cases, feature selection becomes more difficult due to the lack of labels in the samples. Therefore, this paper prop...
详细信息
The growing complexity and scale of Industrial Internet of Things (IIoT) networks have made them increasingly vulnerable to cyber-attacks, underscoring the need for reliable and precise Intrusion Detection Systems (ID...
详细信息
Synchronizing the system time between devices by exchanging timestamped messages over the network is a popular method to achieve time-consistency in distributed applications. Accurate time synchronization is essential...
详细信息
In recent times, face biometric systems have recognized widely and got attention in computer vision. Further, the biometric face recognition systems are susceptible for face spoofing attacks, where an attacker uses a ...
详细信息
Payment Channel networks (PCNs) can significantly enhance the scalability of blockchain transactions without requiring major modifications to the underlying distributed ledger protocol. However, to enable efficient an...
详细信息
Image inpainting has made significant progress benefiting from the advantages of convolutional neural networks (CNNs). Deep learning-based methods have shown extraordinary performance in this field. In this paper, we ...
详细信息
ISBN:
(纸本)9781728198354
Image inpainting has made significant progress benefiting from the advantages of convolutional neural networks (CNNs). Deep learning-based methods have shown extraordinary performance in this field. In this paper, we propose a novel image inpainting architecture with pure CNN that can jointly reconstruct the structure and texture of the image. Our generative network architecture (TSFC) consists of two parallel stages: structure generation and texture generation. In the structure generation stage, we use the large convolution kernel, which is highly neglected in modern networks, using the effective perceptual field of the large convolution kernel to enhance the perception of overall structural features. In the texture generation stage, we use the small convolution kernel to extract local texture features. Qualitative and quantitative experimental results on CelebA-HQ and Paris Street View datasets demonstrate the effectiveness and superiority of our method.
暂无评论