ISBN: 9798350307184 (print)
Removing moire patterns from videos recorded on screens or complex textures is known as video demoireing. The task is challenging because the structures and textures of an image often exhibit strong periodic patterns of their own, which are easily confused with moire patterns and can be substantially erased during removal. By interpreting video demoireing as a multi-frame decomposition problem, we propose a compact invertible dyadic network called CIDNet that progressively decouples latent frames and moire patterns from an input video sequence. Using a dyadic cross-scale coupling structure with coupling layers tailored for multi-scale processing, CIDNet aims to disentangle the features of image patterns from those of moire patterns at different scales, while retaining all latent image features to facilitate reconstruction. In addition, a compressed form for the network's output is introduced to reduce computational complexity and alleviate overfitting. Experiments show that CIDNet outperforms existing methods and offers advantages in model size and computational efficiency.
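As a rough illustration of why an invertible network can retain all latent features, the sketch below implements a generic additive coupling layer in plain Python. It is not CIDNet's coupling layer, which the abstract does not specify; the `shift` function is a hypothetical stand-in for a learned sub-network, and the input is assumed to have even length.

```python
import math

def shift(x1):
    # Hypothetical stand-in for a learned sub-network; it does not need
    # to be invertible itself for the coupling layer to be invertible.
    return [2.0 * math.tanh(v) for v in x1]

def coupling_forward(x):
    """Additive coupling: pass the first half through, shift the second half."""
    half = len(x) // 2  # assumes len(x) is even
    x1, x2 = x[:half], x[half:]
    y2 = [b + t for b, t in zip(x2, shift(x1))]
    return x1 + y2

def coupling_inverse(y):
    """Undo the shift using the untouched first half."""
    half = len(y) // 2
    y1, y2 = y[:half], y[half:]
    x2 = [b - t for b, t in zip(y2, shift(y1))]
    return y1 + x2

x = [0.5, -1.0, 2.0, 3.0]
y = coupling_forward(x)
recovered = coupling_inverse(y)
```

Because the first half passes through unchanged, the inverse can recompute the same shift and subtract it, so no information about the input is discarded (up to floating-point rounding).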
ISBN: 9781728198354 (print)
Image inpainting has made significant progress thanks to convolutional neural networks (CNNs), and deep learning-based methods have shown extraordinary performance in this field. In this paper, we propose a novel, purely CNN-based image inpainting architecture that jointly reconstructs the structure and texture of the image. Our generative network architecture (TSFC) consists of two parallel stages: structure generation and texture generation. In the structure generation stage, we use large convolution kernels, which are largely neglected in modern networks, exploiting their large effective receptive field to enhance the perception of overall structural features. In the texture generation stage, we use small convolution kernels to extract local texture features. Qualitative and quantitative experimental results on the CelebA-HQ and Paris Street View datasets demonstrate the effectiveness and superiority of our method.
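The reach of one large kernel compared with a stack of small kernels can be illustrated with a short calculation. This is a minimal sketch assuming stride-1, undilated convolutions; the kernel sizes (13 and 3) are hypothetical and are not taken from the paper.

```python
def receptive_field(kernel_sizes):
    """Theoretical receptive field of stacked stride-1, undilated convolutions."""
    return 1 + sum(k - 1 for k in kernel_sizes)

# Hypothetical kernel sizes chosen only for illustration.
large = receptive_field([13])         # one 13x13 layer
small_stack = receptive_field([3] * 6)  # six stacked 3x3 layers
```

Under these assumptions a single 13x13 layer covers the same theoretical extent as six stacked 3x3 layers. This is only the theoretical bound; the effective receptive field that the abstract refers to is generally reported to be smaller than it.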
Cross-modal matching is one of the most fundamental and widely studied tasks in the field of data science. To have a better understanding of the complicated cross-modal correspondences, the powerful attention mechanis...
This article discusses the servo control technology for the automatic screw-tightening process of a robotic arm based on multiple visual sensors, aiming at the assembly requirements of complex spatial structural compo...
ISBN: 9798350323658 (print)
In recent years, impressive results have been achieved in robotic manipulation. While many efforts focus on generating collision-free reference signals, few allow safe contact between the robot body and the environment. In everyday human manipulation, however, contact between arms and obstacles is prevalent and even necessary. This paper investigates the benefit of allowing safe contact during robotic manipulation and advocates generating and tracking compliance reference signals in both operational and null spaces. In addition, to optimize the collision-allowed trajectories, we present a hybrid solver that integrates sampling- and gradient-based approaches. We evaluate the proposed method on a goal-reaching task in five simulated and real-world environments with different collision conditions. We show that allowing safe contact improves goal-reaching efficiency and provides feasible solutions in highly collisional scenarios where collision-free constraints cannot be enforced. Moreover, we demonstrate that planning in null space, in addition to operational space, improves trajectory safety. Further information is available at https://***/ContactReach/.
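The distinction between operational space and null space can be sketched with the standard redundancy-resolution formula q_dot = J^+ x_dot + (I - J^+ J) z. This is textbook kinematics, not the paper's planner or solver; the single-row Jacobian `j`, the task velocity, and the secondary motion `z` below are hypothetical values.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def null_space_projector(j):
    """Return N = I - J^+ J for a single-row Jacobian J = j (1 x n)."""
    jj = dot(j, j)  # J J^T is a scalar for a one-dimensional task
    n = len(j)
    return [[(1.0 if r == c else 0.0) - j[r] * j[c] / jj for c in range(n)]
            for r in range(n)]

def joint_velocity(j, task_vel, secondary):
    """q_dot = J^+ x_dot + N z, with J^+ = J^T / (J J^T)."""
    jj = dot(j, j)
    primary = [ji * task_vel / jj for ji in j]
    projector = null_space_projector(j)
    null_part = [dot(row, secondary) for row in projector]
    return [p + q for p, q in zip(primary, null_part)]

j = [1.0, 2.0, 2.0]     # hypothetical 1x3 task Jacobian
z = [0.3, -0.1, 0.4]    # hypothetical secondary joint motion
q_dot = joint_velocity(j, task_vel=0.9, secondary=z)
achieved = dot(j, q_dot)  # task-space velocity produced by q_dot
```

The projected secondary motion changes the joint velocities without changing the task-space velocity, which is why extra objectives such as compliant contact can be pursued in null space without disturbing the operational-space goal.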
ISBN: 9798400716584 (print)
The proceedings contain 17 papers. The topics discussed include: clustering-based cancer diagnosis model for whole slide image; shared embedding of X-ray & Enose networks for lung cancer classification; performance analysis of lightweight vision transformers and deep convolutional neural networks in detecting brain tumors in MRI scans: an empirical approach; an automated skin lesions classification using hybrid CNN and transformer based deep learning model; a novel approach for repairing unsegmented liver vascular images based on centerline; medical image joint deringing and denoising using Fourier neural operator; and NerveStitcher2.0: evolution of stitching algorithm for corneal confocal microscope images with optical flow.
Face restoration is a critical task in computer vision, aiming to restore high-quality facial images from degraded inputs. In existing diffusion models, identity information is not well preserved when confronted with ...
The development of efficient noise reduction techniques is imperative to maintain the quality and features of digital images, as they are essential in numerous key applications. Specifically, noise from salt and peppe...
Computer-aided gangue recognition is of great significance to the coal industry. To address the low accuracy and relatively high error rate of coal gangue recognition in current computer-aided recognition systems, a coal gan...
Balise uplink signal IQ (I: in-phase; Q: quadrature) acquisition is an important basis for signal demodulation and uplink signal parameter estimation. Relying on traditional acquisition methods, it is difficult to achie...