Image inpainting has been researched for years. From deeper and larger models to models that focus on global information, all of them aim to obtain results closer to reality. In this paper, we combine the stripe windo...
详细信息
ISBN:
(纸本)9781728198354
Image inpainting has been researched for years. From deeper and larger models to models that focus on global information, all of them aim to obtain results closer to reality. In this paper, we combine the stripe window and line-by-line feature shift to modify the vision Transformer (ViT) to reduce the computation cost and obtain global information from the oblique attention. In addition, we design a new loss function to enhance the texture and colors for inpainting. At last, to validate the efficacy of our proposed model, we conduct extensive experiments on commonly seen datasets (Places2 and CelebA) compared with other state-of-the-art methods. The source code and pretrained models are available at https://***/bobo0303/MSCS-Net.
The proceedings contain 33 papers. The topics discussed include: DWT-RT: a lightweight image deraining model based on discrete wavelet transforms;data-driven optimal traffic signal control with phase priority and swit...
ISBN:
(纸本)9798350308020
The proceedings contain 33 papers. The topics discussed include: DWT-RT: a lightweight image deraining model based on discrete wavelet transforms;data-driven optimal traffic signal control with phase priority and switching cost;sonar object detection based on global context feature fusion and extraction;an image decomposition-based enhancement using a matrix iterative algorithm;tendency coefficient-based weighted distance measure for intuitionistic fuzzy sets with applications;higher-order link prediction based on message passing simplicial networks;short-term power load forecasting based on CEEMDAN-CNN-LSTM hybrid modeling;a method for large scale unconstrained binary quadratic programming problem based on graph neural network;encoding variable stiffness skills with interaction force and motion information for robot-environment interaction;and distributed Nash equilibrium seeking for high-order dynamics with event-triggered communication.
This paper introduces briefly the history and growth of the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge, workshop, research area and research community. Created in 2013 as a data evalu...
详细信息
In recent years, the applications of small-scale and soft robots in various tasks have significantly increased. However, traditional control systems have been proven no longer competent for further use due to neglecti...
详细信息
vision Transformers (ViTs) have shown promise in medical image semantic segmentation (MISS) by capturing long-range correlations. However, ViTs often struggle to model local spatial information effectively, which is e...
详细信息
Depth estimation is a pivotal challenge in the realm of signalprocessing, finding various applications in fields like robotics and autonomous systems. Multiple cameras are used in these applications and are found to ...
详细信息
Due to the high activation sparsity and use of accumulates (AC) instead of expensive multiply-and-accumulates (MAC), neuromorphic spiking neural networks (SNNs) have emerged as a promising low-power alternative to tra...
At present, the programming methods of industrial robots mainly include off-line programming and instructional programming. However, both of the methods are time-consuming and require experienced robotics technicians....
详细信息
Sign language is the most common means of communication among the speech- and hearing-impaired. Just like all other languages, tools are being developed for interlanguage translation from sign language to text;however...
详细信息
Zero-shot anomaly detection (ZSAD) identifies anomalies without needing training samples from the target dataset, essential for scenarios with privacy concerns or limited data. vision-language models like CLIP show po...
详细信息
暂无评论