Arbitrary-resolution image generation still remains a challenging task in AIGC, as it requires handling varying resolutions and aspect ratios while maintaining high visual quality. Existing transformer-based diffusion...
Super-resolution is particularly helpful for applications like larger computer screens, portable electronics like cellphones and cameras, and high-definition television sets. It plays a crucial role in improving visua...
详细信息
As video conferencing becomes an indispensable part of human's daliy life, how to achieve a high-fidelity calling experience under low bandwidth has been a popular and challenging issue. Deep generative models hav...
详细信息
visual tracking aims to estimate the state of an arbitrary object in a video frame only when the bounding box is given in the first frame. However, the existing trackers still struggle to adapt to complex environments...
详细信息
ISBN:
(纸本)9789819916382;9789819916399
visual tracking aims to estimate the state of an arbitrary object in a video frame only when the bounding box is given in the first frame. However, the existing trackers still struggle to adapt to complex environments due to the lack of adaptive appearance features. In this paper, we propose a graph attention transformer network, termed GATransT, to improve the robustness of visual tracking. Specifically, we design an adaptive graph attention module to enrich the embedding information extracted by the transformer backbone, which establishes the part-to-part correspondences between the template and search nodes. Extensive experimental results demonstrate that the proposed tracker outperforms the state-of-the-art methods on five challenging datasets, including OTB100, UAV123, LaSOT, GOT-10k, and TrackingNet.
image background removal is a crucial technique for enhancing the visual impact of images or altering their composition, finding applications in various fields such as photography and computer vision. This process can...
详细信息
Intra block copy with local illumination compensation (IBC-LIC) is a coding technique utilized in video coding to compensate for illumination variation between the current block and its prediction block within the pic...
详细信息
Versatile Video Coding (VVC) has adopted a quad-Tree with a nested multi-Type tree (QTMT) partition structure to improve the rate-distortion (RD) performance, but this greatly increases complexity due to the brute-for...
详细信息
Based on the research of AI (Artificial Intelligence) technology and image recognition technology, this article designed a monitoring and recognition system around the principle of universality. image data collection ...
详细信息
Remote medical diagnosis has emerged as a critical and indispensable technique in practical medical systems, where medical data are required to be efficiently compressed and transmitted for diagnosis by either profess...
详细信息
In this paper, I propose a fuzzy logic methodology for the problem of uncertainty in an innovative manner. The foundation of this strategy is synthetic aperture radar, which can provide high-resolution images in any c...
详细信息
暂无评论