Transformers have recently led to encouraging progress in computer vision. In this work, we present new baselines by improving the original Pyramid Vision Transformer (PVT v1) by adding three designs: (i) a linear complexity attention layer, (ii) an overlapping patch embedding, and (iii) a convolutional feed-forward network. With these modifications, PVT v2 reduces the computational complexity of PVT v1 to linear and provides significant improvements on fundamental vision tasks such as classification, detection, and segmentation. In particular, PVT v2 achieves comparable or better performance than recent work such as the Swin Transformer. We hope this work will facilitate state-of-the-art transformer research in computer vision. Code is available at https://***/whai362/PVT.
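To illustrate what "linear complexity" means for an attention layer, here is a rough cost sketch. It assumes one common way to get linear cost: pooling the keys and values down to a fixed number of tokens before attention. The abstract does not state the mechanism, so the pooled size of 49 tokens, the function name `attention_cost`, and the channel width of 64 are illustrative assumptions, not details taken from the paper.

```python
from typing import Optional


def attention_cost(num_tokens: int, dim: int, pooled_tokens: Optional[int] = None) -> int:
    """Approximate multiply-adds for the two matrix products in one attention layer.

    pooled_tokens is a hypothetical fixed key/value length (an assumption for
    illustration); None means keys/values keep all num_tokens entries.
    """
    kv_tokens = num_tokens if pooled_tokens is None else pooled_tokens
    # Q @ K^T and (attention weights) @ V each cost num_tokens * kv_tokens * dim.
    return 2 * num_tokens * kv_tokens * dim


if __name__ == "__main__":
    dim = 64  # assumed channel width
    for side in (28, 56, 112):
        n = side * side  # tokens in a side x side feature map
        full = attention_cost(n, dim)
        pooled = attention_cost(n, dim, pooled_tokens=49)
        print(f"{side}x{side}: full={full:,} pooled={pooled:,}")
```

Under these assumptions, quadrupling the number of tokens multiplies the full attention cost by 16 but the pooled cost only by 4, which is the sense in which the cost is linear in the input size.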
Hair editing is a critical image synthesis task that aims to edit hair color and hairstyle using text descriptions or reference images, while preserving irrelevant attributes (e.g., identity, background, cloth). Many ...
With the rapid growth in the number of mobile devices, more and more data is created and requested by users. By caching data on edge servers, users can obtain the content they request from a closer location, which r...
Cross-modal retrieval of image-text and video-text is a prominent research area in computer vision and natural language processing. However, there has been insufficient attention given to cross-modal retrieval between...
Sequential diagnosis prediction (SDP) is a challenging task, aiming to predict patients' future diagnoses based on their historical medical records. While methods based on graph neural networks (GNNs) have proven ...
Medical images, such as X-rays, MRI scans, and CT scans, play an important role in diagnosing and treating various diseases. However, the sensitive nature of these images requires protection against unauthorized use a...
Sim-to-real transfer, which trains RL agents in simulated environments and then deploys them in the real world, has been widely used to overcome the limitations of gathering samples in the real world. Despite the ...
ISBN: (print) 9798331314385
Two-stage recommender systems play a crucial role in efficiently identifying relevant items and personalizing recommendations from a vast array of options. This paper, based on an error decomposition framework, analyzes the generalization error for two-stage recommender systems with a tree structure, which consist of an efficient tree-based retriever and a more precise yet time-consuming ranker. We use the Rademacher complexity to establish the generalization upper bound for various tree-based retrievers using beam search, as well as for different ranker models under a shifted training distribution. Both theoretical insights and practical experiments on real-world datasets indicate that increasing the branches in tree-based retrievers and harmonizing distributions across stages can enhance the generalization performance of two-stage recommender systems.
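The two-stage pipeline described above (a tree-based retriever searched with beam search, followed by a ranker) can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the paper's implementation: the tree, the score tables, and the function names `beam_search_retrieve` and `two_stage_recommend` are all hypothetical.

```python
from typing import Callable, Dict, List

# Hypothetical toy tree: internal nodes map to children; leaves are item ids.
TREE: Dict[str, List[str]] = {
    "root": ["a", "b"],
    "a": ["item1", "item2"],
    "b": ["item3", "item4"],
}
# Made-up scores standing in for a cheap retriever model and a costlier ranker.
NODE_SCORES = {"a": 0.9, "b": 0.4, "item1": 0.2, "item2": 0.8, "item3": 0.9, "item4": 0.1}
RANK_SCORES = {"item1": 0.1, "item2": 0.7, "item3": 0.95, "item4": 0.3}


def beam_search_retrieve(
    tree: Dict[str, List[str]],
    root: str,
    node_score: Callable[[str], float],
    beam_width: int,
) -> List[str]:
    """Descend the tree level by level, keeping the beam_width best nodes per level."""
    frontier = [root]
    leaves: List[str] = []
    while frontier:
        children = [child for node in frontier for child in tree.get(node, [])]
        kept = sorted(children, key=node_score, reverse=True)[:beam_width]
        leaves.extend(c for c in kept if c not in tree)  # leaves become candidates
        frontier = [c for c in kept if c in tree]  # internal nodes are expanded next
    return leaves


def two_stage_recommend(
    tree: Dict[str, List[str]],
    root: str,
    node_score: Callable[[str], float],
    rank_score: Callable[[str], float],
    beam_width: int,
    top_k: int,
) -> List[str]:
    """Stage 1: retrieve candidates by beam search. Stage 2: rerank them and keep top_k."""
    candidates = beam_search_retrieve(tree, root, node_score, beam_width)
    return sorted(candidates, key=rank_score, reverse=True)[:top_k]
```

With these made-up scores, a beam width of 1 prunes the branch that contains the ranker's favorite item, so the ranker never sees it; a beam width of 2 keeps it. This is the kind of retriever-stage error that bounds what the ranker can recover.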
The spatio-temporal sequence of human body movements provides important information about daily action patterns. This article presents a pyroelectric infrared (PIR) sensor array for detecting human motion features and...