Retinopathy of Prematurity (ROP) is a retina disorder that affects premature infants with lower weights. If the patient cannot get the treatment in time when the illness reaches the last stage, irreversible vision los...
详细信息
Crowdsourcing has been a helpful mechanism to leverage human intelligence to acquire useful ***,when we aggregate the crowd knowledge based on the currently developed voting algorithms,it often results in common knowl...
详细信息
Crowdsourcing has been a helpful mechanism to leverage human intelligence to acquire useful ***,when we aggregate the crowd knowledge based on the currently developed voting algorithms,it often results in common knowledge that may not be *** this paper,we consider the problem of collecting specific knowledge via *** the help of using external knowledge base such as WordNet,we incorporate the semantic relations between the alternative answers into a probabilisticmodel to determine which answer is more *** formulate the probabilistic model considering both worker’s ability and task’s difficulty from the basic assumption,and solve it by the expectation-maximization(EM)*** increase algorithm compatibility,we also refine our method into semi-supervised *** results show that our approach is robust with hyper-parameters and achieves better improvement thanmajority voting and other algorithms when more specific answers are expected,especially for sparse data.
Stance detection is an important task, which aims to classify the attitude of an opinionated text toward a given target. In this paper, we develop an interpretable neural production system for stance detection (NPS4SD...
详细信息
The purpose of video inpainting is to fill a specified area with reasonable content. However, in the case of multiple targets and complex textures, current methods struggle to distinguish between feature information o...
The purpose of video inpainting is to fill a specified area with reasonable content. However, in the case of multiple targets and complex textures, current methods struggle to distinguish between feature information of the targets, leading to confusing or fuzzy inpainting results. In this paper, we design a new text-video completion network based on a motion compensation and temporal attention feature aggregation. Our network utilizes information from reference frames and target frames to complete the damaged region of the target frame. We first employ motion compensation to align the features of reference frames, and then use the temporal attention module to aggregate these features, resulting in accurate and reasonable content. To evaluate the effectiveness of our method, we introduce a new text video dataset with multiple text objects and complex textures, presenting a novel and challenging task for inpainting research. Through quantitative and qualitative comparison experiments, we demonstrate that our model outperforms existing baseline models in scenarios with multiple objects and complex textures.
Generating genuine images from textual description is challenging for both computer vision and linguistic representation in text-to-image synthesis. Generative adversarial networks (GAN) are an emerging generative mod...
详细信息
ISBN:
(数字)9798350375237
ISBN:
(纸本)9798350375244
Generating genuine images from textual description is challenging for both computer vision and linguistic representation in text-to-image synthesis. Generative adversarial networks (GAN) are an emerging generative model that has been producing great results by generating high-quality images with diverse images. The present review provides an overview of GAN with its background like architecture, game theory key ideas, loss functions, performance metrics and challenges. Recent and relevant text-to-image GAN models are discussed, including Gradual Refinement GAN (GR-GAN), Generative Adversarial CLIPS (GALIP), GigaGAN and StyleGAN-T with their architecture, dataset used highlighting limitations, strengths, year, and applications and comparing their performance metrics like Inception Score (IS) and Fréchet Inception Distance (FID) with identifying future directions, such as drawing boundaries for open research challenges. The present review functions as an extensive knowledge base and is highly valuable for researchers and practitioners who are interested in learning more about text-to-image using GANs.
Predicting retail sales is a hot research topic that can help firms achieve on-demand procurement. That can reduce the extra costs caused by an inventory shortage or surpluses. Traditional methods usually regard the s...
详细信息
Stable learning aims to learn a model that generalizes well to arbitrary unseen target domain by leveraging a single source domain. Recent advances in stable learning have focused on balancing the distribution of conf...
详细信息
Tiny object detection has been a challenging topic in computer vision recent years. Moreover, in remote sensing field, smaller and clustered tiny objects make its detection more difficult compared to ground-based imag...
详细信息
Dialogue response generation has made significant progress, but most research has focused on dyadic dialogue. In contrast, multi-party dialogues involve more participants, each potentially discussing different topics,...
详细信息
Correspondence pruning aims to establish reliable correspondences between two related images and recover relative camera motion. Existing approaches often employ a progressive strategy to handle the local and global c...
详细信息
暂无评论