Dialogue response generation has made significant progress, but most research has focused on dyadic dialogue. In contrast, multi-party dialogues involve more participants, each potentially discussing different topics,...
详细信息
As the material basis for all life activities, proteins play a crucial role in performing life activities. Most real-life proteins perform various functions in the form of protein complexes, so it is essential for und...
详细信息
Predicting retail sales is a hot research topic that can help firms achieve on-demand procurement. That can reduce the extra costs caused by an inventory shortage or surpluses. Traditional methods usually regard the s...
详细信息
Correspondence pruning aims to establish reliable correspondences between two related images and recover relative camera motion. Existing approaches often employ a progressive strategy to handle the local and global c...
详细信息
Stable learning aims to learn a model that generalizes well to arbitrary unseen target domain by leveraging a single source domain. Recent advances in stable learning have focused on balancing the distribution of conf...
详细信息
Tiny object detection has been a challenging topic in computer vision recent years. Moreover, in remote sensing field, smaller and clustered tiny objects make its detection more difficult compared to ground-based imag...
详细信息
Contrastive learning has been widely applied in sequential recommendation to improve the recommendation performance. Existing contrastive learning methods focus on adjusting the views number of positive and negative s...
详细信息
Generating genuine images from textual description is challenging for both computer vision and linguistic representation in text-to-image synthesis. Generative adversarial networks (GAN) are an emerging generative mod...
详细信息
ISBN:
(数字)9798350375237
ISBN:
(纸本)9798350375244
Generating genuine images from textual description is challenging for both computer vision and linguistic representation in text-to-image synthesis. Generative adversarial networks (GAN) are an emerging generative model that has been producing great results by generating high-quality images with diverse images. The present review provides an overview of GAN with its background like architecture, game theory key ideas, loss functions, performance metrics and challenges. Recent and relevant text-to-image GAN models are discussed, including Gradual Refinement GAN (GR-GAN), Generative Adversarial CLIPS (GALIP), GigaGAN and StyleGAN-T with their architecture, dataset used highlighting limitations, strengths, year, and applications and comparing their performance metrics like Inception Score (IS) and Fréchet Inception Distance (FID) with identifying future directions, such as drawing boundaries for open research challenges. The present review functions as an extensive knowledge base and is highly valuable for researchers and practitioners who are interested in learning more about text-to-image using GANs.
Deep learning has gained tremendous success in various fields while training deep neural networks(DNNs) is very compute-intensive, which results in numerous deep learning frameworks that aim to offer better usability ...
详细信息
Deep learning has gained tremendous success in various fields while training deep neural networks(DNNs) is very compute-intensive, which results in numerous deep learning frameworks that aim to offer better usability and higher performance to deep learning practitioners. Tensor Flow and Py Torch are the two most popular frameworks. Tensor Flow is more promising within the industry context, while Py Torch is more appealing in academia. However, these two frameworks differ much owing to the opposite design philosophy:static vs dynamic computation graph. Tensor Flow is regarded as being more performance-friendly as it has more opportunities to perform optimizations with the full view of the computation graph. However, there are also claims that Py Torch is faster than Tensor Flow sometimes, which confuses the end-users on the choice between them. In this paper, we carry out the analytical and experimental analysis to unravel the mystery of comparison in training speed on single-GPU between Tensor Flow and Py Torch. To ensure that our investigation is as comprehensive as possible, we carefully select seven popular neural networks, which cover computer vision, speech recognition, and natural language processing(NLP). The contributions of this work are two-fold. First, we conduct the detailed benchmarking experiments on Tensor Flow and Py Torch and analyze the reasons for their performance difference. This work provides the guidance for the end-users to choose between these two frameworks. Second, we identify some key factors that affect the performance,which can direct the end-users to write their models more efficiently.
In this work, we extend the concept of the p-mean welfare objective from social choice theory (Moulin 2004) to study p-mean regret in stochastic multi-armed bandit problems. The p-mean regret, defined as the differenc...
详细信息
暂无评论