In this work, an Enhanced-Precision Bandgap Reference Incorporating Temperature-Compensation was proposed. Unlike traditional bandgap references, the reference voltage output of 0.701 V is realized by combining refere...
详细信息
Heterogeneous domain adaptation(HDA) mainly considers how to solve the target domain task with the help of the relevant knowledge of the source domain when both data distribution and feature space of the target domain...
详细信息
Heterogeneous domain adaptation(HDA) mainly considers how to solve the target domain task with the help of the relevant knowledge of the source domain when both data distribution and feature space of the target domain are different from the source domain. In this paper, we propose Label-guided Heterogeneous Domain Adaptation method, which focuses on how to enhances the application of a small amount of labeled target domain data. In our algorithm, we consider learning a mapping matrix to map the data of two domains into a shared subspace and make predictions accordingly. Firstly, we match the marginal and conditional distribution of the source and target domain data. Secondly, considering the guidance of labeled data in the target domain, we combine all labeled data and adapted it to the unlabeled part of the target domain. Finally, we introduce F-norm to reduce the parameter complexity of the mapping matrix. We conduct extensive experiments on text-to-text and image-to-image transfer tasks, and the experimental results demonstrated that our algorithm is significantly superior to several state-of-the-art algorithms.
As an interesting and challenging task, sarcasm generation has attracted widespread attention. Although very recent studies have made promising progress, none of them considers generating a sarcastic description for a...
详细信息
ISBN:
(纸本)9781959429623
As an interesting and challenging task, sarcasm generation has attracted widespread attention. Although very recent studies have made promising progress, none of them considers generating a sarcastic description for a given image - as what people usually do on Twitter. In this paper, we present a Multi-modal Sarcasm Generation (MSG) task: Given an image with hashtags that provide the sarcastic target, MSG aims to generate sarcastic descriptions like humans. Compared with textual sarcasm generation, MSG is more challenging as it is difficult to accurately capture the key information from images, hashtags, and OCR tokens and exploit multi-modal incongruity to generate sarcastic descriptions. To support the research on MSG, we develop MuSG, a new dataset with 5000 images and related Twitter text. We also propose a multi-modal Transformer-based method as a solution to this MSG task. The input features are embedded in the common space and passed through the multi-modal Transformer layers to generate the sarcastic descriptions by the auto-regressive paradigm. Both automatic and manual evaluations demonstrate the superiority of our method. The dataset and code will be available at ***/lukakupolida/MSG.
Visual question generation aims to focus on some target objects in an image to generate questions with certain questioning purposes. Existing studies mainly utilize an answer to extract the target object corresponding...
详细信息
Visual question generation aims to focus on some target objects in an image to generate questions with certain questioning purposes. Existing studies mainly utilize an answer to extract the target object corresponding to the questioning purpose for questioning. However, answers fail to accurately and completely map to every target object, such as the objects corresponding to the answer are ambiguous or the answers are the relationship between multiple objects. To address this problem, we propose a content-controlled question generation model, which generates questions based on a given target object set specified from an image. Considering that the target objects have different contributions during the generation process, we design a recurrent generative architecture to explicitly control attention to different objects and their corresponding image information at each generative stage. Extensive experiments on the VQA v2.0 dataset and the Visual7w dataset show that the proposed model outperforms the state-of-the-art models and can controllably generate questions with specified content.(c) 2023 Elsevier Ltd. All rights reserved.
Web services have been integrated with all walks of life in society. Abnormalities in the network and services seriously affect user experience and company revenue. The system log records various information of the sy...
详细信息
Knee osteoarthritis (KOA) is a common joint disease that severely affects the normal lives of patients. In clinical practice, the severity of KOA is evaluated by observing the X-ray images of the knee joint, which is ...
详细信息
Computer vision technologies are increasingly commonly used in daily life, and video super-resolution is gradually drawing more attention in the computer vision community. In this work, we propose an improved EDVR mod...
详细信息
ISBN:
(纸本)9781665458245
Computer vision technologies are increasingly commonly used in daily life, and video super-resolution is gradually drawing more attention in the computer vision community. In this work, we propose an improved EDVR model to tackle the robustness and efficiency problems of the original EDVR model in video super-resolution. First, to handle the blurring situations and emphasize the effective features, we devise a preprocessing module consisting of rigid convolution sub-modules and feature enhancement sub-modules, which are flexible and effective. Second, we devise a temporal 3D convolutional fusion module, which can extract information in image frames more accurately and rapidly. Third, to better utilize the information in feature maps, we design a new reconstruction block by introducing a new channel attention approach. Moreover, we use multiple programmatic methods to accelerate the model training and inference process, making the model useful for practical applications.
For the sake of the fixed magnitude of candidate control sets, the conventional model predictive current control (MPCC) for asymmetric six-phase permanent magnet motors suffers from huge harmonic currents. This articl...
详细信息
Increasingly complex systems contain large numbers of devices that generate great number of multivariate time series that are monitored and recorded. For anomaly detection of these complex time series, deep learning t...
详细信息
Graph convolutional networks (GCNs) are popular for a variety of graph learning tasks. ReRAM-based processing-in-memory (PIM) accelerators are promising to expedite GCN training owing to their in-situ computing capabi...
详细信息
暂无评论