In this study, we consider a crowdsourcing classification problem in which labeling information from crowds is aggregated to infer latent true labels. We propose a fully Bayesian deep generative crowdsourcing model(Ba...
详细信息
In this study, we consider a crowdsourcing classification problem in which labeling information from crowds is aggregated to infer latent true labels. We propose a fully Bayesian deep generative crowdsourcing model(Bayes DGC), which combines the strength of deep neural networks(DNNs) on automatic representation learning and the interpretable probabilistic structure encoding of probabilistic graphical models. The model comprises a DNN classifier as a prior for the true labels and a probabilistic model for the annotation generation process. The DNN classifier and annotation generation process share the latent true label variables. To address the inference challenge, we developed a natural-gradient stochastic variational inference, which combines variational message passing for conjugate parameters and stochastic gradient descent for DNN and learns the distribution of latent true labels and workers' confusion matrix via end-to-end training. We illustrated the effectiveness of the proposed model using empirical results on 22 real-world datasets.
We present a unified probabilistic formulation for diffusion-based image editing, where a latent variable is edited in a task-specific manner and generally deviates from the corresponding marginal distribution induced...
详细信息
Soft threshold pruning is among the cutting-edge pruning methods with state-of-the-art performance. However, previous methods either perform aimless searching on the threshold scheduler or simply set the threshold tra...
详细信息
The study of rigid protein-protein docking plays an essential role in a variety of tasks such as drug design and protein engineering. Recently, several learning-based methods have been proposed, exhibiting much faster...
详细信息
Most problems in the power system are related to the temperature of electrical equipment. The abnormal high temperature of electrical equipment will not only cause damage to equipment itself, but also threaten the saf...
详细信息
Multimodal knowledge editing represents a critical advancement in enhancing the capabilities of Multimodal Large Language Models (MLLMs). Despite its potential, current benchmarks predominantly focus on coarse-grained...
详细信息
Adapting large language models (LLMs) to new languages typically involves continual pre-training (CT) followed by supervised finetuning (SFT).However, this CT-then-SFT approach struggles with limited data in the conte...
详细信息
作者:
Li, XiyuanWang, YoujunLiu, WeiweiSchool of Computer Science
Wuhan University National Engineering Research Center for Multimedia Software Wuhan University Institute of Artificial Intelligence Wuhan University Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University
Sparse ridge regression problems play a significant role across various domains. To solve sparse ridge regression, [1] recently proposes an advanced algorithm, Scalable Optimal K-Sparse Ridge Regression (OKRidge), whi...
More and more researchers have paid attention to the tracking of visible-thermal infrared (RGB-T). How to fully exploit the complementary features of visible and thermal infrared images and fully integrate them is a k...
详细信息
Deep and convolutional neural networks have performed well in capturing speaker characteristics, while the ECAPA-TDNN model has demonstrated outstanding performance in both the fields of speaker validation and speaker...
详细信息
暂无评论