In recent years, cross-modal hashing (CMH) has attracted increasing attentions, mainly because its potential ability of mapping contents from different modalities, especially in vision and language, into the same spac...
详细信息
ISBN:
(数字)9781728171685
ISBN:
(纸本)9781728171692
In recent years, cross-modal hashing (CMH) has attracted increasing attentions, mainly because its potential ability of mapping contents from different modalities, especially in vision and language, into the same space, so that it becomes efficient in cross-modal data retrieval. There are two main frameworks for CMH, differing from each other in whether semantic supervision is required. Compared to the unsupervised methods, the supervised methods often enjoy more accurate results, but require much heavier labors in data annotation. In this paper, we propose a novel approach that enables guiding a supervised method using outputs produced by an unsupervised method. Specifically, we make use of teacher-student optimization for propagating knowledge. Experiments are performed on two popular CMH benchmarks, i.e., the MIRFlickr and NUS-WIDE datasets. Our approach outperforms all existing unsupervised methods by a large margin.
By comparing quantitative ranking with qualitative contributions, we reveal that academic assessment has to put real contributions ahead of quantitative indicators and that rankings have nothing to do with universitie...
By comparing quantitative ranking with qualitative contributions, we reveal that academic assessment has to put real contributions ahead of quantitative indicators and that rankings have nothing to do with universities’ and their libraries’ true values. The greatness of a university lies in its impacts on the progress for human knowledge and the promotion for social development. Although ranking of universities by way of quantitative indicators can reflect some information, we should pay more attention to qualitative contributions.
The proliferation of massive open online courses (MOOCs) demands an effective way of course recommendation for jobs posted in recruitment websites, especially for the people who take MOOCs to find new jobs. Despite th...
详细信息
Relation extraction is an important information extraction task in many Natural Language Processing (NLP) applications, such as automatic knowledge graph construction, question answering, sentiment analysis, etc. Howe...
详细信息
In recent years, cross-modal hashing (CMH) has attracted increasing attentions, mainly because its potential ability of mapping contents from different modalities, especially in vision and language, into the same spac...
详细信息
Big graph data is different from traditional data and they usually contain complex relationships and multiple attributes. With the help of graph pattern matching, a pattern graph can be designed, satisfying special pe...
详细信息
This paper presents one-bit supervision, a novel setting of learning from incomplete annotations, in the scenario of image classification. Instead of training a model upon the accurate label of each sample, our settin...
ISBN:
(纸本)9781713829546
This paper presents one-bit supervision, a novel setting of learning from incomplete annotations, in the scenario of image classification. Instead of training a model upon the accurate label of each sample, our setting requires the model to query with a predicted label of each sample and learn from the answer whether the guess is correct. This provides one bit (yes or no) of information, and more importantly, annotating each sample becomes much easier than finding the accurate label from many candidate classes. There are two keys to training a model upon one-bit supervision: improving the guess accuracy and making use of incorrect guesses. For these purposes, we propose a multi-stage training paradigm which incorporates negative label suppression into an off-the-shelf semi-supervised learning algorithm. In three popular image classification benchmarks, our approach claims higher efficiency in utilizing the limited amount of annotations.
Triplet learning, i.e. learning from triplet data, has attracted much attention in computer vision tasks with an extremely large number of categories, e.g., face recognition and person re-identification. Albeit with r...
详细信息
Standardized visualization of electro- or mechano-anatomical data allows easy inter- and intra-patient comparison. For this purpose, we developed the open-source and freely available UNISYS (Universal Ventricular Bull...
详细信息
ISBN:
(数字)9781728173825
ISBN:
(纸本)9781728111056
Standardized visualization of electro- or mechano-anatomical data allows easy inter- and intra-patient comparison. For this purpose, we developed the open-source and freely available UNISYS (Universal Ventricular Bullseye Visualization) software. A patient-specific mesh of the ventricular anatomy typically consists of a certain number of vertices and their associated values. Based on a limited amount of user inputs, the algorithm transforms these 3D single-layer coordinates to a circular 2D disk ('bullseye') through a number of translations and rotations, and interpolates them to achieve a continuous standardized visualization. The algorithm shows a high degree of bidirectionality and a robust spatial preservation of points of interest.
Deep learning seeks to achieve excellent performance for representation learning in image datasets. However, supervised deep learning models such as convolutional neural networks require a large number of labeled imag...
详细信息
暂无评论