After the outbreak of the COVID-19 pandemic, online teaching has gradually emerged as an indispensable component of education. Despite its convenience, online education lacks the immediate interactive experience inher...
详细信息
Edge networks are highly volatile and the quality of device communication and computational resources change not only over time but also according to the movement of users. Current federation learning suffers from poo...
详细信息
Multi-focus image fusion is a hot topic in the field of image processing, and it is a fundamental problem in the fields of image editing, image synthesis, and target retrieval. In previous fusion methods, although fea...
详细信息
Mathematical formulas are ubiquitous in courses such as Machine Learning. Understanding these formulas is the primary challenge for studying of students. If a searchable knowledge base can be built for these formulas,...
详细信息
Accurate prediction of future traffic flow trends is essential to solve urban transportation problems. However, traffic flow prediction faces great challenges due to the multimodal nature of pedestrian behavior and th...
详细信息
作者:
Li, ChenmingLiu, ShiguangTianjin University
School of Computer Science and Technology College of Intelligence and Computing Tianjin300350 China Tianjin University
Tianjin Key Laboratory of Cognitive Computing and Application Tianjin300350 China
This paper proposes an end-to-end video saliency prediction network model, termed TM2SP-Net (Transformer-based Multi-level Spatiotemporal Feature Pyramid Network). Leveraging the strong encoding learning capability of...
详细信息
Temporal Anti-Aliasing (TAA) is a popular method for eliminating temporal aliasing problems. However, the images simply processed by TAA become blurred and lose some details. In this paper, an improved TAA algorithm n...
详细信息
Directed fuzzing technology is one of the key technologies to quickly reach a specific location of software, and to conduct targeted testing or bug recurrence. However, directed fuzzing technology has some problems, s...
详细信息
In this paper, we propose the speech quality assessment transformer utilizing listener dependent modeling (SQAT-LD) mean opinion score (MOS) prediction system, which was submitted to the 2023 VoiceMOS Challenge. The s...
In this paper, we propose the speech quality assessment transformer utilizing listener dependent modeling (SQAT-LD) mean opinion score (MOS) prediction system, which was submitted to the 2023 VoiceMOS Challenge. The system is based on a combination of self-supervised learning (SSL) models and listener-dependent modeling. Due to this challenge’s emphasis on real-world and challenging zero-shot out-of-domain MOS prediction in three different voice evaluation scenarios, we specifically designed a two-branch module to predict scores and weights for each frame, aiming to achieve better generalization. In the challenge, our system achieved fourth place in Track 1a, second place in Track 1b and first place in Track 2. Additionally, we conducted an ablation study to investigate the effectiveness of our proposed method.
Cross-scene multispectral point cloud classification aims to transfer knowledge of labeled source scenes to improve the discriminability of the model on the unlabeled target scenes. From a novel perspective, we argue ...
Cross-scene multispectral point cloud classification aims to transfer knowledge of labeled source scenes to improve the discriminability of the model on the unlabeled target scenes. From a novel perspective, we argue that the information transfer between the source and target scenes can be used to solve cross-scene multispectral point cloud classification task. Specifically, we propose a Coupled Graph Convolutional Network (Coupled-GCN) to achieve joint alignment of node- and class-level structures within scenes by passing information between different scenes. To reduce the effect of spectral shift between the source and target scenes and seek scene-invariant intrinsic features, we propose a scene adaptive learning module by optimizing three different loss functions, namely, source classifier loss, domain classifier loss, and target classifier loss as a whole. In the cross-scene multispectral point cloud classification task, the proposed Coupled-GCN can alleviate the spectral shift problem compared to the traditional GCN and achieves an overall F_score of 65.04%.
暂无评论