Underwater image enhancement (UIE) focuses on mitigating the image quality degradation caused by light absorption and scattering. However, most existing methods enhance images in a global, uniform manner, neglecting the ...
To address the susceptibility of ultra-wideband systems to interference from narrowband signals, this paper presents a novel ultra-wideband (UWB) bandpass filter with dual-notch characteristics. This filter is mainly compose...
This paper presents a hybrid method for determining the continuous dielectric properties of clothing materials. The dielectric constant and loss tangent of three types of materials are first investigated using open...
We present a novel algorithm for point pattern matching by means of the spectra of directed graphs. Given a feature point set, we construct a weighted directed graph and the skew-symmetric matrix associated with it. Using the spectral decomposition of this matrix, we give a spectral representation of the feature points that uses half of the eigenvectors. We show theoretically that our method handles the matching problem under affine transformation. Experiments on synthetic data and real-world images show the effectiveness of our method.
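A minimal sketch of why half of the eigenvectors can suffice for a skew-symmetric matrix. The abstract does not state how the edge weights are defined, so the weighting below (signed Euclidean distance, function name `skew_symmetric_from_points`) is a hypothetical stand-in, not the authors' construction. For any real skew-symmetric matrix A, the matrix iA is Hermitian, its eigenvalues are real and occur in pairs of opposite sign, and the eigenvectors of each pair are complex conjugates of one another, so the positive half carries the full information.

```python
import numpy as np

def skew_symmetric_from_points(points):
    """Hypothetical weighting (an assumption, not from the paper):
    A[i, j] = +d(i, j) for i < j and -d(i, j) for i > j."""
    pts = np.asarray(points, dtype=float)
    diff = pts[:, None, :] - pts[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    upper = np.triu(dist, k=1)
    return upper - upper.T

def half_spectrum(A, tol=1e-12):
    """Eigenpairs of the Hermitian matrix iA with strictly positive eigenvalue."""
    vals, vecs = np.linalg.eigh(1j * A)
    keep = vals > tol
    return vals[keep], vecs[:, keep]

points = [(0.0, 0.0), (1.0, 0.0), (1.0, 2.0), (0.0, 1.5)]
A = skew_symmetric_from_points(points)

# Full spectrum of iA, in ascending order: symmetric about zero.
vals_all = np.linalg.eigvalsh(1j * A)

# Keep only the positive half; each row of vecs_half describes one point.
vals_half, vecs_half = half_spectrum(A)
```

Each row of `vecs_half` can be read as a spectral descriptor of the corresponding point. How the descriptors are compared between two point sets, and how affine invariance is obtained, is not covered by this sketch.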
ISBN: (print) 9798331314385
With the rapid development of multimedia technology, audio-visual learning has emerged as a promising research topic within the field of multimodal analysis. In this paper, we explore parameter-efficient transfer learning for audio-visual learning and propose the Audio-Visual Mixture of Experts (AVMoE) to inject adapters into pre-trained models flexibly. Specifically, we introduce unimodal and cross-modal adapters as multiple experts to specialize in intra-modal and intermodal information, respectively, and employ a lightweight router to dynamically allocate the weights of each expert according to the specific demands of each task. Extensive experiments demonstrate that our proposed approach AVMoE achieves superior performance across multiple audio-visual tasks, including AVE, AVVP, AVS, and AVQA. Furthermore, visual-only experimental results also indicate that our approach can tackle challenging scenes where modality information is missing. The source code is available at https://***/yingchengy/AVMOE.
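The routing idea can be sketched in a few lines of NumPy. This is an illustration under stated assumptions, not the AVMoE implementation: the bottleneck `Adapter`, the mean-pooled router input, the way the cross-modal expert mixes in the other modality, and all sizes are hypothetical choices made only to show a lightweight router assigning softmax weights to unimodal and cross-modal experts.

```python
import numpy as np

def softmax(x):
    z = x - np.max(x)
    e = np.exp(z)
    return e / e.sum()

class Adapter:
    """Bottleneck adapter: down-project, ReLU, up-project (hypothetical design)."""
    def __init__(self, dim, hidden, rng):
        self.down = rng.normal(0.0, 0.02, size=(dim, hidden))
        self.up = rng.normal(0.0, 0.02, size=(hidden, dim))

    def __call__(self, x):
        return np.maximum(x @ self.down, 0.0) @ self.up

def unimodal_expert(adapter):
    # Uses only the tokens of the modality being adapted.
    return lambda x, other: adapter(x)

def cross_modal_expert(adapter):
    # Assumption: inject the other modality as a mean-pooled token.
    return lambda x, other: adapter(x + other.mean(axis=0, keepdims=True))

def mix_experts(x, other, experts, router_w):
    """Weight each expert's output with router scores and add it residually."""
    weights = softmax(x.mean(axis=0) @ router_w)  # one weight per expert
    update = sum(w * expert(x, other) for w, expert in zip(weights, experts))
    return x + update, weights

rng = np.random.default_rng(0)
dim, hidden = 8, 2
audio = rng.normal(size=(5, dim))    # 5 audio tokens
visual = rng.normal(size=(7, dim))   # 7 visual tokens
experts = [
    unimodal_expert(Adapter(dim, hidden, rng)),
    cross_modal_expert(Adapter(dim, hidden, rng)),
]
router_w = rng.normal(0.0, 0.02, size=(dim, len(experts)))
audio_out, weights = mix_experts(audio, visual, experts, router_w)
```

In a parameter-efficient setup only the adapters and the router would be trained while the pre-trained backbone stays frozen; the sketch shows the forward pass only.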
To address the shortcomings of the Laplacian eigenmap (LE) and maximum margin criterion (MMC) methods in feature extraction, a new dimensionality reduction method called Laplacian eigenmap based on improved maximum margin criterion (LE/IMMC) is proposed, with applications to gene expression data classification. LE/IMMC aims to keep similar data points as close to each other as possible while simultaneously maximizing the margins between different pattern classes. By introducing IMMC into the cost function of LE, the proposed LE/IMMC retains LE's preservation of local neighborhood relationships. Meanwhile, it emphasizes discriminative information through IMMC, which maximizes the between-class scatter and minimizes the within-class scatter. Gene expression data classification experiments on four public datasets demonstrate that our method is effective for feature extraction.
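For readers unfamiliar with the scatter terms, the sketch below computes the textbook between-class and within-class scatter matrices and the plain MMC score tr(S_b - S_w). The abstract does not describe how IMMC modifies this criterion or how it enters the LE cost function, so none of that is reproduced here; the toy data and the function names are assumptions for illustration.

```python
import numpy as np

def scatter_matrices(X, y):
    """Between-class (S_b) and within-class (S_w) scatter, unnormalized."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    mean_all = X.mean(axis=0)
    d = X.shape[1]
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        mean_c = Xc.mean(axis=0)
        gap = (mean_c - mean_all)[:, None]
        S_b += len(Xc) * (gap @ gap.T)   # spread of class means
        centered = Xc - mean_c
        S_w += centered.T @ centered     # spread inside each class
    return S_b, S_w

def mmc_score(X, y):
    """Plain maximum margin criterion: larger means better class separation."""
    S_b, S_w = scatter_matrices(X, y)
    return np.trace(S_b - S_w)

# Toy data: two tight, well-separated classes.
X = np.array([[0.0, 0.1], [0.2, 0.0], [3.0, 3.1], [3.2, 2.9]])
y = np.array([0, 0, 1, 1])
S_b, S_w = scatter_matrices(X, y)
score = mmc_score(X, y)
```

With these definitions the total scatter decomposes as S_t = S_b + S_w, which is a convenient sanity check on any implementation.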
The study of Tibetan function words is indispensable foundational work in Tibetan natural language processing and has wide practical application value. It is the core of Tibetan information processing and the b...
This paper investigates how to take full advantage of the temporal and spatial information in videos with minimal computational cost in the semi-supervised video object segmentation (VOS) task. Current state-of-the-...
Heterogeneous domain adaptation seeks to learn an effective classifier or regression model for unlabeled target samples by using the well-labeled source samples but residing in different feature spaces and lying diffe...
The goal of programming-language lab experiments is to polish students' skills in solving problems with programming languages. Programming contests are competitions in which problems are solved by programming. A programm...