Since most communications in vehicular networks rely on wireless transmissions, information broadcast by vehicles is highly susceptible to interception and eavesdropping by third parties, making it vulnerable to vario...
详细信息
The rapid growth of machine learning(ML)across fields has intensified the challenge of selecting the right algorithm for specific tasks,known as the Algorithm Selection Problem(ASP).Traditional trial-and-error methods...
详细信息
The rapid growth of machine learning(ML)across fields has intensified the challenge of selecting the right algorithm for specific tasks,known as the Algorithm Selection Problem(ASP).Traditional trial-and-error methods have become impractical due to their resource *** Machine Learning(AutoML)systems automate this process,but often neglect the group structures and sparsity in meta-features,leading to inefficiencies in algorithm recommendations for classification *** paper proposes a meta-learning approach using Multivariate Sparse Group Lasso(MSGL)to address these *** method models both within-group and across-group sparsity among meta-features to manage high-dimensional data and reduce multicollinearity across eight meta-feature *** Fast Iterative Shrinkage-Thresholding Algorithm(FISTA)with adaptive restart efficiently solves the non-smooth optimization *** validation on 145 classification datasets with 17 classification algorithms shows that our meta-learning method outperforms four state-of-the-art approaches,achieving 77.18%classification accuracy,86.07%recommendation accuracy and 88.83%normalized discounted cumulative gain.
Phase shifting transformers (PSTs) are a cost-efficient solution for controlling power flow without additional operating expenses. They can enhance power distribution in transmission systems and increase system capaci...
详细信息
The widespread adoption of digital communication technologies across various domains has led to a significant shift towards digitalization. However, this evolution has also introduced vulnerabilities, including unauth...
详细信息
Machine Learning (ML) promises to enhance the efficacy of Android Malware Detection (AMD);however, ML models are vulnerable to realistic evasion attacks - crafting realizable Adversarial Examples (AEs) that satisfy An...
详细信息
Person re-identification (Re-ID) is a classical computer vision task and has significant applications for public security and information forensics. Recently, long-term Re-ID with clothes-changing has attracted increa...
详细信息
Aiming at the problems of low recognition rate of Chinese dialects and poor robustness in noisy environment, this paper proposes an improved feature extraction algorithm, which combines the Sparrow Search Algorithm (S...
详细信息
This paper introduces a novel approach to the smart management of public EV charging infrastructure, combining day-ahead energy bidding with a dynamic end-user pricing model. It addresses critical challenges such as d...
详细信息
This paper presents a comparison of features derived from various biomedical signals from both younger as well as elderly subjects. The dataset consists of accelerometer, gyroscope and altimeter signals acquired from ...
详细信息
Recent advancements in action recognition leverage both skeleton and video modalities to achieve state-of-the-art performance. However, due to the challenges of early fusion, which tends to underutilize the strengths ...
详细信息
ISBN:
(数字)9798331510831
ISBN:
(纸本)9798331510848
Recent advancements in action recognition leverage both skeleton and video modalities to achieve state-of-the-art performance. However, due to the challenges of early fusion, which tends to underutilize the strengths of each modality, existing methods often resort to late fusion, consequently leading to more complex designs. Additionally, self-supervised learning approaches utilizing both modalities remain underexplored. In this paper, we introduce SV-data2vec, a novel self-supervised framework for learning from skeleton and video data. Our approach employs a student-teacher architecture, where the teacher network generates contextualized targets based on skeleton data. The student network per-forms a masked prediction task using both skeleton and visual data. Remarkably, after pretraining with both modalities, our method allows for fine-tuning with RGB data alone, achieving results on par with multimodal approaches by effectively learning video representations through skeleton data guidance. Extensive experiments on benchmark datasets NTU RGB+D 60, NTU RGB+D 120, and Toyota Smarthome confirm that our method outperforms existing RGB based state-of-the-art techniques. The code is available at github. com/zoranadozdor/SVdata2vec.
暂无评论