In this paper, to address the optimization problem on a compact matrix manifold, we introduce a novel algorithmic framework called the Transformed Gradient Projection (TGP) algorithm, using the projection onto this co...
详细信息
In speaker verification, we use computational method to verify if an utterance matches the identity of an enrolled speaker. This task is similar to the manual task of forensic voice comparison, where linguistic analys...
详细信息
Automated Essay Scoring (AES) plays a crucial role in educational assessment by providing scalable and consistent evaluations of writing tasks. However, traditional AES systems face three major challenges: 1 reliance...
详细信息
Multi-task learning has emerged as a significant topic in artificial intelligence research, where a singular network model performs numerous tasks. This approach simultaneously processes multiple related tasks and sha...
详细信息
ISBN:
(数字)9798331531904
ISBN:
(纸本)9798331531911
Multi-task learning has emerged as a significant topic in artificial intelligence research, where a singular network model performs numerous tasks. This approach simultaneously processes multiple related tasks and shared knowledge, enhancing model generalization while increasing efficiency. This methodology provides innovative solutions to complex real-world problems. However, the single-model-based approach for multi-task learning suffers from inter-task interference in practice. Therefore, the exploration of more efficient multi-task learning strategies, aimed at balancing the synergy and conflict among tasks and minimizing the dependence on computational resources, is pivotal for the field's future progress. We propose a strategy that autonomously adjusts both the parameters and structures of the model to alleviate gradient interference in multi-task learning. This method includes integrating a lightweight, weight-adaptive module that enhances the network's ability to process tasks by optimally balancing parameter sharing and isolation. This adaptation enables the model to share common features across tasks while allocating distinct spaces for each task, thereby reducing interference. Our extensive experimental validation indicates that our framework surpasses other multi-task learning approaches, achieving joint optimization of tasks more effectively. This enhancement not only bolsters performance but also maintains an equilibrium between accuracy and inference speed.
UAV gesture recognition, a novel human-computer interaction form, offers an intuitive approach to controlling UAVs in various environments. However, there is a lack of comprehensive datasets for AI-powered UAV gesture...
详细信息
ISBN:
(数字)9798350359312
ISBN:
(纸本)9798350359329
UAV gesture recognition, a novel human-computer interaction form, offers an intuitive approach to controlling UAVs in various environments. However, there is a lack of comprehensive datasets for AI-powered UAV gesture recognition. This paper contributes in several ways: (i) We introduce MD-UHGRD, a unique UAV static gesture dataset with 20, 000 images and annotations, collected from a diverse group of participants in different environmental conditions. This dataset is expected to bridge a significant gap in UAV gesture recognition algorithms. (ii) We propose SA-YOLO, a multifunctional UAV gesture recognition method that not only enables gesture recognition but also includes face and pedestrian tracking, optimizing UAV control in complex scenarios. SA-YOLO incorporates the Spatial Asymptotic Feature Pyramid Network (SAFPN), Scale Pyramid Pooling with Cross Stage Partial Networks Convolution (SPPCSPC), and Space-to-Depth Convolution (SPD-Conv). (iii) Extensive evaluation of SAYOLO on MD-UHGRD establishes it as a benchmark in this domain. Our method demonstrates high accuracy, processing speed, and a compact model size, achieving a 93.2% mean Average Precision (mAP) with 10.3 million parameters and 48 frames per second (FPS). Among competing models, SA-YOLO not only achieves the highest mAP but also maintains a balance in model size and FPS. The database and code are available at: https://***/ijcnn2024/SA-YOLO.
Momentum Contrast (MoCo) achieves great success for unsupervised visual representation learning. However, there are a lot of supervised and semi-supervised datasets, which are already labeled. To fully utilize the lab...
详细信息
As the size of transistors shrinks and power density increases,thermal simulation has become an indispensable part of the device design ***,existing works for advanced technology transistors use simplified empirical m...
详细信息
As the size of transistors shrinks and power density increases,thermal simulation has become an indispensable part of the device design ***,existing works for advanced technology transistors use simplified empirical models to calculate effective thermal conductivity in the *** this work,we present a dataset of size-dependent effective thermal conductivity with electron and phonon properties extracted from ab initio *** in-plane and cross-plane thermal conductivity data of eight semiconducting materials(Si,Ge,GaN,AlN,4H-SiC,GaAs,InAs,BAs)and four metallic materials(Al,W,TiN,Ti)with the characteristic length ranging from 5 nm to 50 nm have been *** the absolute value,normalized effective thermal conductivity is also given,in case it needs to be used with updated bulk thermal conductivity in the future.
A major challenge with the multi-ratio Fractional Program (FP) is that the existing methods for the maximization problem typically do not work for the minimization case. We propose a novel technique called inverse qua...
详细信息
A major challenge with the multi-ratio Fractional Program (FP) is that the existing methods for the maximization problem typically do not work for the minimization case. We propose a novel technique called inverse quadratic transform for the sum-of-ratios minimization problem. Its main idea is to reformulate the min-FP problem in a form amenable to efficient iterative optimization. Furthermore, this transform can be readily extended to a general cost-function-of-multiple-ratios minimization problem. We also give a Majorization-Minimization (MM) interpretation of the inverse quadratic transform, showing that all those desirable properties of MM can be carried over to the new technique. Moreover, we demonstrate the application of inverse quadratic transform in minimizing the Age-of-Information (AoI) of data networks.
The stochastic Gradient Tracking (GT) method for distributed optimization, is known to be robust against the inter-client variance caused by data heterogeneity. However, the stochastic GT method can be communication-i...
The stochastic Gradient Tracking (GT) method for distributed optimization, is known to be robust against the inter-client variance caused by data heterogeneity. However, the stochastic GT method can be communication-intensive, requiring a large number of communication rounds of message exchange for convergence. To address this challenge, this paper proposes a new communication-efficient stochastic GT algorithm called the Local Stochastic GT(LSGT) algorithm, which adopts the local stochastic gradient descent (local SGD) technique in the GT method. With LSGT, each agent can perform multiple SGD updates locally within each communication round. Although it is not known previously whether the stochastic GT method can benefit from the local SGD, we establish the conditions under which our proposed LSGT algorithm enjoys the linear speedup brought by local SGD. Compared with the existing work, our analysis requires less restrictive conditions on the mixing matrix and algorithm stepsize. Moreover, it reveals that the local SGD does not only reserve the resilience of the stochastic GT method against the data heterogeneity but also speeds up reducing the tracking error reduction in the optimization process. The experimental results demonstrate that the proposed LSGT exhibits improved convergence speed and robust performance in various heterogeneous environments.
In recent years, bio-inspired optimization algorithms have attained significant success in addressing complex global optimization issues. Nonetheless, a single bio-inspired search strategy may struggle to handle diver...
In recent years, bio-inspired optimization algorithms have attained significant success in addressing complex global optimization issues. Nonetheless, a single bio-inspired search strategy may struggle to handle diverse and intricate problems. To surmount this constraint, this paper introduces an enhanced hybrid whale algorithm (MEHWOA) based on reverse learning strategy and Lévy flight mechanism improvement. This approach amalgamates the global search capabilities of the gray wolf algorithm with the local search prowess of the whale algorithm, further augmenting the convergence speed and optimization accuracy of MEHWOA by incorporating reverse learning strategy and Lévy flight mechanism. To assess the performance of MEHWOA, we conducted experiments on 23 general benchmark test functions and compared it with original gray wolf (GWO), whale (WOA), particle swarm (PSO), and sparrow search (SSA) optimization algorithms. The experimental outcomes reveal that MEHWOA exhibits faster convergence speed and superior accuracy across various test functions, including unimodal, multimodal, and composite benchmark test functions. These findings corroborate that MEHWOA possesses considerable potential for solving complex global optimization problems.
暂无评论