There is a key problem in the medical visual question answering task that how to effectively realize the feature fusion of language and medical images with limited datasets. In order to better utilize multi-scale info...
详细信息
In this paper, we propose a hierarchical feature modulation (HFM) approach for stable face anti-spoofing in unseen domains and unseen attacks. The conventional multi-domain based generalizable approaches likely lead t...
In this paper, we propose a hierarchical feature modulation (HFM) approach for stable face anti-spoofing in unseen domains and unseen attacks. The conventional multi-domain based generalizable approaches likely lead to local optima due to the complicated or heuristic learning paradigm. Inspired by the fact that high-level semantic disturbances and low-level miscellaneous bias jointly cause the distribution shift, HFM aims to modulate the fine-grained feature in a hierarchical manner. Specifically, we complement the structural feature with patch-wise learnable statistical information, i.e. local difference histogram, to relieve the overfitting on high-level semantics. We further introduce the structural causal model (SCM) with imaging color model to reveal that presenting mediums and capturing devices destroy the liveness-relevant information from the low level. Thus we model this hidden entanglement as a distribution mixture problem and propose the expectation-maximization (EM) based causal intervention to remove these miscellanies. Experimental results on public datasets demonstrate the effectiveness of HFM, especially in out-of-distribution settings.
In this paper, we propose a cross-modality fourier feature (CMFF) method via frequency selection, which learns the rational anatomical structure for targeting medical modality images. Unlike existing works seeking pix...
In this paper, we propose a cross-modality fourier feature (CMFF) method via frequency selection, which learns the rational anatomical structure for targeting medical modality images. Unlike existing works seeking pixel-wise intensity discrepancy likely misleading to bias anatomical structures, our approach strongly holds the medical prior that different modality MRI images should share the same anatomical structure. To achieve this, our method instead learns to convert MRI images to the auxiliary frequency domain. Moreover, we adopt the Shapley value to quantify the contribution of each frequency, with respect to the structure for MRI image pairs of different modalities. Thus our approach learns to refine the anatomical structure generated by the target modality iteratively. Extensive experimental results on the BraTS dataset show that our model surpasses the performance compared to SOTAs.
Previous unsupervised domain adaptation (UDA) methods aim to promote target learning via a single-directional knowledge transfer from label-rich source domain to unlabeled target domain, while its reverse adaption fro...
详细信息
A key challenge to the scalable deployment of the energy self-sustainability(ESS)Internet of Everything(IoE)for sixth-generation(6G)networks is juggling massive connectivity and high spectral efficiency(SE).Cell-free ...
详细信息
A key challenge to the scalable deployment of the energy self-sustainability(ESS)Internet of Everything(IoE)for sixth-generation(6G)networks is juggling massive connectivity and high spectral efficiency(SE).Cell-free massive multiple-input multiple-output(CF mMIMO)is considered as a promising solution,where many wireless access points perform coherent signal processing to jointly serve the ***,massive connectivity and high SE are difficult to obtain at the same time because of the limited pilot *** solve this problem,we propose a new framework for ESS IoE networks where the user activity detection(UAD)and channel estimation are decoupled.A UAD detector based on deep convolutional neural networks,an initial access scheme,and a scalable power control policy are proposed to enable the practical scalable CF mMIMO *** derive novel and exact closed-form expressions of harvested energy and SE with maximum ratio(MR)*** local partial minimum mean-square error and MR combining,simulation results prove that the proposed framework can serve more users,improve the SE performance,and achieve better user fairness for the considered ESS IoE networks.
A Ji-shaped Chinese-character patch antenna is proposed for future communications. The proposed antenna is comprised of two dielectric layers. Ji-shaped patch is etched on the upper surface of the top layer, whereas a...
详细信息
Considering the actual multi-agent coverage process, the motion trajectories are seriously affected by disturbances and noise. In this paper, a cooperative control method based on active disturbance rejection controll...
Considering the actual multi-agent coverage process, the motion trajectories are seriously affected by disturbances and noise. In this paper, a cooperative control method based on active disturbance rejection controller (ADRC) for multiple mecanum-wheeled robots (MWMR) is proposed to improve its robustness. In particular, to suppress the total disturbances during the running of each robot and reduce large computation and observation cost of high order ADRC, a reduced-order cascaded ADRC (RC-ADRC) is proposed. Moreover, distributed controller is proposed to track the expected trajectories of multiple mecanum-wheeled robots under disturbances. Compared with the proposed RC-ADRC based cooperative control scheme and PID based cooperative control one, the favorable motion trajectory tracking performance is obtained. Simulation results verify the superiority of the method.
This paper investigates the enclosing control of hybrid multi-agent systems under directed networks. Firstly, we establish some criteria for continuoustime and discrete-time multi-agent systems to achieve enclosing co...
详细信息
Uncertain demand may lead to inefficient carsharing decisions. Under this case, the supply-demand imbalance problem for a station-based one-way carsharing system would be more challenging, thus complicating the ...
详细信息
Uncertain demand may lead to inefficient carsharing decisions. Under this case, the supply-demand imbalance problem for a station-based one-way carsharing system would be more challenging, thus complicating the vehicle relocation decisions. This work proposes a two-stage stochastic mixed-integer nonlinear programming model, which integrates the long-term and short-term decisions to maximize the profit of the carsharing companies. In the first stage, the tactical decisions of fleet sizing and vehicle initial distribution are optimized before the realization of the uncertain demand. In the second stage, the operational decisions of operator-based and user-based relocation are determined. First, this paper considers that the distribution of uncertain demand can be affected by the operational decisions of user-based relocation incentives. A learning-embedded optimization method is introduced to learn the distribution of demand. In this way, the decision-making optimization model can achieve higher levels of performance under the guidance of the demand uncertainty learning model. Second, an equitable relocation problem, considering the uneven demand unsatisfied levels with two different equity criteria from the aspects of station and OD pair is presented, respectively. Third, the large number of discrete decision variables, nonlinear operational cost function and constraints, and nonlinearities from the endogenous demand uncertainty together constitute the solving challenges of the proposed model. For solving the problem efficiently, a dedicated two-phase solution algorithm is proposed, with a learning-embedded trust-region method adopted to solve the continuous relaxation problem in phase I, and a mixed-integer linear programming guided iterative rounding to obtain the integer solutions of carsharing operations in phase II. The solution algorithm adaptively bridges the learning and optimization via the trust-region method with flexible sample generation. Finally
Few-shot learning with N-way K-shot scheme is an open challenge in machine learning. Many metric-based approaches have been proposed to tackle this problem, e.g., the Matching Networks and CLIP-Adapter. Despite that t...
详细信息
暂无评论