The mobile distributed file system allows users to share files among multiple mobile devices through simple read/write operations. However, it requires the connected remote devices to be online all the time to ensure ...
详细信息
Identifying sentences sharing similar meanings is crucial to speech and text understandings. Although currently popular cross-encoder solutions with pre-trained language models as backbone have achieved remarkable per...
详细信息
Identifying sentences sharing similar meanings is crucial to speech and text understandings. Although currently popular cross-encoder solutions with pre-trained language models as backbone have achieved remarkable performance, they suffer from the lack of the permutation invariance or symmetry that is one of the most important inductive biases to such task. To alleviate this issue, in this research we propose a permutation invariant training framework, in which a symmetry regularization is introduced during training that forces the model to produce the same predictions for input sentence pairs in both forward and backward directions. Empirical studies exhibit improved performance over competitive baselines.
Unsupervised domain adaptation (UDA) is a pivotal form in machine learning to extend the in-domain model to the distinctive target domains where the data distributions differ. Most prior works focus on capturing the i...
详细信息
In crowd evacuation scenarios, it is an effective way to ensure evacuation safety by recognizing real crowd emotions and then taking measures such as emotional infection to reduce crowd panic. However, factors such as...
In crowd evacuation scenarios, it is an effective way to ensure evacuation safety by recognizing real crowd emotions and then taking measures such as emotional infection to reduce crowd panic. However, factors such as distance, exposure, angle, and occlusion can lead to incomplete face expression information collection, thus failing to accurately identify crowd emotions. Therefore, it is still a very challenging problem to recognize individual emotions in the case of incomplete face expression information collection, and thus accurately identify crowd emotions during evacuation. To solve this problem, we propose a spatial-temporal consistency-based crowd emotion recognition method to accurately identify the real emotions of the crowd. First, for video frames that can capture the complete facial expression information, we use the residual network to accurately identify the individual emotion values in each frame. For video frames that cannot capture the complete facial expression information, we propose an individual emotion calculation model based on spatial-temporal consistency to calculate the individual emotion values in each frame. Second, we define the crowd panic level and obtain the real crowd emotion by calculation. Finally, we implement an end-to-end crowd panic emotion recognition system to verify our method. The experimental results show that the method can accurately calculate the crowd panic level, which is important for guiding crowd evacuation.
We propose a new and effective image deblurring network based on deep learning. The motivation of this work is based on traditional algorithms and deep learning which take an easy-to-difficult approach to image deblur...
详细信息
Radiance fields have demonstrated impressive performance in synthesizing novel views from sparse input views, yet prevailing methods suffer from high training costs and slow inference speed. This paper introduces DNGa...
详细信息
Numerous cameras deployed in venues can collect video data, which can be analyzed to help evacuate people in emergency situations. Most surveillance videos can provide data support for pedestrian trajectory prediction...
Numerous cameras deployed in venues can collect video data, which can be analyzed to help evacuate people in emergency situations. Most surveillance videos can provide data support for pedestrian trajectory prediction. However, data-driven prediction method do not consider the impact of personalization on pedestrian trajectories, which results in ignoring individual personalization. How to accurately analyze the differences among pedestrians and yet accurately predict pedestrian trajectories is a challenging problem. To solve the above problems, we propose an singular value decomposition (SVD) based pedestrian trajectory prediction method, which introduces matrix decomposition to pedestrian trajectory prediction for the first time. Thus, This method analyzes the impact of pedestrian personalization on motion trajectories by mining the interaction patterns between pedestrians and environment. Firstly, we propose an information collection method based on environmental semantics, which can extract scene information from historical video data to construct an environmental information matrix. Secondly, we propose an SVD-based individual environmental feature preference method, which uses singular value decomposition methods to mine the data and analyze personalized pedestrian motion patterns to construct a personalized individual preference matrix. Finally, we built an personalized trajectory prediction method to predict pedestrian movement trajectories. The experimental results show that the method can not only analyze the effect of personalization on pedestrian movement but also accurately predict the pedestrian movement trajectory.
Cross-domain recommendation (CDR) offers a promising solution to the data sparsity problem by enabling knowledge transfer between source and target domains. However, many recent CDR models overlook crucial issues such...
ISBN:
(纸本)9798331314385
Cross-domain recommendation (CDR) offers a promising solution to the data sparsity problem by enabling knowledge transfer between source and target domains. However, many recent CDR models overlook crucial issues such as privacy as well as the risk of negative transfer (which negatively impact model performance), especially in multi-domain settings. To address these challenges, we propose FedGCDR, a novel federated graph learning framework that securely and effectively leverages positive knowledge from multiple source domains. First, we design a positive knowledge transfer module that ensures privacy during inter-domain knowledge transmission. This module employs differential privacy-based knowledge extraction combined with a feature mapping mechanism, transforming source domain embeddings from federated graph attention networks into reliable domain knowledge. Second, we design a knowledge activation module to filter out potential harmful or conflicting knowledge from source domains, addressing the issues of negative transfer. This module enhances target domain training by expanding the graph of the target domain to generate reliable domain attentions and fine-tunes the target model for improved negative knowledge filtering and more accurate predictions. We conduct extensive experiments on 16 popular domains of the Amazon dataset, demonstrating that FedGCDR significantly outperforms state-of-the-art methods. We open source the code at https://***/LafinHana/FedGCDR.
Effective target detection and recognition are essential in remote sensing image field, and complex evidence theory (CET) is widely used for this purpose. However, measuring conflict between complex basic belief assig...
Effective target detection and recognition are essential in remote sensing image field, and complex evidence theory (CET) is widely used for this purpose. However, measuring conflict between complex basic belief assignments (CBBAs) in CET is challenging. This study proposes a complex belief Jensen-Shannon divergence based on the complex Pignistic transformation to measure conflict, accounting for quantum interference effects in CBBAs. We analyze the properties of the CBJS divergence, including boundedness, nondegeneracy, and symmetry. Numerical examples are presented to show the effectiveness of the conflict measure method and the potential of improving the robustness of intelligent interpretation system. Moreover, an algorithm for decision-making is presented and applied in pattern recognition illustrating its superiority.
The emerging trend of AR/VR places great demands on 3D content. However, most existing software requires expertise and is difficult for novice users to use. In this paper, we aim to create sketch-based modeling tools ...
详细信息
暂无评论