In label distribution learning, an instance is involved with many labels in different importance degrees, and the feature space of instances is accompanied with thousands of redundant and/or irrelevant features. There...
详细信息
Deep learning-based hyperspectral image (HSI) compression has recently attracted great attention in remote sensing due to the growth of hyperspectral data archives. Most of the existing models achieve either spectral ...
详细信息
Human pose estimation has given rise to a broad spectrum of novel and compelling applications, including action recognition, sports analysis, as well as surveillance. However, accurate video pose estimation remains an...
Semantic segmentation is a basal task and is a typical computer vision problem. Although semantic segmentation is developing rapidly, the speed and accuracy of model segmentation still need to be further improved. For...
Semantic segmentation is a basal task and is a typical computer vision problem. Although semantic segmentation is developing rapidly, the speed and accuracy of model segmentation still need to be further improved. For solve the issue of scale differences between target objects and loss of spatial information in the segmentation task of remote sensing images, by improving the original U-Net3+ network and introducing the attention mechanism, a new network MA-Unet3+ is constructed. In the coding phase, images of unlike scales are fused, and the full-scale connections are pruned, some skip connections are removed, and attention mechanisms are introduced between each layer. The improved model is contrast with some common network models, and the experiment achieves 78.7% average intersection (mIoU) on the Vaihingen dataset, which is 0.8% better than this optimized network U-Net3+, the average category pixel accuracy (MPA) is 92.4%, which is 1.2% better, and the similarity coefficient (Dice) result is 87.3%, which is 0.8% better. 0.8%, it is observed that MA-Unet3+ is precede other algorithms.
Autonomous underwater vehicles (AUVs) are valuable for ocean exploration due to their flexibility and ability to carry communication and detection units. Nevertheless, AUVs alone often face challenges in harsh and ext...
详细信息
We target cross-domain face reenactment in this paper, i.e., driving a cartoon image with the video of a real person and vice versa. Recently, many works have focused on one-shot talking face generation to drive a por...
We target cross-domain face reenactment in this paper, i.e., driving a cartoon image with the video of a real person and vice versa. Recently, many works have focused on one-shot talking face generation to drive a portrait with a real video, i.e., within-domain reenactment. Straightforwardly applying those methods to cross-domain animation will cause inaccurate expression transfer, blur effects, and even apparent artifacts due to the domain shift between cartoon and real faces. Only a few works attempt to settle cross-domain face reenactment. The most related work AnimeCeleb [13] requires constructing a dataset with pose vector and cartoon image pairs by animating 3D characters, which makes it inapplicable anymore if no paired data is available. In this paper, we propose a novel method for cross-domain reenactment without paired data. Specifically, we propose a transformer-based framework to align the motions from different domains into a common latent space where motion transfer is conducted via latent code addition. Two domain-specific motion encoders and two learnable motion base memories are used to capture domain properties. A source query transformer and a driving one are exploited to project domain-specific motion to the canonical space. The edited motion is projected back to the domain of the source with a transformer. Moreover, since no paired data is provided, we propose a novel cross-domain training scheme using data from two domains with the designed analogy constraint. Besides, we contribute a cartoon dataset in Disney style. Extensive evaluations demonstrate the superiority of our method over competing methods.
Traffic prediction is pivotal in intelligent transportation systems. Existing works mainly focus on improving the overall accuracy, overlooking a crucial problem of whether prediction results will lead to biased decis...
详细信息
Traffic prediction is pivotal in intelligent transportation systems. Existing works mainly focus on improving the overall accuracy, overlooking a crucial problem of whether prediction results will lead to biased decisions by transportation authorities. In practice, the uneven deployment of traffic sensors in different urban areas produces imbalanced data, making the traffic prediction model fail in some areas and leading to unfair regional decision-making that eventually severely affects equity and quality of residents' life. Additionally, existing fairness machine learning models fail to preserve fair traffic prediction for a prolonged time. Although they can achieve fairness at certain time points, such static fairness will be broken as the traffic conditions change. To fill this research gap, we investigate prolonged fair traffic prediction, introduce two novel fairness definitions tailored to dynamic traffic scenarios, and propose a prolonged fairness traffic prediction framework, namely FairTP. We argue that fairness in traffic scenarios changes dynamically over time and across areas. Each traffic sensor or city area has state that alternates between "sacrifice" and "benefit" based on its prediction accuracy (high accuracy indicates "benefit" state). Prolonged fairness is achieved when the overall states of sensors similar within a given ***, we first define region-based static fairness and sensor-based dynamic fairness. Next, we designed a state identification module in FairTP to discriminate between states of "sacrifice" or "benefit" to enable prolonged fairness-aware traffic predictions. Lastly, a state-guided balanced sampling strategy is designed to select training examples to promote prediction fairness further, mitigating the performance disparities among regions with imbalanced traffic sensors. Extensive experiments in two real-world datasets show that FairTP significantly improves prediction fairness without causing much accuracy degrada
The Metropolis algorithm is one of the Markov chain Monte Carlo (MCMC) methods that realize sampling from the target probability distribution. In this paper, we are concerned with the sampling from the distribution in...
详细信息
Recently, large language models (LLMs) have significantly improved the performance of text-to-SQL systems. Nevertheless, many state-of-the-art (SOTA) approaches have overlooked the critical aspect of system robustness...
详细信息
The stationary distribution of a continuous-time Markov chain is generally a complicated function of its transition rates. However, we show that if the transition rates are i.i.d. random variables with a common distri...
详细信息
暂无评论