Probabilistic linguistic term sets (PLTSs) provide a flexible tool to express linguistic preferences, and several multi-criteria decision models based on PLTSs have been recently developed. In this framework, distorti...
详细信息
Many works show that node-level predictions of Graph Neural Networks (GNNs) are unrobust to small, often termed adversarial, changes to the graph structure. However, because manual inspection of a graph is difficult, ...
详细信息
Facial beauty analysis is an important topic in human *** may be used as a guidance for face beautification applications such as cosmetic *** neural networks(DNNs)have recently been adopted for facial beauty analysis ...
详细信息
Facial beauty analysis is an important topic in human *** may be used as a guidance for face beautification applications such as cosmetic *** neural networks(DNNs)have recently been adopted for facial beauty analysis and have achieved remarkable ***,most existing DNN-based models regard facial beauty analysis as a normal classification *** ignore important prior knowledge in traditional machine learning models which illustrate the significant contribution of the geometric features in facial beauty *** be specific,landmarks of the whole face and facial organs are introduced to extract geometric features to make the *** by this,we introduce a novel dual-branch network for facial beauty analysis:one branch takes the Swin Transformer as the backbone to model the full face and global patterns,and another branch focuses on the masked facial organs with the residual network to model the local patterns of certain facial ***,the designed multi-scale feature fusion module can further facilitate our network to learn complementary semantic information between the two *** model optimisation,we propose a hybrid loss function,where especially geometric regulation is introduced by regressing the facial landmarks and it can force the extracted features to convey facial geometric *** performed on the SCUT-FBP5500 dataset and the SCUT-FBP dataset demonstrate that our model outperforms the state-of-the-art convolutional neural networks models,which proves the effectiveness of the proposed geometric regularisation and dual-branch structure with the hybrid *** the best of our knowledge,this is the first study to introduce a Vision Transformer into the facial beauty analysis task.
Satellite image classification is the most significant remote sensing method for computerized analysis and pattern detection of satellite data. This method relies on the image's diversity structures and necessitat...
详细信息
Recently, it has been shown that neural networks not only approximate the ground-state wave functions of a single molecular system well but can also generalize to multiple geometries. While such generalization signifi...
详细信息
In this paper, I provide a model for language identification that makes use of neural networks. The approach is intended to distinguish between Indian regional and dialectal languages. Individuals of Indian descent ar...
详细信息
There is a growing interest in expanding the input capacity of language models (LMs) across various domains. However, simply increasing the context window does not guarantee robust performance across diverse long-inpu...
详细信息
The application of contrastive learning (CL) to collaborative filtering (CF) in recommender systems has achieved remarkable success. CL-based recommendation models mainly focus on creating multiple augmented views by ...
详细信息
Image captioning,the task of generating descriptive sentences for images,has advanced significantly with the integration of semantic ***,traditional models still rely on static visual features that do not evolve with ...
详细信息
Image captioning,the task of generating descriptive sentences for images,has advanced significantly with the integration of semantic ***,traditional models still rely on static visual features that do not evolve with the changing linguistic context,which can hinder the ability to form meaningful connections between the image and the generated *** limitation often leads to captions that are less accurate or *** this paper,we propose a novel approach to enhance image captioning by introducing dynamic interactions where visual features continuously adapt to the evolving linguistic *** model strengthens the alignment between visual and linguistic elements,resulting in more coherent and contextually appropriate ***,we introduce two innovative modules:the Visual Weighting Module(VWM)and the Enhanced Features Attention Module(EFAM).The VWM adjusts visual features using partial attention,enabling dynamic reweighting of the visual inputs,while the EFAM further refines these features to improve their relevance to the generated *** continuously adjusting visual features in response to the linguistic context,our model bridges the gap between static visual features and dynamic language *** demonstrate the effectiveness of our approach through experiments on the MS-COCO dataset,where our method outperforms state-of-the-art techniques in terms of caption quality and contextual *** results show that dynamic visual-linguistic alignment significantly enhances image captioning performance.
Extracting valuable information from a large corpus of unstructured data poses a formidable challenge in many applications. Transformer-based architectures e.g. BERT, employing transfer learning techniques, have exhib...
详细信息
暂无评论