image background removal is a crucial technique for enhancing the visual impact of images or altering their composition, finding applications in various fields such as photography and computervision. This process can...
详细信息
Creating images through generative AI technology is still challenging because it requires specifying detailed layouts and contents of objects. This paper introduces a layout generation method that uses Transformer-bas...
详细信息
The fusion of infrared and visible images is hard due to their different modalities. Different from existing methods using the integer-order gradient, we design an optimization model to fuse infrared and visible image...
详细信息
Communicating with communication impaired people is a challenging task for doctors and caretakers and hence computer based automated assistant systems are important to identify their needs in real-time. An efficient a...
详细信息
The objective of this paper is to present a neuro-symbolic AI based technique to represent field-medicine knowledge, referred as to TON-ViT. TON-ViT integrates a Deep Learning Model with an explicit symbolic manipulat...
详细信息
ISBN:
(纸本)9783031485923;9783031485930
The objective of this paper is to present a neuro-symbolic AI based technique to represent field-medicine knowledge, referred as to TON-ViT. TON-ViT integrates a Deep Learning Model with an explicit symbolic manipulation, a task graph. This task graph describes the steps of each trauma resuscitation as denoted by a verb and noun pair. Through this representation, symbolic processing and manipulation on task graphs, we can find stereotypical procedures, regardless of style of the performer. Furthermore, we can use this technique to find differences in styles, errors, shortcuts and generate procedures never seen before. When used in combination with a transformer, it can help recognize actions in egocentric vision datasets. Last, through symbolic manipulations on the graph, it is possible to generate medical knowledge which the model has not seen before. We present preliminary results after testing the TON-ViT with the Trauma Thompson Dataset.
This paper introduces our solution for Track 2 in AI City Challenge 2023. The task is tracked-vehicle retrieval by natural language descriptions with a real-world dataset of various scenarios and cameras. Our solution...
详细信息
Most of the traditional destriping work is based on matrix domain (band-by-band) processing hyperspectral images, although satisfactory results can be obtained, some important prior conditions are still ignored. In fa...
详细信息
The 1/2 folded image refers to folding the original image at the halfway point along the horizontal or vertical direction, creating an image with partial overlap. The purpose of restoring 1/2 folded images is to repai...
详细信息
Recent studies demonstrated that the training process of deep neural networks (DNNs) is vulnerable to backdoor attacks if third-party training resources (e.g., samples) are adopted. Specifically, the adversaries inten...
详细信息
During recent decades, facial expression recognition is a hot area of research in deep learning and computervision. However, numerous research has been done on emotion recognition through facial expression using deep...
详细信息
暂无评论