The world population has increased many folds in the last few years and crossed the figure of 7 billion. Africa has the highest population growth rate. Food and water are the first and foremost necessities for the sur...
详细信息
Rising number of crime rate using firearms (such as open firing, robbery, suicides, mass shootings, homicides, threatening at gun point, etc.), has underscored the growing importance of timely detection of weapons. Bo...
详细信息
Virtual reality (VR) has become one of the most emerging technologies of this decade. Researchers can now exchange and study data like never before with the help of virtual reality-based tools. It’s a key pipeline st...
详细信息
The enhancement of the transportation database through updated road networks is a critical aspect with a wide range of applications. Traditional methods rely on onsite measurements or human interpretation of remote se...
详细信息
IoT will become an integral part of everyone's life in making the world more intelligent and smarter. In an IoT ecosystem, objects collect and share information in order to communicate with one another. The object...
详细信息
The integration of data-driven initiatives and technology has received substantial interest in the educational domain. Capstone projects significantly influence the career trajectories of students. Rather than merely ...
详细信息
Image captioning is an emerging field in machine *** refers to the ability to automatically generate a syntactically and semantically meaningful sentence that describes the content of an *** captioning requires a comp...
详细信息
Image captioning is an emerging field in machine *** refers to the ability to automatically generate a syntactically and semantically meaningful sentence that describes the content of an *** captioning requires a complex machine learning process as it involves two sub models:a vision sub-model for extracting object features and a language sub-model that use the extracted features to generate meaningful ***-based vision transformers models have a great impact in vision field *** this paper,we studied the effect of using the vision transformers on the image captioning process by evaluating the use of four different vision transformer models for the vision sub-models of the image captioning The first vision transformers used is DINO(self-distillation with no labels).The second is PVT(Pyramid Vision Transformer)which is a vision transformer that is not using convolutional *** third is XCIT(cross-Covariance Image Transformer)which changes the operation in self-attention by focusing on feature dimension instead of token *** last one is SWIN(Shifted windows),it is a vision transformer which,unlike the other transformers,uses shifted-window in splitting the *** a deeper evaluation,the four mentioned vision transformers have been tested with their different versions and different configuration,we evaluate the use of DINO model with five different backbones,PVT with two versions:PVT_v1and PVT_v2,one model of XCIT,SWIN *** results show the high effectiveness of using SWIN-transformer within the proposed image captioning model with regard to the other models.
Generative adversarial networks (GANs) have gained popularity for their ability to synthesize images from random inputs in deep learning models. One of the notable applications of this technology is the creation of re...
详细信息
Deep reinforcement learning agents are vulnerable to adversarial attacks. In particular, recent studies have shown that attacking a few key steps can effectively decrease the agent’s cumulative reward. However, all e...
详细信息
Parkinson’s disease (PD) is one of the most fatal and progressive nervous system illnesses affecting movement. It is a common neurological illness that causes disabilities, shortens people’s lives, and has no treatm...
详细信息
暂无评论