Vehicular edge computing (VEC) allows vehicles to process part of the tasks locally at the network edge while offloading the rest of the tasks to a centralized cloud server for processing. A massive volume of tasks ge...
详细信息
Image captioning is an emerging field in machine *** refers to the ability to automatically generate a syntactically and semantically meaningful sentence that describes the content of an *** captioning requires a comp...
详细信息
Image captioning is an emerging field in machine *** refers to the ability to automatically generate a syntactically and semantically meaningful sentence that describes the content of an *** captioning requires a complex machine learning process as it involves two sub models:a vision sub-model for extracting object features and a language sub-model that use the extracted features to generate meaningful ***-based vision transformers models have a great impact in vision field *** this paper,we studied the effect of using the vision transformers on the image captioning process by evaluating the use of four different vision transformer models for the vision sub-models of the image captioning The first vision transformers used is DINO(self-distillation with no labels).The second is PVT(Pyramid Vision Transformer)which is a vision transformer that is not using convolutional *** third is XCIT(cross-Covariance Image Transformer)which changes the operation in self-attention by focusing on feature dimension instead of token *** last one is SWIN(Shifted windows),it is a vision transformer which,unlike the other transformers,uses shifted-window in splitting the *** a deeper evaluation,the four mentioned vision transformers have been tested with their different versions and different configuration,we evaluate the use of DINO model with five different backbones,PVT with two versions:PVT_v1and PVT_v2,one model of XCIT,SWIN *** results show the high effectiveness of using SWIN-transformer within the proposed image captioning model with regard to the other models.
In the recent past, cancer is designated as the life threatening disease causing significant deaths across the globe. Countries are putting efforts for early stage cancer detection, and mortality prediction. Machine L...
详细信息
Colorectal cancer is one of the leading causes of cancer death worldwide, so accurate early detection is needed. Endoscopic imaging technology plays a vital role in diagnosis, but the presence of outliers in endoscopi...
详细信息
Since the advent of smartphones, capturing images has become deeply embedded in human behavior, evolving into a fundamental part of daily life. Research into human perception of image quality is crucial as people freq...
详细信息
The goal of this research is to integrate an artificial intelligence framework for predicting Kathakali mudras, a crucial component of the traditional Indian dance style that is renowned for its complex hand and facia...
详细信息
Considering the difficulty of financial time series forecasting in financial aid, much of the current research focuses on leveraging big data analytics in financial services. One modern approach is to utilize "pr...
详细信息
The pharmaceutical industry increasingly values medicinal plants due to their perceived safety and costeffectiveness compared to modern *** the extensive history of medicinal plant usage,various plant parts,including ...
详细信息
The pharmaceutical industry increasingly values medicinal plants due to their perceived safety and costeffectiveness compared to modern *** the extensive history of medicinal plant usage,various plant parts,including flowers,leaves,and roots,have been acknowledged for their healing properties and employed in plant *** images,however,stand out as the preferred and easily accessible source of *** plant identification by plant taxonomists is intricate,time-consuming,and prone to errors,relying heavily on human *** intelligence(AI)techniques offer a solution by automating plant recognition *** study thoroughly examines cutting-edge AI approaches for leaf image-based plant identification,drawing insights from literature across renowned *** paper critically summarizes relevant literature based on AI algorithms,extracted features,and results ***,it analyzes extensively used datasets in automated plant classification *** also offers deep insights into implemented techniques and methods employed for medicinal plant ***,this rigorous review study discusses opportunities and challenges in employing these AI-based ***,in-depth statistical findings and lessons learned from this survey are highlighted with novel research areas with the aim of offering insights to the readers and motivating new research *** review is expected to serve as a foundational resource for future researchers in the field of AI-based identification of medicinal plants.
In this paper, a discrete-time projection neural network with an adaptive step size (DPNN) is proposed for distributed global optimization. The DPNN is proven to be convergent to a Karush-Kuhn-Tucker point. Several DP...
详细信息
Referring video object segmentation (RVOS) aims at segmenting an object in a video with its text description. The core of RVOS lies in the modal alignment between the vision and text. To improve the performance, most ...
详细信息
暂无评论