Image Captioning is an emergent topic of research in the domain of artificial intelligence(AI).It utilizes an integration of computer Vision(CV)and Natural Language Processing(NLP)for generating the image *** use in s...
详细信息
Image Captioning is an emergent topic of research in the domain of artificial intelligence(AI).It utilizes an integration of computer Vision(CV)and Natural Language Processing(NLP)for generating the image *** use in several application areas namely recommendation in editing applications,utilization in virtual assistance,*** development of NLP and deep learning(DL)modelsfind useful to derive a bridge among the visual details and textual *** this view,this paper introduces an Oppositional Harris Hawks Optimization with Deep Learning based Image Captioning(OHHO-DLIC)*** OHHO-DLIC technique involves the design of distinct levels of ***,the feature extraction of the images is carried out by the use of EfficientNet ***,the image captioning is performed by bidirectional long short term memory(BiLSTM)model,comprising encoder as well as *** last,the oppositional Harris Hawks optimization(OHHO)based hyperparameter tuning process is performed for effectively adjusting the hyperparameter of the EfficientNet and BiLSTM *** experimental analysis of the OHHO-DLIC technique is carried out on the Flickr 8k Dataset and a comprehensive comparative analysis highlighted the better performance over the recent approaches.
In this paper, a high step-up DC-DC converter based on a switched-inductor-capacitor-diode (SLCD) cell is proposed. The proposed converter provides a high voltage gain, low voltage stress on the power switches and dio...
详细信息
作者:
Luo, YuewenAnwar, AymanRen, SiyiCoyle, James L.Sejdic, ErvinUniversity of Toronto
Division of Engineering Science Faculty of Applied Science & Engineering TorontoON Canada University of Toronto
Faculty of Applied Science & Engineering Department of Electrical and Computer Engineering TorontoON Canada University of Pittsburgh
School of Health and Rehabilitation Sciences Department of Communication Science and Disorders PittsburghPA United States University of Toronto
North York General Hospital Faculty of Applied Science & Engineering Department of Electrical and Computer Engineering TorontoON Canada
Swallowing is a pivotal physiological function for human sustenance and hydration. Dysfunctions, termed dysphagia, necessitate prompt and precise diagnosis. Videofluoroscopic swallowing studies (VFSS) remain the gold ...
详细信息
Fitting a polynomial to observed data is an ubiquitous task in many signal processing and machine learning tasks, such as interpolation and prediction. In that context, input and output pairs are available and the goa...
详细信息
With the development of Generative AI technologies, video style transfer has become a popular extra challenge of style transfer. Compared to traditional images style transfer tasks, video tasks bring new challenges in...
详细信息
We report a fiber-optic-based ultrafast time-stretch laser detection and ranging (Lidar) sensor with 10 MHz speed and 10 μm accuracy with 30 mm dynamic range for head motion detection under the thermoplastic mask dur...
详细信息
Substantial capital is required to invest in solar power plants, which puts estimation of the payback period accurately at primary concern for stakeholders. In this paper, we proposed a novel method to estimate the pa...
详细信息
Scalability is essential for next-generation blockchain technology to integrate with large mobile networks like Internet of Things (IoT). The IOTA distributed ledger protocol has combined transaction generation and ve...
详细信息
Large-scale pre-training has shown remarkable performance in building open-domain dialogue ***,previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model,ignori...
详细信息
Large-scale pre-training has shown remarkable performance in building open-domain dialogue ***,previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model,ignoring the discussion of some key factors towards a powerful human-like chatbot,especially in Chinese *** this paper,we conduct extensive experiments to investigate these under-explored factors,including data quality control,model architecture designs,training approaches,and decoding *** propose EVA2.0,a large-scale pre-trained open-domain Chinese dialogue model with 2.8 billion parameters,and will make our models and codes publicly *** and human evaluations show that EVA2.0 significantly outperforms other open-source *** also discuss the limitations of this work by presenting some failure cases and pose some future research directions on large-scale Chinese open-domain dialogue systems.
Joint entity and relation extraction aim to achieve named entity recognition and relation extraction in unstructured text. We use the form of triples (subject, relation, object) to describe entity and relation. Joint ...
详细信息
暂无评论