With fuel prices rising and environmental concerns intensifying, the financial burden on consumers and businesses has shown an upward trajectory, prompting a search for sustainable alternatives sources. This study inv...
详细信息
This study investigates the application of machine learning (ML) techniques in predicting Psychological Well-being outcomes, emphasizing the use of ensemble methods like AdaBoost and Random Forest for enhanced accurac...
详细信息
This study introduces a deep learning-based method for classifying brain tumors using a pre-trained VGG19 convolutional neural network (CNN). By leveraging transfer learning, we adapted the VGG19 model with custom ful...
详细信息
In this paper, we study the multi-layered security approaches for cooperative relay networks against passive eavesdropping adversaries. Specifically, we investigated physical layer approaches (i.e., beamformed and art...
详细信息
Natural Language Processing (NLP) is revolutionizing the legal domain, enabling tasks such as predicting legal outcomes, summarizing complex documents, identifying key entities, and assessing bail risks. This survey p...
详细信息
Blind people face many difficulties while engaging with their surroundings. This project is all about creating innovative solutions for the blind people by providing the information in the format of audio. Image-to-Au...
详细信息
Recent media have increasingly shifted towards multimedia formats that simultaneously utilize visual and linguistic information. Research on multimodal AI is actively conducted to analyze large-scale multimodal data e...
详细信息
ISBN:
(纸本)9798400706295
Recent media have increasingly shifted towards multimedia formats that simultaneously utilize visual and linguistic information. Research on multimodal AI is actively conducted to analyze large-scale multimodal data effectively. Multimodal AI fuses the probability and feature values extracted from single modalities by a backbone model, enabling the simultaneous analysis of multimodal information. This allows for discovering new insights that may not be detectable through single-modality analysis. Depending on the data collection environment, multimedia can be classified into one-to-one and one-to-many modality balances. Previous multimodal AI approaches analyze these one-to-many relationships by downsampling or duplicating data to fit a one-to-one relationship. In this paper, we optimize multimedia analysis in one-to-one and one-to-many modality balances based on the local and global context analysis capabilities of multimodal AI and the multimodal analysis characteristics of backbone models. The multimedia analysis system employs late score and feature fusion to independently analyze the local context as the baseline for multimodal AI. In contrast, early and hierarchical feature fusion is utilized for comprehensive global context analysis. The backbone models used include ViT and RoBERTa to analyze the overall structure of multimodal data and BEiT and DeBERTa to analyze structural features. Experimental results show that, in the duplication method, late score and feature fusion, which independently analyze the local context of multimodal data, are 0.56% more accurate and achieve an f1 score that is 0.025 higher. Additionally, BEiT and DeBERTa, which analyze structural features, demonstrate a 0.2% increase in accuracy and a 0.0167 improvement in f1 score. In the downsampling method, early and hierarchical feature fusion, which comprehensively analyzes the global context, outperforms by 1.17% in accuracy and 0.0164 in f1 score. Furthermore, ViT and RoBERTa, which foc
Face recognition technology has dramatically trans-formed the landscape of security, surveillance, and authentication systems, offering a user-friendly and non-invasive biometric solution. However, despite its signifi...
详细信息
In this paper, we have proposed testing the graphical user interface (GUI) applications using the idea of finite state machines and then comparing the results with the model generated using Matlab: which represents th...
详细信息
The global challenge of diabetes demands innovative approaches for early diagnosis and effective management. This paper investigates the integration of advanced dimensionality reduction and feature selection technique...
详细信息
暂无评论