Urbanization has led to increased traffic congestion and air pollution, primarily from vehicle emissions, posing risks to public health and the environment. Existing traffic management systems are inefficient in integ...
The risk of gas leaks has grown significantly as a life-threatening issue in industrial activities, cooking, and heating. This system integrates automatic reaction mechanisms, real-time monitoring capabilities, and ad...
Visual question answering (VQA) is a multimodal task, involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate *** this paper, we propose a VQA system intended to answer yes/no questions about real-world images, in *** support a robust VQA system, we work in two directions: (1) Using deep neural networks to semantically represent the given image and question in a fine-grained manner, namely ResNet-152 and Gated Recurrent Units (GRU). (2) Studying the role of the utilized multimodal bilinear pooling fusion technique in the *** the model complexity and the overall model *** fusion techniques could significantly increase the model complexity, which seriously limits their applicability for VQA *** far, there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no ***, a comparative analysis is conducted between eight bilinear pooling fusion techniques, in terms of their ability to reduce the model complexity and improve the model performance in this case of VQA *** indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance, until reaching the best performance of 89.25%. Further, experiments have proven that the number of answers in the developed VQA system is a critical factor that *** the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model *** Multimodal Local Perception Bilinear Pooling (MLPB) technique has shown the best balance between the model complexity and its performance, for VQA systems designed to answer yes/no questions.
As healthcare services have become increasingly digitized, Electronic Health Records (EHRs) have become widely adopted, providing seamless data exchange among providers. Conventional EHRs, however, are extremely vulne...
This paper analyzes whether Android apps may outsource computational activities to cloud servers. Due to the complexity of mobile apps, shifting computing operations to the cloud has become popular to improve performa...
A stochastic-gradient-based interior-point algorithm for minimizing a continuously differentiable objective function (that may be nonconvex) subject to bound constraints is presented, analyzed, and demonstrated throug...
The digital era has brought a surge in the amount of data generated, increasing the need for data security across individuals, organizations, and governments. Protecting sensitive information from unauthorized access ...
Smartphones contain a vast amount of information about their users, which can be used as evidence in criminal cases. However, the sheer volume of data can make it challenging for forensic investigators to identify and...
Secure file encryption has become increasingly important in various disciplines, including finance, healthcare, and government, where protecting sensitive data is paramount. The proposed framework aims to meet this re...
Anomaly detection for rail running bands, the pattern of wheel–rail contact area, is crucial to analyze composite rail irregularities. This paper presents an all-weather vision-based solution for running-band inspect...