Search Results - Inner Mongolia University Library

Performance *** Comparative Analysis of Multimodal Bilinear Pooling Fusion Approaches for Deep Learning-Based Visual Arabic-Question Answering Systems

Computer Modeling in Engineering & Sciences, 2025, Vol. 143, No. 4, pp. 373-411

Authors: Sarah M. Kamel, Mai A. Fadel, Lamiaa Elrefaei, Shimaa I. Hassan. Affiliations: Electrical Engineering Department, Faculty of Engineering at Shoubra, Benha University, Cairo 11629, Egypt; Computer Science Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia; Department of Computer and Systems Engineering, Faculty of Engineering and Technology, Badr University in Cairo (BUC), Cairo 11829, Egypt; Communication Systems Engineering Department, Faculty of Engineering, Benha National University, Obour 11846, Qalyubia, Egypt

Visual question answering (VQA) is a multimodal task, involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate *** this paper, we propose a VQA system intended to answer yes/no questions about real-world images, in *** support a robust VQA system, we work in two directions: (1) Using deep neural networks to semantically represent the given image and question in a fine-grained manner, namely ResNet-152 and Gated Recurrent Units (GRU). (2) Studying the role of the utilized multimodal bilinear pooling fusion technique in the *** the model complexity and the overall model *** fusion techniques could significantly increase the model complexity, which seriously limits their applicability for VQA *** far, there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no ***, a comparative analysis is conducted between eight bilinear pooling fusion techniques, in terms of their ability to reduce the model complexity and improve the model performance in this case of VQA *** indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance, until reaching the best performance of 89.25%. Further, experiments have proven that the number of answers in the developed VQA system is a critical factor that *** the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model *** Multimodal Local Perception Bilinear Pooling (MLPB) technique has shown the best balance between the model complexity and its performance, for VQA systems designed to answer yes/no questions.

Keywords: Arabic-VQA; deep learning-based VQA; deep multimodal information fusion; multimodal representation learning; VQA of yes/no questions; VQA model complexity; VQA model performance; performance-complexity trade-off

Leaving None Behind: Data-Free Domain Incremental Learning for Major Depressive Disorder Detection

IEEE Transactions on Affective Computing, 2025, Vol. 16, No. 2, pp. 758-770

Authors: Chen, Tao; Guo, Yanrong; Hao, Shijie; Hong, Richang. Affiliation: Hefei University of Technology, Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education, School of Computer Science and Information Engineering, Hefei 230009, China

While deep learning techniques have shown promising performance in the Major Depressive Disorder (MDD) detection task, they still face limitations in real-world scenarios. Specifically, given the data scarcity, some efforts have resorted to aggregating data from different domains to expand the data volume. However, their effectiveness is currently limited by the domain gap and data privacy. Additionally, the class imbalance issue is particularly severe in our application, leading to biased classifying performance accordingly. To address these challenges, we propose Data-Free Domain Incremental Learning for the MDD detection (DIL-MDD) task, accommodating multiple feature distributions by only accessing well-trained models from previous domains and the data in the current domain. Specifically, DIL-MDD consists of two key modules: Adaptive Class-tailored Threshold Learning (ACTL) and Data-Free Domain Alignment (DFDA). The first module measures the discrepancy between the outputs of two sequential domains, based on which we learn a class-tailored threshold adaptively. Building on this, we differentiate between samples that either exhibit similarities or dissimilarities with the previous domain, where this similar sample set is identified to investigate the feature distribution of the historical data. The second module imposes an alignment constraint to narrow the gap between these two sample sets, thereby exploring the expertise of the previous domain. To validate the effectiveness of the proposed method, we conduct extensive experiments on the public MDD datasets, i.e., DAIC-WOZ, MODMA, and CMDC. We also apply our method to another mental health condition, Autism Spectrum Disorder (ASD), to further demonstrate its applicability. Finally, the ablation studies validate the superiority of the proposed modules. © 2024 IEEE.

Keywords: Adversarial machine learning

Bidirectional Legendre memory unit: bidirectional memory for person authentication combining voice and online signature

Neural Computing and Applications, 2025, Vol. 37, No. 3, pp. 1541-1563

Authors: Kumar, Rohitesh; Ghosh, Rajib. Affiliation: Department of Computer Science and Engineering, National Institute of Technology Patna, Ashok Rajpath, Patna 800005, India

With the increasing popularity of smart portable electronic gadgets, voice-based online person verification systems have become prevalent. However, these systems are susceptible to attacks where illegitimate individuals exploit the recorded voices of legitimate users, leading to false confirmations—spoofing attacks. To overcome this limitation, this article presents an innovative solution by combining speech and online handwritten signatures to mitigate the risks associated with spoofing attacks in voice-based authentication systems because a person has to be present in front of the system to produce an online handwritten signature. To accomplish this objective, this work proposes a novel bidirectional Legendre memory unit (BLMU), a type of recurrent neural network (RNN), for person authentication (verification) and recognition. The Legendre memory unit (LMU) is an innovative memory cell for RNNs that efficiently retains temporal/non-temporal sequential information over a long period with minimal resources. It achieves information orthogonalization by solving coupled ordinary differential equations (ODEs) and leveraging Legendre polynomials, ensuring effective data representation. The proposed framework for person authentication and recognition comprises seven convolution layers, four BLMU layers, two dense layers, and one output layer. The performance of the proposed BLMU-based deep learning framework has been evaluated on a self-generated/private dataset of combined feature matrix of voice signals and online handwritten signatures in the Devanagari script. To assess performance, experiments have also been conducted using various RNN architectures, such as LSTM, BLSTM, and ordinary differential equation recurrent neural network (ODE-RNN), to have a performance comparison with the proposed BLMU-based deep learning (DL) framework. The results demonstrate the superiority of the proposed BLMU-based DL framework in enhancing the accuracy of person verification systems,

Keywords: Recurrent neural networks

Railway track fault detection using optimised convolution neural network

International Journal of Reliability and Safety, 2024, Vol. 18, No. 2, pp. 163-186

Authors: Chitra, R.; Bamini, A.M. Anusha; Brindha, D.; Jegan, T.M. Chenthil; Kirubakaran, S. Stewart. Affiliations: Department of Computer Science and Engineering, Karunya Institute of Technology and Sciences, Tamil Nadu, Coimbatore, India; Department of Mechanical Engineering, Sri Ranganathar Institute of Engineering and Technology, Tamil Nadu, Coimbatore, India

Railway accidents are an under-scrutinised cause of death in India. Train accidents are caused by various consequences of collisions, derailments, signal errors and so on. Furthermore, when train derailments become disastrous, they can have tremendous repercussions. It is practically difficult to identify the cause of the derailment efficiently within a limited period. In recent years, we have been making progress in reducing derailments, but even if not deadly, identifying faulty tracks can waste a lot of time and money. And doing this error-free is a pressing matter, as tracks always experience wear and tear with more usage. Here is where neural networks can pitch in their solution. We can train a model to look at train tracks and identify any issues. This paper goes into the methodology of achieving this and optimising a neural network to predict problems in the track with the best possible accuracy that images can provide. The objective of this paper is to identify, develop and optimise neural networks to detect faulty tracks. In this work, a good Convolution Neural Network model is developed to identify the crack in the railway track. The developed model produced 95.54% accuracy in fault classification. Copyright © 2024 Inderscience Enterprises Ltd.

Keywords: Derailments

Deep Learning Based Side-Channel Attack Detection for Mobile Devices Security in 5G Networks

Tsinghua Science and Technology (清华大学学报自然科学版（英文版）), 2025, Vol. 30, No. 3, pp. 1012-1026

Authors: Amjed A. Ahmed, Mohammad Kamrul Hasan, Ali Alqahtani, Shayla Islam, Bishwajeet Pandey, Leila Rzayeva, Huda Saleh Abbas, Azana Hafizah Mohd Aman, Nayef Alqahtani. Affiliations: Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Malaysia; Imam Alkadhim University College, Department of Computer Techniques Engineering, Baghdad 10066, Iraq; Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Malaysia; Department of Networks and Communications Engineering, College of Computer Science and Information Systems, Najran University, Najran 61441, Saudi Arabia; Institute of Computer Science and Digital Innovation, UCSI University, Kuala Lumpur 56000, Malaysia; Department of Intelligent Systems and Cyber Security, Astana IT University, Astana 20000, Kazakstan; Department of Computer Science, College of Computer Science and Engineering, Taibah University, Madinah 42353, Saudi Arabia; Department of Electrical Engineering, College of Engineering, King Faisal University, Al-Hofuf 31982, Saudi Arabia

Mobile devices within Fifth Generation (5G) networks, typically equipped with Android systems, serve as a bridge to connect digital gadgets such as global positioning system, mobile devices, and wireless routers, which are vital in facilitating end-user communication ***, the security of Android systems has been challenged by the sensitive data involved, leading to vulnerabilities in mobile devices used in 5G *** vulnerabilities expose mobile devices to cyber-attacks, primarily resulting from security ***-permission apps in Android can exploit these channels to access sensitive information, including user identities, login credentials, and geolocation *** such attack leverages "zero-permission" sensors like accelerometers and gyroscopes, enabling attackers to gather information about the smartphone's *** underscores the importance of fortifying mobile devices against potential future *** research focuses on a new recurrent neural network prediction model, which has proved highly effective for detecting side-channel attacks in mobile devices in 5G *** conducted state-of-the-art comparative studies to validate our experimental *** results demonstrate that even a small amount of training data can accurately recognize 37.5% of previously unseen user-typed ***, our tap detection mechanism achieves a 92% accuracy rate, a crucial factor for text *** findings have significant practical implications, as they reinforce mobile device security in 5G networks, enhancing user privacy, and data protection.

Keywords: Fifth Generation (5G) networks; smartphone information leakage; Side-Channel Attack (SCA); deep learning

Microstructure and mechanical behavior of AXM Mg alloy systems—A review

Journal of Magnesium and Alloys, 2024, Vol. 12, No. 7, pp. 2624-2646

Authors: N. Thanabal, R. Silambarasan, P. Seenuvasaperumal, Dudekula Althaf Basha, A. Elayaperumal. Affiliations: School of Mechanical Engineering, Vellore Institute of Technology, Vellore 632014, India; Department of Metallurgical Engineering and Materials Science, Indian Institute of Technology Indore, Indore 453552, India; Department of Mechanical Engineering, Engineering Design Division, CEG Campus, Anna University, Chennai 600025, India

Automobiles are the inevitable mode of ***, increasing fuel prices and carbon dioxide emissions are posing a serious threat to automobile users and the ***, the development of new lightweight materials has been a key area of ***-based commercial alloys (AZ and ZK series alloys) are the lightest among all structural ***, there is still a question about the replacement of Aluminum-based alloys due to HCP crystal *** this connection, Mg-Al-Ca-Mn (AXM) Mg alloy can be a choice as an alternative to the existing Mg-based commercial alloys for structural *** contains (Al,Mg)₂Ca, Al₂Ca, Mg₂Ca, and Al₈Mn₅ as the secondary phases, contributing to the microstructural refinement and property ***, the formation of those precipitates depends on the amount of Al, Ca, and Mn, especially, the Ca/Al *** addition, the secondary processes influence the grain refinement and property enhancement of texture ***, this review article focuses on elaborating on the significance of the Ca/Al ratio for the precipitate formation, secondary process, and texture *** co-segregation behavior of other micro-alloying elements like Cerium, Lanthanum, and Zinc in AXM Mg alloy systems has also been discussed for property enhancement.

Keywords: AXM Mg alloy; Rolling; Extrusion; Texture

Constrained Networked Predictive Control for Nonlinear Systems Using a High-Order Fully Actuated System Approach

IEEE/CAA Journal of Automatica Sinica, 2025, Vol. 12, No. 2, pp. 478-480

Authors: Yi Huang, Guo-Ping Liu, Yi Yu, Wenshan Hu. Affiliations: the School of Electrical Engineering and Automation, Wuhan University; IEEE; the Center for Control Science and Technology, Southern University of Science and Technology; the Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University

Dear Editor, In this letter, a constrained networked predictive control strategy is proposed for the optimal control problem of complex nonlinear high-order fully actuated (HOFA) systems with noises. The method can effectively deal with nonlinearities, constraints, and noises in the system, optimize the performance metric, and present an upper bound on the stable output of the system.

Deep-CNWO: a deep-chaotic nature whale optimization algorithm for early prediction of blood pressure disorder in smart healthcare settings

Neural Computing and Applications, 2024, Vol. 36, No. 24, pp. 15117-15136

Authors: Motwani, Anand; Shukla, Piyush Kumar; Pawar, Mahesh; Arya, Monika; Jain, Paras. Affiliations: School of Computing Science and Engineering, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, MP, Sehore 466114, India; Department of Computer Science & Engineering, University Institute of Technology RGPV, MP, Bhopal 462033, India; Department of Computer Science & Engineering, CMR Engineering College, Kandlakoya, Medchal, Telangana, Hyderabad 501401, India; Department of Information Technology, University Institute of Technology RGPV, MP, Bhopal 462033, India

The integration of cloud and edge computing, along with machine learning, plays a vital role in the development of efficient healthcare systems in smart cities. However, machine and deep learning (DL) models are prone to delayed convergence and Type-I and Type-II errors due to data vastness and high degree imbalance. To overcome the shortcomings of previous frameworks, this work aims to propose an optimization method with DL, ‘Deep-Chaotic Nature Whale Optimization’ (Deep-CNWO) for early prediction of Blood Pressure disorders among patients under at-home supervision. A simplex search algorithm is integrated to improve the update mechanism of whale optimization algorithm (WOA), thereby creating a CNWO algorithm. The purpose of this hybrid optimization is to increase the accuracy and efficiency of DL models. Leveraging the power of DL and CNWO, this method (Deep-CNWO) provides an effective solution for early detection and proactive management of a chronic disease in at-home healthcare settings. We collected relevant data from clinical studies, including vital signs and patient contextual information, to train and evaluate the deep-CNWO model. The CNWO optimization approach has been used to improve the predictive performance and convergence of DL models. Experiments performed on imbalanced datasets using deep-CNWO have given 99.90% accuracy. The average F-score for emergency cases has improved by 22%, while the average accuracy has increased by 5.72% across all three classes, compared to the results reported in previous related work. Deep-CNWO improves the convergence of DL and reduces Type-I and Type-II errors. The experimental results demonstrate the efficacy of our proposed method for remote patient monitoring and highlight its potential for quick intervention during emergencies. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

Keywords: Optimization

Efficient diabetic retinopathy classification grading using GAN based EM and PCA learning framework

Multimedia Tools and Applications, 2025, Vol. 84, No. 8, pp. 5311-5334

Authors: Sunil, S.S.; Vindhya, A. Shri. Affiliations: Department of CSE, Saveetha School of Engineering, Chennai, India; Department of Computer Science and Engineering, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Tamil Nadu, Chennai 602105, India

Diabetic retinopathy (DR), a type of eye disease, is a danger for diabetics. Manual labour, which is prone to inaccuracy and time consuming, makes dealing with this illness considerably more difficult. Normally computer-assisted diagnosis has appeared as a promising tool for the early identification and severity grading of DR. As technologies are revolutionizing day by day, in which the most advance technology deep learning's algorithm gives a tremendous support for healthcare fields. This article proposes an efficient classification of DR models for categories the DR into different grades and to identify the severity. There are various prediction techniques employed in DR detection. Radial Basics Network, Multilayer Perceptron and Recurrent Neural Network are binary classifiers employed for DR classification. Further the Bag of Visual Words and Convolutional Neural Networks implements for the stages of 3. The performance shows that Convolutional Neural Network perform superior over other methods and attains 98.3%. It is of great significance to apply deep-learning techniques for DR recognition. However, deep-learning algorithms often depend on large amounts of labeled data, which is expensive and time-consuming to obtain in the medical imaging area. In addition, the DR features are inconspicuous and spread out over high-resolution fundus images. Therefore, it is a big challenge to learn the distribution of such DR features. To overcome this, this research work proposes a multichannel-based generative adversarial network (M-GAN) for data augmentation as well as classification to grade DR. The usefulness and effectiveness of GAN for classification of fundus images are explored for the first *** medical data is also a tedious and challenging one because it is quite expensive and confidential, to overcome this proposed model is acts data augmentation model, moreover the features in the input data’s are reduced by Dimensionality reduction Module (DRM) based on Pri

Keywords: Classification (of information)

Fixed-time-synchronized control: a system-dimension-categorized approach

Science China (Information Sciences), 2023, Vol. 66, No. 7, pp. 177-194

Authors: Wanyue JIANG, Shuzhi Sam GE, Dongyu LI. Affiliations: Institute for Future & Shandong Key Laboratory of Industrial Control Technology, School of Automation, Qingdao University; Department of Electrical and Computer Engineering, National University of Singapore; School of Cyber Science and Technology, Beihang University; Shanghai Institute of Satellite Engineering

This study addresses the fixed-time-synchronized control problem of perturbed multi-input multi-output (MIMO) systems. In the task of fixed-time-synchronized control, different dimensions of the output signal in MIMO systems are required to reach the desired value simultaneously within a fixed time *** MIMO system is categorized into two cases: the input-dimension-dominant and the state-dimension-dominant cases. The classification is defined according to the dimension of system signals and, more importantly, the capability of converging at the same time. For each kind of MIMO system, sufficient Lyapunov conditions for fixed-time-synchronized convergence are explored, and the corresponding robust sliding mode controllers are designed. Moreover, perturbations are compensated using the super-twisting technique. The brake control of the vertical takeoff and landing aircraft is considered to verify the proposed method for the input-dimension-dominant case, which shows the essential advantages of decreasing the energy consumption and the output trajectory length. Furthermore, comparative numerical simulations are performed to show the semi-time-synchronized property for the state-dimension-dominant case.

Keywords: fixed-time-synchronized convergence; sliding mode control; perturbed MIMO systems
