检索结果-内蒙古大学图书馆

A Novel CAPTCHA Recognition System Based on Refined Visual Attention

computers, Materials & Continua 2025年第4期83卷 115-136页

作者： Zaid Derea Beiji Zou Xiaoyan Kui Monir Abdullah Alaa Thobhani Amr Abdussalam School of Computer Science and Engineering Central South UniversityChangsha410083China College of Computer Science and Information Technology Wasit UniversityWasit52001Iraq Department of Computer Science and Artificial Intelligence College of Computing and Information TechnologyUniversity of BishaBisha67714Saudi Arabia Electronic Engineering and Information Science Department University of Science and Technology of ChinaHefei230026China

Improving website security to prevent malicious online activities is crucial,and CAPTCHA(Completely Automated Public Turing test to tell computers and Humans Apart)has emerged as a key strategy for distinguishing human users from automated ***-based CAPTCHAs,designed to be easily decipherable by humans yet challenging for machines,are a common form of this ***,advancements in deep learning have facilitated the creation of models adept at recognizing these text-based CAPTCHAs with surprising *** our comprehensive investigation into CAPTCHA recognition,we have tailored the renowned UpDown image captioning model specifically for this *** approach innovatively combines an encoder to extract both global and local features,significantly boosting the model’s capability to identify complex details within CAPTCHA *** the decoding phase,we have adopted a refined attention mechanism,integrating enhanced visual attention with dual layers of Long Short-Term Memory(LSTM)networks to elevate CAPTCHA recognition *** rigorous testing across four varied datasets,including those from Weibo,BoC,Gregwar,and Captcha 0.3,demonstrates the versatility and effectiveness of our *** results not only highlight the efficiency of our approach but also offer profound insights into its applicability across different CAPTCHA types,contributing to a deeper understanding of CAPTCHA recognition technology.

关键词： Text-based CAPTCHA recognition refined visual attention web security computer vision

来源：评论

学校读者我要写书评

暂无评论

Enhancing User Experience in AI-Powered Human-computer Communication with Vocal Emotions Identification Using a Novel Deep Learning Method

引用

computers, Materials & Continua 2025年第2期82卷 2909-2929页

作者： Ahmed Alhussen Arshiya Sajid Ansari Mohammad Sajid Mohammadi Department of Computer Engineering College of Computer and Information SciencesMajmaah UniversityAl-Majmaah11952Saudi Arabia Department of Information Technology College of Computer and Information SciencesMajmaah UniversityAl-Majmaah11952Saudi Arabia Department of Computer Science College of Engineering and Information TechnologyOnaizah CollegesQassim51911Saudi Arabia

Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.

关键词： Human-computer communication(HCC) vocal emotions live vocal artificial intelligence(AI) deep learning(DL) selfish herd optimization-tuned long/short K term memory(SHO-LSTM)

来源：评论

学校读者我要写书评

暂无评论

Adversarial-Learning-Based Taguchi Convolutional Fuzzy Neural Classifier for Images of Lung Cancer

引用

IEEE Access 2024年 12卷 72766-72776页

作者： Lin, Cheng-Jian Lin, Xue-Qian Jhang, Jyun-Yu National Chin-Yi University of Technology Department of Computer Science and Information Engineering Taichung41170 Taiwan National Taichung University of Science and Technology Department of Computer Science and Information Engineering Taichung40401 Taiwan

Deep learning technology has extensive application in the classification and recognition of medical images. However, several challenges persist in such application, such as the need for acquiring large-scale labeled data, configuring network parameters, and handling excessive network parameters. To address these challenges, in this study, we developed an adversarial-learning-based Taguchi convolutional fuzzy neural classifier (AL-TCFNC) for classifying malignant and benign lung tumors displayed in computed tomography images. In the framework of the developed AL-TCFNC, a fuzzy neural classifier replaces a conventional fully connected network, thereby reducing the number of network parameters and the training duration. To reduce experimental cost and training time, the Taguchi method was used. This method helps to identify the optimal combination of model parameters through a small number of experiments. The transfer learning of models across databases often results in subpar performance because of the paucity of labeled samples. To resolve this problem, we used a combination of maximum mean discrepancy and cross-entropy for adversarial learning with the proposed model. Two data sets, namely the SPIE-AAPM Lung CT Challenge data set and LIDC-IDRI Lung Imaging Research data set, were used to validate the AL-TCFNC model. When the AL-TCFNC model was used for transfer learning, it exhibited an accuracy rate of 89.55% and outperformed other deep learning models in terms of classification performance. © 2013 IEEE.

关键词： Fuzzy neural networks

来源：评论

学校读者我要写书评

暂无评论

Leveraging Concise Concepts with Probabilistic Modeling for Interpretable Visual Recognition

引用

IEEE Transactions on Multimedia 2025年 27卷 3117-3131页

作者： Zhang, Yixuan Liu, Chuanbin Liu, Yizhi Gao, Yifan Lu, Zhiying Xie, Hongtao Zhang, Yongdong University of Science and Technology of China School of Information Science and Technology China Hunan University of Science and Technology Department of Computer Science and Engineering China

Interpretable visual recognition is essential for decision-making in high-stakes situations. Recent advancements have automated the construction of interpretable models by leveraging Visual Language Models (VLMs) and Large Language Models (LLMs) with Concept Bottleneck Models (CBMs), which process a bottleneck layer associated with human-understandable concepts. However, existing methods suffer from two main problems: a) the collected concepts from LLMs could be redundant with task-irrelevant descriptions, resulting in an inferior concept space with potential mismatch. b) VLMs directly map the global deterministic image embeddings with fine-grained concepts results in an ambiguous process with imprecise mapping results. To address the above two issues, we propose a novel solution for CBMs with Concise Concept and Probabilistic Modeling (CCPM) that can achieve superior classification performance via high-quality concepts and precise mapping strategy. Fisrt, we leverage in-context examples as category-related clues to guide LLM concept generation process. To mitigate redundancy in the concept space, we propose a Relation-Aware Selection (RAS) module to obtain a concise concept set that is discriminative and relevant based on image-concept and inter-concept relationships. Second, for precise mapping, we employ a Probabilistic Distribution Adapter (PDA) that estimates the inherent ambiguity of the image embeddings of pre-trained VLMs to capture the complex relationships with concepts. Extensive experiments indicate that our model achieves state-of-the-art results with a 5.48% improvement in classification accuracy on eight mainstream recognition benchmarks as well as reliable explainability through interpretable analysis. © 1999-2012 IEEE.

关键词： Decision making

来源：评论

学校读者我要写书评

暂无评论

CRAQL: a novel clustering-based resource allocation using the Q-learning in fog environment

引用

International Journal of Cloud Computing 2024年第3期13卷 243-266页

作者： Ahlawat, Chanchal Krishnamurthi, Rajalakshmi Department of Computer Science and Engineering Jaypee Institute of Information Technology Noida India

Fog computing is an emerging paradigm that provides services near the end-user. The tremendous increase in IoT devices and big data leads to complexity in fog resource allocation. Inefficient resource allocation can lead to resource starvation and unable to complete the task assignment within a specific time. Hence, to enhance the efficiency of the fog resources, it is critical to perform proper resource allocation. This work targets to provide the solution to the resource allocation problem with a novel clustering-based resource allocation using the Q-learning (CRAQL) model. For this purpose, the problem is defined as a decision-making problem and formulated as Markov decision process (MDP). Next, to find the optimal resource, an enhanced optimal resource allocation (EORA) algorithm is proposed and detailed study is performed to analyse the impact of various performance parameters. Simulation results show the comparison of the EORA versus conventional Brute force method by varying the performance parameters such as learning rate and number of trials. The experimental results exhibit optimal solutions with significant improvement in learning rate at an average probability of 0.5 within limited epochs. Copyright © 2024 Inderscience Enterprises Ltd.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Cognitive Transformation in Personal IoT: Pioneering Intelligent Automation

引用

Cyber-Physical Systems 2025年第2期11卷 183-240页

作者： Gulzar, Bisma Ahmad Sofi, Shabir Sholla, Sahil Department of Information Technology National Institute of Technology Srinagar India Department of Computer Science Engineering Islamic University of Science & Technology Srinagar India

In recent years, IoT has transformed personal environments by integrating diverse smart devices. This paper presents an advanced IoT architecture that optimizes network infrastructure, focusing on the adoption of MQTT protocol and introducing Cognitive Smart Objects for managing personal IoT applications. These objects use Neural Networks to predict optimal actions based on user behavior patterns. A Continuous Learning mechanism enables real-time adaptation of the network to evolving user interactions. The study highlights the role of Cognitive Transformation in Personal IoT, driving intelligent automation and enhancing user experience. © 2024 Informa UK Limited, trading as Taylor & Francis Group.

关键词： Internet of Things (IoT) Personal Internet of Things (PIoT) Social Internet of Things (SIoT) IoT architecture network infrastructure

来源：评论

学校读者我要写书评

暂无评论

Contactless and Real-time Hand Gesture Recognition using Inductive Proximity Technique for Wrist-worn Wearables

引用

IEEE Sensors Journal 2025年第11期25卷 20474-20485页

作者： Lin, Pei-Jung Shih, Chi-Huang Weng, Tzu-Hsuan Feng Chia University Department of Information Engineering and Computer Science Taichung407 Taiwan National Chin-Yi University of Technology Department of Computer Science and Information Engineering Taichung411 Taiwan

This study proposes a contactless and real-time hand gesture recognition system suitable for smartwatches. The proposed system adopts inductive proximity sensing to collect Mechanomyography (MMG) signals induced by finger-based gestures above the wrist. After the signal processing and feature extraction stages, machine-learning models can be built for gesture classification. Compared with the Electromyography (EMG)-based method generally deployed in the forearm, inductive MMG applies an electromagnetic field to the body surface and is suitable for use on the wrist. Compared to the existing contact methods, such as EMG and accelerometer-based MMG, the inductive MMG method does not require contact with the body surface to collect muscle action signals and can avoid interference from environmental factors such as sweat and dust. The main contributions of this study are (1) a watch-type prototype with inductive MMG sensing technique, including hardware and firmware;(2) a lightweight and efficient signal processing mechanism that can capture the inductive signal characteristics of gestures for the above-mentioned embedded system (i.e., watch-type prototype);and (3) the tempo-spatial feature extraction method to improve the accuracy of gesture recognition. The experiment results show that for six common machine learning models, the gesture recognition accuracy of the proposed system is above 95%, with a maximum of 97.47%. In the briefing control application for verification purposes, the average accuracy of the inductive gesture recognition system can achieve 98.26%, and the average processing time is 14.37 ms. According to the experimental results, the inductive MMG system can provide a feasible gesture recognition solution for wrist-worn wearable devices. © 2001-2012 IEEE.

关键词： Firmware

来源：评论

学校读者我要写书评

暂无评论

An Integrated Framework with Enhanced Primitives for Post-Quantum Cryptography: HEDT and ECSIDH for Cloud Data Security and Key Exchange

Informatica (Slovenia)

引用

Informatica (Slovenia) 2025年第11期49卷 165-176页

作者： Ilias, Shaik Mohammad Sharmila, V. Ceronmani Durga, V. Sathya Dept. of Computer Science and Engineering Hindustan Institute of Technology and Science Chennai India Department of Information Technology Hindustan Institute of Technology and Science Chennai India Department of Computer Science&Engineering Hindustan Institute of Technology and Science Chennai India

If adversaries were to obtain quantum computers in the future, their massive computing power would likely break existing security schemes. Since security is a continuous process, more substantial security schemes must be developed. Current PQC schemes primarily focus on data security or key exchange, and further improvement towards enhanced PQC primitives is required. Our proposal in this research is an innovative paradigm for PQC-focused cloud data security. The proposed HEDT approach achieves encryption and decryption with significantly lower latency (20% improvement) and higher reliability than AES, DES, and RSA, as demonstrated through experimental results. Furthermore, ECSIDH, a hybrid key exchange mechanism combining SIDH and ECDH, improves security strength by 50% while maintaining computational costs within 1.13x of SIDH. Compared to individual key exchange schemes like SIDH, ECSIDH offers superior security as a PQC candidate. These results confirm the robustness and efficiency of the proposed framework in ensuring secure data outsourcing and key exchange in cloud environments. © 2025 Slovene Society Informatika. All rights reserved.

关键词： Quantum computers

来源：评论

学校读者我要写书评

暂无评论

Decomposition-based hybrid methods employing statistical, machine learning, and deep learning models for crude oil price forecasting

引用

Neural Computing and Applications 2025年 1-46页

作者： Purohit, Sourav Kumar Panigrahi, Sibarama Department of Computer Science and Engineering Sambalpur University Institute of Information Technology Burla India Department of Computer Science and Engineering National Institute of Technology Rourkela Odisha Rourkela769008 India

Crude oil prices (COP) profoundly influence global economic stability, with fluctuations reverberating across various sectors. Accurate forecasting of COP is indispensable for governments, policymakers, and stakeholders to make well-informed decisions and effectively mitigate risks. The decomposition-based hybrid models have been showing promising COP forecasting accuracy than other time series forecasting methods. Despite this fact, no systematic study has been conducted to evaluate the true potential of different decomposition-based hybrid methods employing different forecasting models to forecast the COP. Therefore, a hybrid modeling framework is developed by combining efficient decomposition techniques, namely empirical mode decomposition (EMD), ensemble EMD (EEMD), complete EEMD with adaptive noise (CEEMDAN), and variational mode decomposition (VMD) with seven statistical models, fourteen machine learning (ML) models, and six deep learning (DL) models. Further, a systematic study is conducted on the resulting decomposition-based hybrid models to find the best hybrid model for COP forecasting. Three distinct train-test data splits are employed to ensure a reliable evaluation of the models using four performance metrics. Extensive statistical analysis is conducted to identify the optimal combination of the decomposition technique and forecasting model for precise COP prediction. The results demonstrate that the proposed decomposition-based hybrid model employing VMD and Huber Regression is statistically the best method among all alternatives to forecast monthly COP. The proposed hybrid method VMD-Huber Regression improves the root mean square error (RMSE) by 21% than CEEMDAN-ARIMA, 58.31% than EEMD-Theta, 13.18% than EMD-Random Walk, and 49.44% than VMD-TBATS hybrid methods in 60–40 Train-Test split ratio. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.

关键词： Mean square error

来源：评论

学校读者我要写书评

暂无评论

Artificial Intelligence and Natural Language Processing Inspired Chabot Technologies

引用

Recent Advances in computer science and Communications 2024年第1期17卷 11-20页

作者： Singh, Deepti Manju Jatain, Aman Netaji Subhash Institute of Technology New Delhi India Department of Computer Science and Engineering & Information Technology Jaypee Institute of Information Technology Noida India Department of Computer Science and Engineering Amity University Haryana India

Chatbots use artificial intelligence (AI) and natural language processing (NLP) algorithms to construct a clever system. By copying human connections in the most helpful way possi-ble, chatbots emulate individuals and serve as virtual assistants. They easily interface and respond to customers' requests. In the modern technical environment, these conversation agents or chatbots are considered the next-generation invention. Chatbot has become more popular in the business field right now as it can reduce customer service cost and handle multiple users at a time. There are many techniques used to involve such intelligent experts in daily business. A comprehensive analysis of the methods is needed to determine the viability of the different strategies. This paper tracks the progress of this invention and further clarifies the influence of chatbots on numerous businesses. Besides, a survey of the multiple chatbot methodologies suggested by various researchers is provid-ed. Along with the survey, a chatbot e-commerce customer service is designed to provide an efficient and accurate answer for any query based on the dataset of frequently asked questions. This chatbot can reduce customer service costs and can handle multiple customers at the same time. © 2024 Bentham science Publishers.

关键词： Machine learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：