检索结果-内蒙古大学图书馆

Genetic algorithm based data controlling method using IoT enabled WSNs

Soft Computing 2025年第5期29卷 2465-2482页

作者： Singh, Samayveer Nandan, Aridaman Singh Sikka, Geeta Malik, Aruna Singh, Pradeep Kumar Department of Computer Science and Engineering National Institute of Technology Punjab Jalandhar India Department of Computer Science and Engineering National Institute of Technology Delhi India Department of Computer Science and Engineering Central University of Jammu J&K Jammu India

Internet of Things (IoT) enabled Wireless Sensor Networks (WSNs) is not only constitute an encouraging research domain but also represent a promising industrial trend that permits the development of various IoT-based applications. These applications span a wide range from industry to education, and from military to agriculture. The IoT device plays a significant role in various IoT-based networks, and the functioning of such network depends upon the battery power. Once the devices are deployed in the hostile environments, replacing batteries becomes impractical. Despite a plethora of research addressing this challenge, IoT networks still face issues. In this paper, a genetic algorithm based data monitoring and controlling method using IoT enabled WSNs is proposed by using movable sinks in IoT enabled HWSNs (OptiGeA). The OptiGeA protocol is designed for the election of cluster heads (CHs) by incorporating factors such as density, distance, energy and heterogeneous node capacity into its fitness function. The investigation of OptiGeA is conducted with single sink, multiple static sinks and multiple movable sinks provide an unbiased comparative assessment. The novel deployment technique and multiple mobile sinks approaches are proposed to reduce the transmission distance between the sink and CH during system operation and address hotspot issue. It is evident that the OptiGeA protocol shows an increment of 10.44% compared to the GAOC, whereas with the inclusion of DDC process the OptiGeA-DDC protocol demonstrates a remarkable increase of 48.33% compared to MS-GAOC. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2025.

关键词： Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

Advancing mental health detection in texts via multi-task learning with soft-parameter sharing transformers

引用

Neural Computing and Applications 2025年第5期37卷 3077-3110页

作者： Kodati, Dheeraj Tene, Ramakrishnudu Department of Computer Science and Engineering National Institute of Technology Warangal India Department of Computer Science and Engineering Mahindra University Hyderabad India

In recent years, mental health issues have profoundly impacted individuals’ well-being, necessitating prompt identification and intervention. Existing approaches grapple with the complex nature of mental health, facing challenges like task interference, limited adaptability, and difficulty in capturing nuanced linguistic expressions indicative of various conditions. In response to these challenges, our research presents three novel models employing multi-task learning (MTL) to understand mental health behaviors comprehensively. These models encompass soft-parameter sharing-based long short-term memory with attention mechanism (SPS-LSTM-AM), SPS-based bidirectional gated neural networks with self-head attention mechanism (SPS-BiGRU-SAM), and SPS-based bidirectional neural network with multi-head attention mechanism (SPS-BNN-MHAM). Our models address diverse tasks, including detecting disorders such as bipolar disorder, insomnia, obsessive-compulsive disorder, and panic in psychiatric texts, alongside classifying suicide or non-suicide-related texts on social media as auxiliary tasks. Emotion detection in suicide notes, covering emotions of abuse, blame, and sorrow, serves as the main task. We observe significant performance enhancement in the primary task by incorporating auxiliary tasks. Advanced encoder-building techniques, including auto-regressive-based permutation and enhanced permutation language modeling, are recommended for effectively capturing mental health contexts’ subtleties, semantic nuances, and syntactic structures. We present the shared feature extractor called shared auto-regressive for language modeling (S-ARLM) to capture high-level representations that are useful across tasks. Additionally, we recommend soft-parameter sharing (SPS) subtypes-fully sharing, partial sharing, and independent layer-to minimize tight coupling and enhance adaptability. Our models exhibit outstanding performance across various datasets, achieving accuracies of 96.9%, 97.

关键词： Multi-task learning

来源：评论

学校读者我要写书评

暂无评论

DDSS: Driver decision support system based on the driver behaviour prediction to avoid accidents in intelligent transport system

引用

International Journal of Cognitive Computing in engineering 2024年第1期5卷 1-13页

作者： S, Balasubramani D, John Aravindhar Renjith, P.N. Ramesh, K. Department of Computer Science and Engineering Koneru Lakshmaiah Education Foundation Andhra Pradesh Vaddeswaram India Department of Computer Science and Engineering Hindustan Institute of Technology and Science Chennai India School of Computer Science and Engineering Vellore Institute of Technology Chennai India Department of Computer Science and Engineering Sri Krishna College of Engineering and Technology Coimbatore India

Accidents caused by drivers who exhibit unusual behavior are putting road safety at ever-greater risk. When one or more vehicle nodes behave in this way, it can put other nodes in danger and result in potentially catastrophic accidents. In order to anticipate and handle unusual driving behavior in Intelligent Transportation Systems (ITS), this research presents a unique Driver Decision Support System (DDSS). A reliable driving behavior prediction system is used by the suggested DDSS to categorize drivers as displaying normal or abnormal behavior. In order to prevent accidents in ITS scenarios, the system reliably detects anomalous driving patterns and advises nearby vehicles to change lanes or alter speed. The driver behavior prediction algorithm efficiently groups drivers into behavior categories using the K-Means clustering method. In order to evaluate the algorithm's efficacy, a comparative analysis is conducted by comparing its outcomes against those of Support Vector Machines (SVMs), Decision Trees, K-Nearest Neighbours (KNN), Logistic Regression, and Naïve Bayes. The integration of the Driver Decision Support System into the Intelligent Transportation System infrastructure serves to augment endeavours in accident prevention. Monitoring and analysis of driver behavior enable timely interventions, promoting safer driving practices and reducing accident risks. This research helps to create a more effective transportation system by reducing the number of accidents brought on by reckless driving. Because of its novel method to anticipating and controlling driver behavior, the proposed DDSS has promise for improving road safety and preventing accidents. The efficacy and the dependability of the driver behavior prediction algorithm are confirmed by the experimental assessment. © 2023

关键词： K-means clustering

来源：评论

学校读者我要写书评

暂无评论

OCRBench: on the hidden mystery of OCR in large multimodal models

引用

science China(Information sciences) 2024年第12期67卷 23-35页

作者： Yuliang LIU Zhang LI Mingxin HUANG Biao YANG Wenwen YU Chunyuan LI Xu-Cheng YIN Cheng-Lin LIU Lianwen JIN Xiang BAI School of Artificial Intelligence and Automation Huazhong University of Science and Technology School of Electronic and Information Engineering South China University of Technology Microsoft Research School of Computer & Communication Engineering University of Science and Technology Beijing Institute of Automation Chinese Academy of Sciences School of Software Engineering Huazhong University of Science and Technology

Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In this paper, we conducted a comprehensive evaluation of large multimodal models, such as GPT4V and Gemini, in various text-related visual tasks including text recognition, scene text-centric visual question answering(VQA), document-oriented VQA, key information extraction(KIE), and handwritten mathematical expression recognition(HMER). To facilitate the assessment of optical character recognition(OCR) capabilities in large multimodal models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available. Furthermore, our study reveals both the strengths and weaknesses of these models, particularly in handling multilingual text, handwritten text, non-semantic text, and mathematical expression *** importantly, the baseline results presented in this study could provide a foundational framework for the conception and assessment of innovative strategies targeted at enhancing zero-shot multimodal *** evaluation pipeline and benchmark are available at https://***/Yuliang-Liu/Multimodal OCR.

关键词： large multimodal model OCR text recognition scene text-centric VQA document-oriented VQA key information extraction handwritten mathematical expression recognition

来源：评论

学校读者我要写书评

暂无评论

computer vision-based six layered ConvNeural network to recognize sign language for both numeral and alphabet signs

引用

Biomimetic Intelligence & Robotics 2024年第1期4卷 45-58页

作者： Muhammad Aminur Rahaman Kabiratun Ummi Oyshe Prothoma Khan Chowdhury Tanoy Debnath Anichur Rahman Md.Saikat Islam Khan Department of Computer Science and Engineering Green University of BangladeshDhakaBangladesh Department of Computer Science and Engineering National Institute of Textile Engineering and Research(NITER)Constituent Institute of Dhaka UniversityDhakaBangladesh Department of Computer Science and Engineering Mawlana Bhashani Science and Technology UniversityTangailBangladesh

People who have trouble communicating verbally are often dependent on sign language,which can be difficult for most people to understand,making interaction with them a difficult *** Sign Language Recognition(SLR)system takes an input expression from a hearing or speaking-impaired person and outputs it in the form of text or voice to a normal *** existing study related to the Sign Language Recognition system has some drawbacks,such as a lack of large datasets and datasets with a range of backgrounds,skin tones,and *** research efficiently focuses on Sign Language Recognition to overcome previous *** importantly,we use our proposed Convolutional Neural Network(CNN)model,“ConvNeural”,in order to train our ***,we develop our own datasets,“BdSL_OPSA22_STATIC1”and“BdSL_OPSA22_STATIC2”,both of which have ambiguous backgrounds.“BdSL_OPSA22_STATIC1”and“BdSL_OPSA22_STATIC2”both include images of Bangla characters and numerals,a total of 24,615 and 8437 images,***“ConvNeural”model outperforms the pre-trained models with accuracy of 98.38%for“BdSL_OPSA22_STATIC1”and 92.78%for“BdSL_OPSA22_STATIC2”.For“BdSL_OPSA22_STATIC1”dataset,we get precision,recall,F1-score,sensitivity and specificity of 96%,95%,95%,99.31%,and 95.78%***,in case of“BdSL_OPSA22_STATIC2”dataset,we achieve precision,recall,F1-score,sensitivity and specificity of 90%,88%,88%,100%,and 100%respectively.

关键词： Conv NeuralSign language CNN Static Feature extraction Convolution2D Fully connected layer Dropout

来源：评论

学校读者我要写书评

暂无评论

Modern Machine Learning Solution for Electricity Consumption Management in Smart Buildings

引用

IEEE engineering Management Review 2025年第1期53卷 54-62页

作者： Gautam, Sandeep Kumar Shrivastava, Vinayak Udmale, Sandeep S. Singh, Amit Kumar Singh, Sanjay Kumar Department of Computer Science and Engineering Varanasi221005 India Department of Computer Engineering and Information Technology Mumbai400019 India National Institute of Technology Department of Computer Science and Engineering Patna800005 India

Effective management of electricity consumption (EC) in smart buildings (SBs) is crucial for optimizing operational efficiency, cost savings, and ensuring sustainable resource utilization. Accurate EC prediction enables proactive decision-making, ensuring that resources are allocated efficiently to meet actual demand levels while maintaining occupant comfort. Population growth, building expansion, and technology usage swiftly escalate electricity demand, thus necessitating economical EC management strategies and assist consumers to better understand and strategically plan their EC. To address these challenges, this article proposes a novel approach based on a hybrid prediction model combining temporal convolutional networks (TCNs) and gated recurrent units (GRU). This approach capitalizes on the strengths of both TCN and GRU. TCN is adept at efficiently identifying diverse patterns, particularly within the complex working environments of SBs, by effectively capturing high- and low-frequency information. Subsequently, GRU is leveraged to address the long-term dependencies within the data, enhancing the accuracy of EC prediction. In this article, the results demonstrate the effectiveness of the proposed hybrid model, outperforming competitive methods with an impressive mean absolute error score. This underscores the potential of this approach to improve energy management practices significantly within SB environments, ultimately enhancing both operational efficiency and occupant satisfaction. © 1973-2011 IEEE.

关键词： Forecasting

来源：评论

学校读者我要写书评

暂无评论

Composite spectral spatial pixel CNN for land-use hyperspectral image classification with hybrid activation function

引用

Multimedia Tools and Applications 2025年第12期84卷 10527-10550页

作者： Banerjee, Anasua Swain, Satyajit Rout, Minakhi Bandyopadhyay, Mainak School of Computer Engineering Kalinga Institute of Industrial Technology-Deemed-to-be-University Bhubaneswar India Department of Computer Science and Engineering National Institute of Technology Silchar India

Deep learning methods have played a prominent role in the development of computer visualization in recent years. Hyperspectral imaging (HSI) is a popular analytical technique based on spectroscopy and visible imaging which examines the pattern of light in a target and recognizes objects based on varying spectral properties. However, in remote sensing, detecting surface material via HSI analysis is a critical and difficult task. The performance of spectral-spatial data exploitation is well established to outperform typical spectral pixel-wise techniques. Because of its great feature extraction ability, convolutional neural networks (CNN) have emerged as a potent deep learning approach. CNN translates the input features of an image into an equivalent CNN feature map, in addition to naturally combining spectral and spatial information. However, spectral-spatial properties when combined with pixel-wise extraction can learn more minute details of objects present on the earth’s terrain. In this paper, a noble Composite Spectral Spatial Pixel CNN model for the classification of hyperspectral data is presented which is an amalgamation of 3D-2D-1D CNN. While the 3D and 2D CNN exploit the spectral-spatial features effectively, 1D CNN works on pixel-wise feature extraction. Further, to optimize the classification performance of the proposed model, a new hybrid activation function Flatten-T Swish is also used which is the combination of ReLU and Swish function. The proposed model is compared with other state-of-the-art models based on three popular HSI datasets, and it is found that the proposed model performs better among others in terms of classification and computation time, giving 98.87% accuracy for Indian Pines, 99.92% accuracy for Pavia University, and 99.99% accuracy for Salinas Valley dataset. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Extraction

来源：评论

学校读者我要写书评

暂无评论

A Study on the Explainability of Thyroid Cancer Prediction:SHAP Values and Association-Rule Based Feature Integration Framework

引用

computers, Materials & Continua 2024年第5期79卷 3111-3138页

作者： Sujithra Sankar S.Sathyalakshmi Department of Computer Applications Hindustan Institute of Technology and ScienceChennaiTamil NaduIndia Department of Computer Engineering Hindustan Institute of Technology and ScienceChennaiTamil NaduIndia

In the era of advanced machine learning techniques,the development of accurate predictive models for complex medical conditions,such as thyroid cancer,has shown remarkable *** predictivemodels for thyroid cancer enhance early detection,improve resource allocation,and reduce ***,the widespread adoption of these models in clinical practice demands predictive performance along with interpretability and *** paper proposes a novel association-rule based feature-integratedmachine learning model which shows better classification and prediction accuracy than present *** study also focuses on the application of SHapley Additive exPlanations(SHAP)values as a powerful tool for explaining thyroid cancer prediction *** the proposed method,the association-rule based feature integration framework identifies frequently occurring attribute combinations in the *** original dataset is used in trainingmachine learning models,and further used in generating SHAP values *** the next phase,the dataset is integrated with the dominant feature sets identified through association-rule based *** new integrated dataset is used in re-training the machine learning *** new SHAP values generated from these models help in validating the contributions of feature sets in predicting *** conventional machine learning models lack interpretability,which can hinder their integration into clinical decision-making *** this study,the SHAP values are introduced along with association-rule based feature integration as a comprehensive framework for understanding the contributions of feature sets inmodelling the *** study discusses the importance of reliable predictive models for early diagnosis of thyroid cancer,and a validation framework of *** proposed model shows an accuracy of 93.48%.Performance metrics such as precision,recall,F1-score,and the area un

关键词： Explainable AI machine learning clinical decision support systems thyroid cancer association-rule based framework SHAP values classification and prediction

来源：评论

学校读者我要写书评

暂无评论

An enhanced framework for smart automated evaluations of answer scripts using NLP and deep learning methods

引用

Multimedia Tools and Applications 2025年第11期84卷 8491-8513页

作者： G, Mohanraj R.K, Nadesh M, Marimuthu V, Sathiyapriya School of Computer Science Engineering and Information Systems Vellore Institute of Technology Vellore632014 India School of Computer Science and Engineering Vellore Institute of Technology Chennai600127 India Knowledge Institute of Technology Salem637504 India

The manual process of evaluating answer scripts is strenuous. Evaluators use the answer key to assess the answers in the answer scripts. Advancements in technology and the introduction of new learning paradigms need automation of the evaluation process. This work aims to develop an enhanced novel hybrid framework that can evaluate answer scripts and automatically assign marks for different type of questions based on keywords, grammar, symbols, special keywords, and the given factors. First, the proposed system uses Optical Character Recognition (OCR) to convert image answer scripts into an editable text format. Second, the sentence transformers, the Natural Language Processing (NLP) technique flips the answer script and answers key texts into word embedding vectors. To find similarity measures, these vectors are matched using BERT encoding, spearmanś rank-order correlation, and fuzzy search. At last, the proposed model is trained using Deep Columnar Convolutional Neural Network (DCCNN) in the third step with MINST and Kaggle handwritten mathematical symbols and tested with the segmented mathematical equations to find the similarity. The performance of proposed model is measured using precision, recall, accuracy, and F1-score, and its gives highest accuracy of 93% and 95% when compared to the existing methodologies. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Optical character recognition

来源：评论

学校读者我要写书评

暂无评论

Mixed-decomposed convolutional network:A lightweight yet efficient convolutional neural network for ocular disease recognition

引用

CAAI Transactions on Intelligence technology 2024年第2期9卷 319-332页

作者： Xiaoqing Zhang Xiao Wu Zunjie Xiao Lingxi Hu Zhongxi Qiu Qingyang Sun Risa Higashita Jiang Liu Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering Southern University of Science and TechnologyShenzhenChina Tomey Corporation NagoyaJapan Guangdong Provincial Key Laboratory of Brain‐inspired Intelligent Computation Department of Computer Science and EngineeringSouthern University of Science and TechnologyShenzhenChina Singapore Eye Research Institute SingaporeSingapore

Eye health has become a global health concern and attracted broad *** the years,researchers have proposed many state-of-the-art convolutional neural networks(CNNs)to assist ophthalmologists in diagnosing ocular diseases efficiently and ***,most existing methods were dedicated to constructing sophisticated CNNs,inevitably ignoring the trade-off between performance and model *** alleviate this paradox,this paper proposes a lightweight yet efficient network architecture,mixeddecomposed convolutional network(MDNet),to recognise ocular *** MDNet,we introduce a novel mixed-decomposed depthwise convolution method,which takes advantage of depthwise convolution and depthwise dilated convolution operations to capture low-resolution and high-resolution patterns by using fewer computations and fewer *** conduct extensive experiments on the clinical anterior segment optical coherence tomography(AS-OCT),LAG,University of California San Diego,and CIFAR-100 *** results show our MDNet achieves a better trade-off between the performance and model complexity than efficient CNNs including MobileNets and ***,our MDNet outperforms MobileNets by 2.5%of accuracy by using 22%fewer parameters and 30%fewer computations on the AS-OCT dataset.

关键词： artificial intelligence deep learning deep neural networks image analysis image classification medical applications medical image processing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：