检索结果-内蒙古大学图书馆

GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

computers, Materials & Continua 2024年第8期80卷 2963-2978页

作者： Yunxiang Liu Haili Ma Jianlin Zhu Qiangbo Zhang School of Computer Science and Information Engineering Shanghai Institute of TechnologyShanghai201418China

To enhance the efficiency and accuracy of environmental perception for autonomous vehicles,we propose GDMNet,a unified multi-task perception network for autonomous driving,capable of performing drivable area segmentation,lane detection,and traffic object ***,in the encoding stage,features are extracted,and Generalized Efficient Layer Aggregation Network(GELAN)is utilized to enhance feature extraction and gradient ***,in the decoding stage,specialized detection heads are designed;the drivable area segmentation head employs DySample to expand feature maps,the lane detection head merges early-stage features and processes the output through the Focal Modulation Network(FMN).Lastly,the Minimum Point Distance IoU(MPDIoU)loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes,facilitating model training *** results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union(mIoU)of 92.2%,lane detection accuracy and intersection over union(IoU)of 75.3%and 26.4%,respectively,and traffic object detection recall and mAP of 89.7%and 78.2%,*** detection performance surpasses that of other single-task or multi-task algorithm models.

关键词： Autonomous driving multitask learning drivable area segmentation lane detection vehicle detection

来源：评论

学校读者我要写书评

暂无评论

Limits of Depth: Over-Smoothing and Over-Squashing in GNNs

引用

Big Data Mining and Analytics 2024年第1期7卷 205-216页

作者： Aafaq Mohi ud din Shaima Qureshi Department of Computer Science and Engineering National Institute of Technology SrinagarSrinagar 190006India

Graph Neural Networks(GNNs)have become a widely used tool for learning and analyzing data on graph structures,largely due to their ability to preserve graph structure and properties via graph representation ***,the effect of depth on the performance of GNNs,particularly isotropic and anisotropic models,remains an active area of *** study presents a comprehensive exploration of the impact of depth on GNNs,with a focus on the phenomena of over-smoothing and the bottleneck effect in deep graph neural *** research investigates the tradeoff between depth and performance,revealing that increasing depth can lead to over-smoothing and a decrease in performance due to the bottleneck *** also examine the impact of node degrees on classification accuracy,finding that nodes with low degrees can pose challenges for accurate *** experiments use several benchmark datasets and a range of evaluation metrics to compare isotropic and anisotropic GNNs of varying depths,also explore the scalability of these *** findings provide valuable insights into the design of deep GNNs and offer potential avenues for future research to improve their performance.

关键词： Graph Neural Networks(GNNs) learning on graphs over-smoothing over-squashing isotropic-GNNs anisotropic-GNNs

来源：评论

学校读者我要写书评

暂无评论

A Fusion Model for Personalized Adaptive Multi-Product Recommendation System Using Transfer Learning and Bi-GRU

引用

computers, Materials & Continua 2024年第12期81卷 4081-4107页

作者： Buchi Reddy Ramakantha Reddy Ramasamy Lokesh Kumar School of Computer Science and Engineering Vellore Institute of TechnologyVellore632014TamilnaduIndia

Traditional e-commerce recommendation systems often struggle with dynamic user preferences and a vast array of products,leading to suboptimal user *** address this,our study presents a Personalized Adaptive Multi-Product Recommendation System(PAMR)leveraging transfer learning and Bi-GRU(Bidirectional Gated Recurrent Units).Using a large dataset of user reviews from Amazon and Flipkart,we employ transfer learning with pre-trained models(AlexNet,GoogleNet,ResNet-50)to extract high-level attributes from product data,ensuring effective feature representation even with limited ***-GRU captures both spatial and sequential dependencies in user-item *** innovation of this study lies in the innovative feature fusion technique that combines the strengths of multiple transfer learning models,and the integration of an attention mechanism within the Bi-GRU framework to prioritize relevant *** approach addresses the classic recommendation systems that often face challenges such as cold start along with data sparsity difficulties,by utilizing robust user and item *** model demonstrated an accuracy of up to 96.9%,with precision and an F1-score of 96.2%and 96.97%,respectively,on the Amazon dataset,significantly outperforming the baselines and marking a considerable advancement over traditional *** study highlights the effectiveness of combining transfer learning with Bi-GRU for scalable and adaptive recommendation systems,providing a versatile solution for real-world applications.

关键词： Personalized recommendation systems transfer learning bidirectional gated recurrent units(Bi-GRU) performance metrics adaptive systems product reviews

来源：评论

学校读者我要写书评

暂无评论

Towards imbalanced motion:part-decoupling network for video portrait segmentation

引用

science China(Information sciences) 2024年第7期67卷 197-210页

作者： Tianshu YU Changqun XIA Jia LI State Key Laboratory of Virtual Reality Technology and Systems School of Computer Science and EngineeringBeihang University Peng Cheng Laboratory

Video portrait segmentation(VPS), aiming at segmenting prominent foreground portraits from video frames, has received much attention in recent years. However, the simplicity of existing VPS datasets leads to a limitation on extensive research of the task. In this work, we propose a new intricate large-scale multi-scene video portrait segmentation dataset MVPS consisting of 101 video clips in 7 scenario categories,in which 10843 sampled frames are finely annotated at the pixel level. The dataset has diverse scenes and complicated background environments, which is the most complex dataset in VPS to our best *** the observation of a large number of videos with portraits during dataset construction, we find that due to the joint structure of the human body, the motion of portraits is part-associated, which leads to the different parts being relatively independent in motion. That is, the motion of different parts of the portraits is imbalanced. Towards this imbalance, an intuitive and reasonable idea is that different motion states in portraits can be better exploited by decoupling the portraits into parts. To achieve this, we propose a part-decoupling network(PDNet) for VPS. Specifically, an inter-frame part-discriminated attention(IPDA)module is proposed which unsupervisedly segments portrait into parts and utilizes different attentiveness on discriminative features specified to each different part. In this way, appropriate attention can be imposed on portrait parts with imbalanced motion to extract part-discriminated correlations, so that the portraits can be segmented more accurately. Experimental results demonstrate that our method achieves leading performance with the comparison to state-of-the-art methods.

关键词： video portrait segmentation imbalanced motion unsupervised part decoupling motion correlation inter-frame attention

来源：评论

学校读者我要写书评

暂无评论

Automatic emotion recognition using deep neural network

引用

Multimedia Tools and Applications 2025年 1-30页

作者： Sujatha, R. Chatterjee, Jyotir Moy Pathy, Baibhav Hu, Yu-Chen School of Computer Science Engineering and Information Systems Vellore Institute of Technology Vellore India Department of CSE Graphic Era University Dehradun India School of Electrical Engineering Vellore Institute of Technology Vellore India Department of Computer Science Tunghai University Taichung City Taiwan

Emotions are a vital semantic part of human correspondence. Emotions are significant for human correspondence as well as basic for human–computer cooperation. Viable correspondence between people is possibly achieved when both the importance and the emotion of the correspondence are perceived by all groups included. Understanding the significance of language has generally been concentrated on in natural language processing (NLP) as a semantic examination. In NLP, the text can be handled appropriately for classification. Emotion detection from facial emotion is the subfield of social signal processing applied in a wide assortment of regions, explicitly for human and PC collaboration. Many researchers have proposed various approaches, generally utilizing machine learning concepts. Automatic emotion recognition (AER) is significant for working with consistent intuitiveness between a person and a smart device toward fully acknowledging an intelligent society. Many researchers examined cross-lingual and multilingual speech emotion as a stage toward language-free emotion acknowledgment in natural speech. In the present work, we are proposing a deep learning-based AER system using four openly accessible datasets, namely Basic Arabic Vocal Emotions Dataset (BAVED), Acted Emotional Speech Dynamic Database (AESDD), Urdu written in Latin/Roman Script (URDU), and Toronto Emotional Speech Set (TESS), by utilizing the Jupyter notebook and a Python library for music and audio synthesis named Librosa. The experimental results exhibited that the proposed approach achieves better than the existing approaches, i.e., the accuracy of the proposed system with the URDU dataset is 96.24%, the TESS dataset is 99.10%, the AESDD dataset is 65.97%, and the BAVED dataset is 73.12%. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2025.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Blockchain enabled MediVault for healthcare system

引用

Multimedia Tools and Applications 2025年第15期84卷 14805-14825页

作者： Chaurasia, Brijesh Kumar Department of Computer Science and Engineering Pranveer Singh Institute of Technology Kanpur India

The management of healthcare data has significantly benefited from the use of cloud-assisted MediVault for healthcare systems, which can offer patients efficient and convenient digital storage services for storing their medical records. Nevertheless, there are security risks associated with the current digital healthcare data, as malicious parties may work with cloud storage service providers to alter patient records or to directly disclose health record content to other adversaries for monetary advantage. In this paper, a blockchain-enabled MediVault for healthcare systems is proposed not only to provide safe storage of healthcare data in digital form but also secure access for authenticated entities such as patients, doctors, and pharmacists. In this MediVault, we introduced NFT generation and storage over the cloud using Interplanetary File System (IPFS) and FireBase as per user adaptability, Ehtereum Blockchain for immutability, and encryption and decryption using asymmetric keys for confidentiality. The empirical results and the performance evaluation demonstrate that the proposed MediVault is secure, simple, and efficient with a limited computation overhead. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Blockchain

来源：评论

学校读者我要写书评

暂无评论

Novel methodology for apple leaf disease classification with PCNN-IELM

引用

Neural Computing and Applications 2025年第6期37卷 4895-4913页

作者： Navpreet Roul, Rajendra Kumar Department of Computer Science and Engineering Thapar Institute of Engineering and Technology Punjab Patiala147004 India

Agriculture is crucial to the global economy, particularly in ensuring food security. Recent trends indicate that various plant diseases are causing substantial financial losses in the agricultural sector worldwide. Traditional manual inspection methods for detecting fruit and plant diseases are labor-intensive and inefficient. Adopting automated disease detection technologies could significantly enhance early diagnosis and reduce the economic impact of these diseases on agriculture. This study introduces an advanced model for classifying apple diseases by integrating a pre-trained convolutional neural network (PCNN), such as VGG16, VGG19, or ResNet50, with an incremental extreme learning machine (I-ELM) for efficient feature extraction and classification. A key innovation of this model is replacing the PCNN’s fully connected layer with the I-ELM, which eliminates the lengthy back-propagation process and significantly reduces training time. Integrating I-ELM with PCNN harnesses the rapid learning capabilities and robust generalization of I-ELM with the superior feature extraction abilities of CNNs. I-ELM simplifies the network architecture by avoiding the complex neural networks commonly used in other methods. The model’s effectiveness is rigorously evaluated on the well-known Plant Village dataset, demonstrating its ability to identify various apple diseases through performance metrics such as precision, sensitivity, specificity, accuracy, and the F1-score. Comparing existing deep learning models using these metrics highlights its superior performance. This innovation is up-and-coming for intelligent agricultural systems, offering an effective solution for classifying apple diseases and enabling timely and innovative farming practices. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Fruits

来源：评论

学校读者我要写书评

暂无评论

DLP4DA-RPL: A Distributed Lightweight Protocol for Detection and Avoidance of Discarded DIO and DAO Attacks on RPL Routing Protocol in IoT

引用

IEEE Sensors Journal 2025年第12期25卷 22880-22894页

作者： Deepavathi, P. Mala, C. Department of Computer Science and Engineering National Institute of Technology Tiruchirappalli 620015 India

The Internet of Things (IoT) occupies the entire world in its hands. IoT devices have a resource-constrained nature known as Low Power and Lossy Networks (LLN). The Routing Protocol for Low Power and Lossy Networks (RPL) is provided by the Internet engineering Task Force (IETF) group to secure the IoT networks. The control messages Destination Oriented Directed Acyclic Graph (DODAG) Information Object (DIO) and DODAG Advertisement Object (DAO) play a crucial part in RPL. Attackers focused on these control messages to degrade the performance of IoT networks and slowly bring them to a halt. To overcome these problems, this paper proposes a DLP4DA-RPL protocol to detect and avoid Discarded DIO (DDIO) and Discarded DAO (DDAO) control message attacks. The Contiki operating system simulates this proposed protocol using the Cooja simulator and this proposed protocol is implemented in our college environment. It is inferred from the simulation and real-time results that the proposed DLP4DA-RPL protocol outperforms the existing RPL protocols, such as RPL with Attacks, RPL without Attacks, SecTrust-RPL, and DDoS-RPL concerning End-to-end Delay, Energy Consumption, Packet Delivery Ratio, Throughput and Network Performance. © 2025 IEEE.

关键词： Routing protocols

来源：评论

学校读者我要写书评

暂无评论

An Improved Graph Partitioning Algorithm Based Approach for Workflow Offloading in a Fog Environment

引用

Journal of The Institution of Engineers (India): Series B 2025年第2期106卷 623-634页

作者： Mahajan, Neetu Narang Kaur, Parmeet Department of Computer Science and Engineering Jaypee Institute of Information Technology Noida India

The paper addresses the critical problem of application workflow offloading in a fog environment. Resource constrained mobile and Internet of Things devices may not possess specialized hardware to run complex workflows locally and hence, need to offload these tasks to fog nodes. As compared to cloud-based servers, fog nodes can provide responses in a more-timely manner and are preferred for latency-sensitive applications. Workflow applications are characterized by inter-task dependencies and hence, can be readily represented as directed acyclic graphs. Therefore, the proposed offloading solution approach utilizes an improved graph partitioning algorithm based on the Louvain community detection algorithm. The aim of the algorithm is to partition the workflow graph in such a manner that the workflow tasks having high communication costs between them are transferred or offloaded to the same fog node. The benefits of the proposed algorithm have been verified by simulation experiments where it was observed that it results in a lower makespan as compared to the related approaches. © The Institution of Engineers (India) 2024.

关键词： Fog

来源：评论

学校读者我要写书评

暂无评论

Ancient Character Recognition: A Comprehensive Review

引用

IEEE Access 2025年 13卷 88847-88857页

作者： Krithiga, R. Varsini, S.R. Joshua, R. Gabriel Om Kumar, C.U. Vellore Institute of Technology School of Computer Science and Engineering Chennai600127 India

Deep learning-based character recognition of Tamil inscriptions plays a significant role in preserving the ancient Tamil language. The complexity of the task lies in the precise classification of the age-old Tamil letters (Vattezhuthu) into modern-day Tamil letter structures. Various methodologies and pre-processing techniques have been used for denoising the ancient Tamil manuscript to retrieve the Tamil text. Researchers have used various synthesized and scanned images of stone wall inscriptions, palm leaves manuscripts, and offline handwritten characters for their analysis. Over the years, Ancient Tamil scripts have deteriorated with time due to various natural calamities. Strong denoising and feature extraction methods are required to separate the letters accurately to tackle this issue. Techniques such as CNN(OCR), ResNet, SVM, KNN, HorVer method, etc., are utilized to digitize Tamil characters. This technique has successfully converted handwritten characters into digitalized text for multiple languages, including Tamil, Arabic, English, Latin, Chinese, German, etc. Different models have been evaluated based on their segmentation and recognition rates, accuracy, detection rate, precision, and confusion matrix. This paper will concentrate on Ancient Tamil character segmentation and recognition models. Besides, we will give an overview of the different models and datasets available. Lastly, we summarise the key challenges and the future scope related to the topic. © 2013 IEEE.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：