版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Department of Computing Technologies SRM Institute of Science and Technology Chengalpattu District Tamil Nadu Kattankulathur603203 India Department of Computer Science and Engineering Vellore Institute Technology Chennai Campus Tamil Nadu Chennai India Department of Information Technology Anna University MIT Campus Tamil Nadu Chennai India
出 版 物:《Multimedia Tools and Applications》 (Multimedia Tools Appl)
年 卷 期:2025年
页 面:1-20页
核心收录:
学科分类:08[工学] 0835[工学-软件工程] 0812[工学-计算机科学与技术(可授工学、理学学位)]
摘 要:Video analytics faces complex challenges in object detection and classification. Deep learning based approaches have achieved remarkable success in past decade. However, existing object identification models that utilize backbone’s core features still present challenges due to their lack of semantic information. To address these issues, a novel object detection and classification framework utilizing Batch normalization and Softswish activation adapted ResNet (BS2ResNet) and Logistic Tanh Kaiming Bi-directional Long Short Term Memory (LTK-Bi-LSTM) techniques was proposed. The framework employs frame conversion, noise removal, and contrast elevation during frame pre-processing, followed by background subtraction using the Supreme Distance-centered Fuzzy C-Means (SD-FCM) clustering algorithm, and edge detection using the Hyperbolic Tangent Kernel Canny Edge Detector (HTKCED). BS2ResNet is then used for object detection, and features are extracted and passed to the LTK-Bi-LSTM neural network for object classification. The proposed system was found to improve object detection and classification accuracy, outperforming existing techniques. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.