video action recognition using 3D Convolutional Neural Networks (CNN) become an increasingly popular strategy in past years with the evolution of machine learning and computer vision. However, the higher memory and co...
详细信息
ISBN:
(纸本)9798350358810;9798350358803
video action recognition using 3D Convolutional Neural Networks (CNN) become an increasingly popular strategy in past years with the evolution of machine learning and computer vision. However, the higher memory and computation capacity requirement of these networks leads to the use of low-power memory-saving neural networks to perform video action recognition tasks efficiently. Spike-based information processing and computation of bio-inspired spiking convolutional neural networks perform an essential role when comes to energy efficient memory saving computation for video classification and action recognition tasks which allow on-chip real-timeprocessing. This paper proposes a novel 3D Convolutional Spiking Neural Network (CSNN) architecture with modulating STDP supervised learning via global error feedback for human action recognition in video data. The proposed model includes two 3D convolutional layers, followed by two spiking neuron layers, modeled using Leaky Integrate and Fire (LIF) neurons for feature extraction from video data. Using the modulating STDP learning rule with global error feedback, this model can successfully recognize human actions from video data allowing online parallel computations. The proposed network experimented on two datasets: one 3D image dataset - synthesized 3D MNIST and one video dataset - UCF 101 human action recognition dataset and achieved 71.6% and 63.7% recognition accuracy.
With the rapid improvement of UAV imaging equipment in terms of image resolution and video frame rate, relying solely on software to enhance images with extremely high data volume is no longer sufficient to meet real-...
详细信息
Handguns, pistols, and revolvers are commonly used in today's world for committing criminal acts, requiring the need for effective surveillance and control systems. However, despite the advancement of security sys...
详细信息
ISBN:
(数字)9783031581816
ISBN:
(纸本)9783031581809;9783031581816
Handguns, pistols, and revolvers are commonly used in today's world for committing criminal acts, requiring the need for effective surveillance and control systems. However, despite the advancement of security systems, human monitoring and involvement are still necessary to effectively combat these crimes. This paper provides a robust automated handgun identification technique for recorded videos and live CCTV footage that may be used for both control and surveillance purposes. Automatic detection of firearms is crucial for improving people's protection and safety, however, it is a challenging task because of the numerous differences in design, size, and appearance of firearms. In recent years, object detectors have improved, yielding better findings and shorter inference times. The authors used cutting-edge object detector YOLOv7 for firearm detection. A varied and demanding dataset of 15,367 images for weapon identification is also proposed, which is carefully annotated for weapon localization and classification. After analysing the data, it is determined that the model achieves an accuracy rate of 96.80% and recall rate of 90.37%.
Urban traffic management is a crucial concern in today's cities due to increased vehicle density and the complexity of transport networks. Effective real-time vehicle tracking systems are critical for reducing tra...
详细信息
With the development of digital imageprocessing technology, the demand and application of wide-view video continue to grow. However, the existing stitching algorithms may result in unnatural appearance, misalignment,...
详细信息
This paper uses Shi-Tomas method for video search of sports competitions. Firstly, the ONVIF356-8950 4K HD network image data decoding processor is selected, and RS485 interface actuator is used as the main interface....
详细信息
Transfer learning is a set of techniques to apply skills or knowledge from a source task to a target task that is different but related, while Hybrid Quantum-Classical Transfer Learning (HQCTL) model extends the skill...
详细信息
This research paper presents a novel Virtual Gym Tracker AI Pose Estimation system designed to enhance virtual fitness experiences. Leveraging advanced deep learning techniques and real- timeimage analysis, the syste...
详细信息
For safety and security reasons, the indoor/outdoor working environments of various industries require the use of many cameras for automated surveillance. In such context, a major challenge for automated monitoring sy...
详细信息
The arrival of Sora marks a new era for text-to-video diffusion models, bringing significant advancements in video generation and potential applications. However, Sora, along with other text-to-video diffusion models,...
暂无评论