Recent years have shown immense potential for image and videoprocessing. Several fields in health, engineering, and science have used them to enhance functioning. In imageprocessing, a number of algorithms and techn...
详细信息
video action recognition using 3D Convolutional Neural Networks (CNN) become an increasingly popular strategy in past years with the evolution of machine learning and computer vision. However, the higher memory and co...
详细信息
ISBN:
(纸本)9798350358810;9798350358803
video action recognition using 3D Convolutional Neural Networks (CNN) become an increasingly popular strategy in past years with the evolution of machine learning and computer vision. However, the higher memory and computation capacity requirement of these networks leads to the use of low-power memory-saving neural networks to perform video action recognition tasks efficiently. Spike-based information processing and computation of bio-inspired spiking convolutional neural networks perform an essential role when comes to energy efficient memory saving computation for video classification and action recognition tasks which allow on-chip real-timeprocessing. This paper proposes a novel 3D Convolutional Spiking Neural Network (CSNN) architecture with modulating STDP supervised learning via global error feedback for human action recognition in video data. The proposed model includes two 3D convolutional layers, followed by two spiking neuron layers, modeled using Leaky Integrate and Fire (LIF) neurons for feature extraction from video data. Using the modulating STDP learning rule with global error feedback, this model can successfully recognize human actions from video data allowing online parallel computations. The proposed network experimented on two datasets: one 3D image dataset - synthesized 3D MNIST and one video dataset - UCF 101 human action recognition dataset and achieved 71.6% and 63.7% recognition accuracy.
With the rapid improvement of UAV imaging equipment in terms of image resolution and video frame rate, relying solely on software to enhance images with extremely high data volume is no longer sufficient to meet real-...
详细信息
Handguns, pistols, and revolvers are commonly used in today's world for committing criminal acts, requiring the need for effective surveillance and control systems. However, despite the advancement of security sys...
详细信息
ISBN:
(数字)9783031581816
ISBN:
(纸本)9783031581809;9783031581816
Handguns, pistols, and revolvers are commonly used in today's world for committing criminal acts, requiring the need for effective surveillance and control systems. However, despite the advancement of security systems, human monitoring and involvement are still necessary to effectively combat these crimes. This paper provides a robust automated handgun identification technique for recorded videos and live CCTV footage that may be used for both control and surveillance purposes. Automatic detection of firearms is crucial for improving people's protection and safety, however, it is a challenging task because of the numerous differences in design, size, and appearance of firearms. In recent years, object detectors have improved, yielding better findings and shorter inference times. The authors used cutting-edge object detector YOLOv7 for firearm detection. A varied and demanding dataset of 15,367 images for weapon identification is also proposed, which is carefully annotated for weapon localization and classification. After analysing the data, it is determined that the model achieves an accuracy rate of 96.80% and recall rate of 90.37%.
Urban traffic management is a crucial concern in today's cities due to increased vehicle density and the complexity of transport networks. Effective real-time vehicle tracking systems are critical for reducing tra...
详细信息
Narrow-band imaging (NBI), a relatively new bronchoscopy technology, offers superior visualization of vascular details in lesion areas along the airway walls compared to standard white light bronchoscopy. This empower...
详细信息
ISBN:
(纸本)9781510685987;9781510685994
Narrow-band imaging (NBI), a relatively new bronchoscopy technology, offers superior visualization of vascular details in lesion areas along the airway walls compared to standard white light bronchoscopy. This empowers physicians to detect suspect lesions and characterize their underlying vascular structures for further indications of cancerous activity. Unfortunately, the bronchoscopic video stream suffers from blurring artifacts due to device and patient motions, resulting in low-resolution visualization of lesion areas. To address this problem, we present an image enhancement method for NBI bronchoscopy to improve: 1) visualization of vascular structures;2) lesion detection;and 3) vessel segmentation. We adapted real-ESRGAN, a single-image super-resolution network, to enhance bronchoscopic images in real-time. This involved a transfer learning approach to fine-tune a pre-trained model using our public NBI bronchial lesion database. The results, derived from bronchoscopic airway exam videos of 10 lung cancer patients, demonstrate significant improvement in the visual quality of super-resolved frames, particularly in vascular regions. Our quantitative analysis further shows enhanced vessel segmentation and lesion detection accuracy, with increased confidence scores. This method offers a practical, real-time solution for improving the diagnostic utility of NBI bronchoscopy by providing clearer, more detailed images. Thus, we integrated the method into an NBI video analysis system for aiding in the early detection and characterization of bronchial lesions.
With the development of digital imageprocessing technology, the demand and application of wide-view video continue to grow. However, the existing stitching algorithms may result in unnatural appearance, misalignment,...
详细信息
This paper uses Shi-Tomas method for video search of sports competitions. Firstly, the ONVIF356-8950 4K HD network image data decoding processor is selected, and RS485 interface actuator is used as the main interface....
详细信息
Transfer learning is a set of techniques to apply skills or knowledge from a source task to a target task that is different but related, while Hybrid Quantum-Classical Transfer Learning (HQCTL) model extends the skill...
详细信息
This research paper presents a novel Virtual Gym Tracker AI Pose Estimation system designed to enhance virtual fitness experiences. Leveraging advanced deep learning techniques and real- timeimage analysis, the syste...
详细信息
暂无评论