Automatic License Plate Recognition systems aim to provide a solution for detecting, localizing, and recognizing license plate characters from vehicles appearing in video frames. However, deploying such systems in the...
详细信息
ISBN:
(纸本)9781665458245
Automatic License Plate Recognition systems aim to provide a solution for detecting, localizing, and recognizing license plate characters from vehicles appearing in video frames. However, deploying such systems in the real world requires real-time performance in low-resource environments. In our paper, we propose a two-stage detection pipeline paired with Vision API that provides real-time inference speed along with consistently accurate detection and recognition performance. We used a haar-cascade classifier as a filter on top of our backbone MobileNet SSDv2 detection model. This reduces inference time by only focusing on high confidence detections and using them for recognition. We also impose a temporal frame separation strategy to distinguish between multiple vehicle license plates in the same clip. Furthermore, there are no publicly available Bangla license plate datasets, for which we created an image dataset and a video dataset containing license plates in the wild. We trained our models on the image dataset and achieved an AP(0.5) score of 86% and tested our pipeline on the video dataset and observed reasonable detection and recognition performance (82.7% detection rate, and 60.8% OCR F1 score) with real-timeprocessing speed (27.2 frames per second).
The proceedings contain 31 papers. The special focus in this conference is on Signal and imageprocessing. The topics include: A Blockchain-Based Secure Framework for Storage and Access of Surveillance video Data;enco...
ISBN:
(纸本)9789819795147
The proceedings contain 31 papers. The special focus in this conference is on Signal and imageprocessing. The topics include: A Blockchain-Based Secure Framework for Storage and Access of Surveillance video Data;encountering the Challenges and Awareness of Internet Security;Data-Driven Strategies for Twitter Engagement: Hashtag Recommendations and API Insights;Analyzing and Visualizing AI Decision-Making for Human-Centered Interaction and Trust;AI-ML-Based Handover Techniques in Next-Generation Wireless Networks;castle Defender: Design and Implementation of Game Based on Pygame;optimizing Healthcare: Enhancing Disease Management with Recommendation Systems;optimizing Diabetes Prediction Models for Enhanced Health Data processing;comparative Analysis of Classification Models Using Various Feature Sets;An Early Detection of Tomato Plant Disease with Deep Reinforcement Learning and CNN;ioT-Powered Health Monitoring System for Protecting Vital Organs Through Cloud-Based Diagnosis;LASKV: Lightweight Authenticated Session Key Agreement for video Surveillance System;Data Analyses of Factors Influencing Sustainable Adoption of Online Public Services: An Extended UTAUT Analysis in North Macedonia;a Comprehensive Review on real-time Campus Compass;deep Learning Approaches for Early Detection of Bovine Respiratory Diseases in Cattle;identifying Suitable Cloud Storage Services from Cross Cloud Platforms Based on the Requirements;storage-Savvy Frame Recorder: Enhancing Storage Efficiency and Inspection Speed*;lossless Medical image Compression Using Block-Wise Burrows–Wheeler Transform and Modified Run Length Encoding;advancing Software Defect Detection and Prevention: Bridging Gaps in Early-Stage and Evolving Software Systems;a New Way to Communicate: Deep Learning for Deaf and Dumb People;dependent Binomial: A Family of Distributions Derivable by Modifying a Base.
This paper presents examination result of possibility for automatic unclear region detection in the CAD system for colorectal tumor with realtime endoscopic videoimage. We confirmed that it is possible to realize th...
详细信息
This paper presents examination result of possibility for automatic unclear region detection in the CAD system for colorectal tumor with realtime endoscopic videoimage. We confirmed that it is possible to realize the CAD system with navigation function of clear region which consists of unclear region detection by YOLO2 and classification by AlexNet and SVMs on customizable embedded DSP cores. Moreover, we confirmed the realtime CAD system can be constructed by a low power ASIC using customizable embedded DSP cores.
Diabetic retinopathy (DR) is a sight-threatening condition associated with diabetes, characterized by damage to the retinal blood vessels. Key to the automation of DR staging is the identification of various symptoms ...
详细信息
ISBN:
(纸本)9798350351439;9798350351422
Diabetic retinopathy (DR) is a sight-threatening condition associated with diabetes, characterized by damage to the retinal blood vessels. Key to the automation of DR staging is the identification of various symptoms directly or closely associated with retinal blood vessels, as well as the number of these symptoms in the four quadrants of the retina separated by the optic disc. Therefore, precise identification of the optic disc (OD) and blood vessels in fundus images is crucial for DR stage diagnosis but is often time-consuming and requires expert analysis. This study introduces a thresholding-based approach for the automated localization of the OD and the detection of blood vessels in fundus images of diabetic patients. Our algorithm is more robust than some deep learning-based algorithms, achieving more accurate results, particularly in advanced DR stages where the resemblance between various symptoms and blood vessels complicates the extraction of blood vessels. Additionally, our computer vision system can achieve OD localization and blood vessel segmentation in realtime. The experimental results on a dataset selected by an ophthalmologist from a Kaggle dataset, ensuring data quality, show that the proposed algorithm can achieve an accuracy higher than 94% for both OD localization and blood vessel detection, outperforming some state-of-the-art algorithms.
The progress made in camera technology and computational imaging has been astounding when it comes to photography. A specialized neural image signal processor is currently used by many top smartphone cameras and DSLR ...
详细信息
KeyWord Spotting (KWS), i.e. the capability to identify vocal commands as they are pronounced, is becoming one of the most important features of Human-Machine Interface (HMI), also thanks to the pervasive diffusion of...
详细信息
Test-time adaptation (TTA) aims at boosting the generalization capability of a trained model by conducting self-/un-supervised learning during testing in real-world applications. Though TTA on image-based tasks has se...
详细信息
ISBN:
(纸本)9798400701085
Test-time adaptation (TTA) aims at boosting the generalization capability of a trained model by conducting self-/un-supervised learning during testing in real-world applications. Though TTA on image-based tasks has seen significant progress, TTA techniques for video remain scarce. Naively introducing image-based TTA methods into video tasks may achieve limited performance, since these methods do not consider the special nature of video tasks, e.g., the motion information. In this paper, we propose leveraging motion cues in videos to design a new test-time learning scheme for video classification. We extract spatial appearance and dynamic motion clip features using two sampling rates (i.e., slow and fast) and propose a fast-to-slow unidirectional alignment scheme to align fast motion and slow appearance features, thereby enhancing the motion encoding ability. Additionally, we propose a slow-fast dual contrastive learning strategy to learn a joint feature space for fastly and slowly sampled clips, guiding the model to extract discriminative video features. Lastly, we introduce a stochastic pseudo-negative sampling scheme to provide better adaptation supervision by selecting a more reliable pseudo-negative label compared to the pseudo-positive label used in prior TTA methods. This technique reduces the adaptation difficulty often caused by poor performance on out-of-distribution test data before adaptation. Our approach significantly improves performance on various video classification backbones, as demonstrated through extensive experiments on two benchmark datasets.
a kind of spatial-temporal neural network video smoke detection algorithm is proposed in order to solve the problems associated with the incorrect classification of the static approximate smoke background in the face ...
详细信息
This study addresses the deficiencies in the analysis of local parameters of target features in human motion videoimages during the rapid extraction of local features. This deficiency leads to inaccurate description ...
详细信息
With the continuous advancement of seeker technology and imageprocessing techniques, the precision of guided weapons has increasingly improved. However, due to the rigidly fixed structure between the seeker and the g...
详细信息
暂无评论