CheXNet is not a surprise for Deep Learning (DL) community as it was primarily designed for radiologist-level pneumonia detection in Chest X-rays (CXRs). In this paper, we study CheXNet to analyze CXRs to detect the e...
详细信息
One of the applications of deep learning is deciphering the unscripted text over the walls and pillars of historical monuments is the major source of information extraction. this information gives us an idea about the...
详细信息
Object detection is an advanced area of imageprocessing and computervision. Its major applications are in surveillance, autonomous driving, face recognition, anomaly detection, traffic management, agriculture etc. T...
Object detection is an advanced area of imageprocessing and computervision. Its major applications are in surveillance, autonomous driving, face recognition, anomaly detection, traffic management, agriculture etc. this paper focuses on various object detection techniques in thermal images. A thermal imaging sensor is a device that creates an image by analyzing temperature differences between different objects in a scene and detecting radiation from those objects. In recent years, many machine learning and deep learning algorithms have been used to recognize objects in thermal images. this study makes a comparison of YOLO, YOLO DarkNet, Retinex algorithm, CNN-based machine learning model Support Vector Machine (SVM), and Gaussian Mixture Model (GMM), Mixer of Gaussian (MoG), Mean Shift Approach, Faster R-CNN and Deep Neural Network along with different datasets.
Noisy imageprocessing is a fundamental task of computervision. the first example is the detection of faint edges in noisy images, a challenging problem studied in the last decades. A recent study introduced a fast m...
详细信息
ISBN:
(纸本)9781728188089
Noisy imageprocessing is a fundamental task of computervision. the first example is the detection of faint edges in noisy images, a challenging problem studied in the last decades. A recent study introduced a fast method to detect faint edges in the highest accuracy among all the existing approaches. their complexity is nearly linear in the image's pixels and their runtime is seconds for a noisy image. their approach utilizes a multi-scale binary partitioning of the image. By utilizing the multi-scale U-net architecture, we show in this paper that their method can be dramatically improved in both aspects of run time and accuracy. By training the network on a dataset of binary images, we developed an approach for faint edge detection that works in linear complexity. Our runtime of a noisy image is milliseconds on a GPU. Even though our method is orders of magnitude faster, we still achieve higher accuracy of detection under many challenging scenarios. In addition, we show that our approach to performing multi-scale preprocessing of noisy images using U-net improves the ability to perform other vision tasks under the presence of noise. We prove it on the problems of noisy objects classification and classical image denoising. We show that multi-scale denoising can be carried out by a novel edge preservation loss. As our experiments show, we achieve high-quality results in the three aspects of faint edge detection, noisy image classification, and natural image denoising.
the online car is the product of the "Interonline+ " era. As a new city intelligent transportation mode, it breaks the old interest pattern and the barrier of taxi industry based on franchise. In recent year...
详细信息
Video image analysis mainly involves the analysis and processing of moving image sequences, which usually involves several processes such as motion detection, object classification, object tracking, and behavior under...
Video image analysis mainly involves the analysis and processing of moving image sequences, which usually involves several processes such as motion detection, object classification, object tracking, and behavior understanding and description. Among them, moving object detection and tracking are at the bottom level of the entire visual monitoring system, and are the most basic methods in video image analysis. they are the foundation for various subsequent advanced processing. this article proposes a target detection and tracking algorithm based on Kalman filtering, which can effectively detect and track moving objects, providing a more efficient technical means for detecting and tracking target suspects in the era of big data.
Food recognition is an important task for a variety of applications, including managing health conditions and assisting visually impaired people. Several food recognition studies have focused on generic types of food ...
详细信息
ISBN:
(数字)9783031133213
ISBN:
(纸本)9783031133213;9783031133206
Food recognition is an important task for a variety of applications, including managing health conditions and assisting visually impaired people. Several food recognition studies have focused on generic types of food or specific cuisines, however, food recognition with respect to Middle Eastern cuisines has remained unexplored. therefore, in this paper we focus on developing a mobile friendly, Middle Eastern cuisine focused food recognition application for assisted living purposes. In order to enable a low-latency, high-accuracy food classification system, we opted to utilize the Mobilenet-v2 deep learning model. As some of the foods are more popular than the others, the number of samples per class in the used Middle Eastern food dataset is relatively imbalanced. To compensate for this problem, data augmentation methods are applied on the underrepresented classes. Experimental results show that using Mobilenet-v2 architecture for this task is beneficial in terms of both accuracy and the memory usage. Withthe model achieving 94% accuracy on 23 food classes, the developed mobile application has potential to serve the visually impaired in automatic food recognition via images.
this paper concerns the efficient implementation of a method for optimal binary labeling of graph vertices, originally proposed by Malmberg and Ciesielski (2020). this method finds, in quadratic time with respect to g...
详细信息
Excessive alcohol consumption leads to inebriation. Driving under the influence of alcohol is a criminal offence in many countries involving operating a motor vehicle while inebriated to a level that renders safely op...
详细信息
ISBN:
(数字)9783031064272
ISBN:
(纸本)9783031064272;9783031064265
Excessive alcohol consumption leads to inebriation. Driving under the influence of alcohol is a criminal offence in many countries involving operating a motor vehicle while inebriated to a level that renders safely operating a motor vehicle extremely difficult. Studies show that traffic accidents will become the fifth most significant cause of death if inebriated driving is not mitigated. Inversely, 70% of the world population can be protected by mitigating inebriated driving. Short term effects of inebriation include lack of balance, inhibition and fine motor coordination, dilated pupils and slow heart rate. An ideal inebriation recognition method that operates in real-time is less intrusive, more convenient, and efficient. Deep learning has been used to solve object detection, object recognition, object tracking and image segmentation problems. In this paper, we compare deep learning inebriation recognition methods. We implemented Faster R-CNN and YOLO methods for our experiment. We created our dataset of sober and inebriated individuals made available to the public. Six thousand four hundred forty-three (6443) face images were used, and our best performing pipeline was YOLO with a 99.6% accuracy rate.
In view of the existing problem of the safety and insufficient matching performance of template protection methods using linear threshold quantification, a palm-print template protection method using nonlinear thresho...
详细信息
暂无评论