Potholes pose a significant risk to vehicles and pedestrians, necessitating prompt action to be addressed. In this project, a strategy is incorporated for pothole detection, utilizing the YOLOv8 object detection algor...
详细信息
Vital sign monitoring is an invaluable tool for healthcare professionals, both in the hospital and at home. Traditional measurement devices provide accurate readings but require physical contact with the patient which...
详细信息
ISBN:
(纸本)9798350324471
Vital sign monitoring is an invaluable tool for healthcare professionals, both in the hospital and at home. Traditional measurement devices provide accurate readings but require physical contact with the patient which often is unsuitable, furthermore contact-based devices have been reported to fail by loosing contact due to movement as severe events occur, therefore, a contactless method is necessary. We hypothesize that, in ideal scenarios, it is possible to estimate both SpO(2) and pulse rate using only facial video recorded with a smartphone's front-facing camera. To test this hypothesis, a dataset of 10 healthy subjects performing various breathing patterns while being recorded with a smartphone camera was collected during ideal lighting conditions. Using advanced image and signalprocessing methods to acquire remote photoplethysmography (rPPG) estimates from a patient's forehead, our proposed method can achieve SpO(2) estimation results with A(rms) = 1.34% (accuracy RMS) and MAE +/- STD = 1.26 +/- 0.68% (mean average error) across a SpO(2) range of 92% to 99% (percentage point SpO(2)) and pulse rate estimation results with A(rms) = 3.91 bpm (beats per minute) and MAE +/- STD = 3.24 +/- 2.11 bpm across a pulse rate range of 60 bpm to 90 bpm. We conclude from these results, that remote vital sign estimation using facial videos recorded entirely with a smartphone camera is possible.
With the rapid development of imagesignal processor (ISP), the implementation of ISP based on CPU or FPGA has become a major focus of current research. However, the implementation of CPU-based ISP has a long runtime ...
详细信息
Volumetric 3D reconstruction methods have shown great performance in reconstructing indoor scenarios from monocular videos. However, as such approaches utilize discrete feature voxels to encode the observed scenes, th...
详细信息
This paper presents a robust approach for detecting skin diseases using advanced image detection methods supported by Convolutional Neural Network (CNN) models. Traditionally, skin diseases are diagnosed by experience...
详细信息
Digital humans are attracting more and more research interest during the last decade, the generation, representation, rendering, and animation of which have been put into large amounts of effort. However, the quality ...
详细信息
The study seeks to investigate how the implementation of IoT-driven advanced AI methods can bolster the robustness of oil and gas pipelines by improving strategies for detecting leaks and conducting predictive mainten...
详细信息
Multimodal medical image fusion is vital for extracting complementary information and generating comprehensive images in clinical applications. However, existing deep learning-based fusion approaches face challenges i...
详细信息
Recent deep learning-based contour detection studies show high accuracy in single-class boundary detection problems. However, this performance does not translate well in a multi-class scenario where continuous contour...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Recent deep learning-based contour detection studies show high accuracy in single-class boundary detection problems. However, this performance does not translate well in a multi-class scenario where continuous contours are required. Our research presents CU-Net, a U-Net-based network with residual-net encoders which can produce accurate and uninterrupted contour lines for multiple classes. The critical factor behind this concept is our continuity module, containing an interpolation layer and a novel activation function that converts discrete signals into smooth contours. We find the application of our approach in medical imaging problems like retinal layer segmentation from optical coherence tomography (OCT) scans. We applied our method to an expert annotated OCT dataset of children with sickle-cell disease. To compare with benchmarks, we evaluated our network on DME and HC-MS datasets. We achieved an overall mean absolute distance of 6.48 +/- 2.04 mu M and 1.97 +/- 0.89 mu M, respectively 1.03 and 1.4 times less than the current state-of-the-art.
We propose an adversarial training framework to simultaneously address the vessel segmentation and dirt/reflection removal problems in fundus photographs used for diabetic retinopathy diagnosis. This framework contain...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
We propose an adversarial training framework to simultaneously address the vessel segmentation and dirt/reflection removal problems in fundus photographs used for diabetic retinopathy diagnosis. This framework contains two primary subnetworks, each triggered by a set of loss terms, i.e., one for segmentation and the other for reconstruction. These two subnetworks act as inverse functions of each other so that they form an autoencoder framework with a 2-dimensional latent code, which can be a vessel segmentation mask after binarization. To further improve the segmentation and reconstruction performance, we devise a loss function based on gradient vector flow (GVF) and re-organize the generator network. Experimental results show that the proposed method has a good generalization capability. Trained on DRIVE's training set, our model can produce segmentation and reconstruction-based artifact removal results stably on other datasets like CHASE-DB1 and STARE. The average F1-score of our segmentation results of DRIVE's testing set reaches 0.7964, and the artifact-free reconstruction results can achieve an average PSNR of 24 dB.
暂无评论