This research studied the effect of variations in a sensor's F lambda/d metric value (FLD) on the performance of machine learning algorithms such as the YOLO (You Only Look Once) algorithm for object classificatio...
详细信息
This research studied the effect of variations in a sensor's F lambda/d metric value (FLD) on the performance of machine learning algorithms such as the YOLO (You Only Look Once) algorithm for object classification. The YOLO_v3 and YOLO_v10 algorithms were trained using static imagery provided in the commonly available training dataset provided by Teledyne FLIR systems. imageprocessing techniques were used to degrade image quality of the test dataset also provided by Teledyne FLIR systems, simulating detector-limited to optics-limited performance, which results in a variation of the FLD metric between 0.339 and 7.98. The degraded test set was used to evaluate the performance of YOLO_v3 and YOLO_v10 for object classification and relate the FLD metric to the probability of detection. Results of YOLO_v3 and YOLO_v10 are presented for the varying levels of image degradation. A summary of the results is discussed along with recommendations for evaluating an algorithm's performance using a sensor's FLD metric value. (c) 2025 Optica Publishing Group. All rights, including for text and data mining (TDM), Artificial Intelligence (AI) training, and similar technologies, are reserved.
Radar-based sensing emerges as a promising alternative to cameras and wearable devices for indoor human activity recognition. Unlike wearables, radar sensors offer non-contact and unobtrusive monitoring, while being i...
详细信息
ISBN:
(纸本)9781510673915;9781510673908
Radar-based sensing emerges as a promising alternative to cameras and wearable devices for indoor human activity recognition. Unlike wearables, radar sensors offer non-contact and unobtrusive monitoring, while being insensitive to lighting conditions and preserving privacy as compared to cameras. This paper addresses the task of continuous and sequential classification of daily life activities, unlike the problem to isolate distinct motions in isolation. Upon acquiring raw radar data containing sequences of motions, an event detection algorithm, the Short-Time-Average/Long-Time-Average (STA/LTA) algorithm, is utilized to detect individual motion segments. By recognizing breaks between transitions from one motion type to another, the STA/LTA detector isolates individual activity segments. To ensure consistent input shapes for activities of varying durations, image resizing and cropping techniques are employed. Furthermore, data augmentation techniques are applied to modify micro-Doppler signatures, enhancing the classification system's robustness and providing additional data for training.
This article presents a new image segmentation algorithm based on a Split & Merge approach. By nature, the execution time of Split & Merge algorithms is data-dependent, as their halting conditions are tied to ...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
This article presents a new image segmentation algorithm based on a Split & Merge approach. By nature, the execution time of Split & Merge algorithms is data-dependent, as their halting conditions are tied to the homogeneity of each region. While previous algorithms made the Split step less sensitive to input data, the execution time of the more complex Merge step remains highly sensitive to image content. This paper tackles the sensitivity and performance problems from a system and architecture perspective. Memory reallocations due to array fusions are eliminated with the introduction of a TTA (Three Table Array) structure in the Merge step. As iterating over entries in this structure causes a loss of memory locality, we propose two new mechanisms that implement a software cache to mitigate this. An experimental study on an embedded system (Nvidia Jetson Xavier NX) has shown our Merge algorithm to be 10.6 times faster than the state-of-the-art Split & Merge algorithm for 960 x 720 images. Moreover, the execution time of our algorithm is also more resistant to image characteristics.
The development of biomedical image technology has brought significant advancements to healthcare and frontier research. Over the past 20 years, BioCAS has witnessed and documented comprehensive achievements in biomed...
详细信息
ISBN:
(纸本)9798350354966;9798350354959
The development of biomedical image technology has brought significant advancements to healthcare and frontier research. Over the past 20 years, BioCAS has witnessed and documented comprehensive achievements in biomedical image sensor design and processingalgorithms. This paper provides a systematic review of the work related to biomedical image acquisition and processing technology in BioCAS and offers a perspective on future developments in this field.
With the rapid development of deep learning technology, its application in imageprocessing and recognition has become a hot research topic. The application of these technologies in software information systems such a...
详细信息
This paper explores the utilization of MATLAB for digital signal processing (DSP) techniques in imageprocessing tasks, focusing on image deblurring, face detection, and facial feature enhancement. Blind deconvolution...
详细信息
ISBN:
(纸本)9798350372113;9798350372106
This paper explores the utilization of MATLAB for digital signal processing (DSP) techniques in imageprocessing tasks, focusing on image deblurring, face detection, and facial feature enhancement. Blind deconvolution methods are employed to address image blurriness, while face detection is facilitated using cascaded object detectors. Enhancements to detected facial features involve histogram equalization, smoothing filters, skin tone adjustment, and contrast enhancement techniques, followed by seamless integration using resizing methods. MATLAB serves as a robust platform for implementing and analyzing DSP algorithms, providing insights into practical solutions for common challenges in digital imageprocessing.
This research paper explores the application of singular value decomposition (SVD) in quantum imageprocessing (QIP), specifically focusing on the computation of eigenvalues using variational quantum algorithms. SVD i...
详细信息
The quality of image signals directly affects the performance of intelligent communication systems. This paper proposes a set of image enhancement and denoising algorithms to address image quality degradation in intel...
详细信息
This paper presents ternary systolic array architecture for matrix multiplication for ternary neural networks and imageprocessingalgorithms in ternary logic. As part of the architecture, we have proposed a processin...
详细信息
ISBN:
(纸本)9798350381191
This paper presents ternary systolic array architecture for matrix multiplication for ternary neural networks and imageprocessingalgorithms in ternary logic. As part of the architecture, we have proposed a processing element of a systolic array architecture. The processing element is designed using a partial product calculator, which does the multiply-accumulate operation using single-digit multipliers and adders. All proposed circuits are designed in CNFET-Memristor technology and simulated using Cadence virtuoso. The propagation delay, power, and power delay product of the proposed circuits are calculated using simulations. The proposed matrix multiplication is used in the Gaussian smoothing of images and results were presented.
With the rapid development of artificial intelligence technology, deep learning has become one of the key technologies in the field of image recognition. PyTorch has become the preferred framework for researchers due ...
详细信息
暂无评论