This research studied the effect of variations in a sensor's F lambda/d metric value (FLD) on the performance of machine learning algorithms such as the YOLO (You Only Look Once) algorithm for object classificatio...
详细信息
This research studied the effect of variations in a sensor's F lambda/d metric value (FLD) on the performance of machine learning algorithms such as the YOLO (You Only Look Once) algorithm for object classification. The YOLO_v3 and YOLO_v10 algorithms were trained using static imagery provided in the commonly available training dataset provided by Teledyne FLIR systems. image processing techniques were used to degrade imagequality of the test dataset also provided by Teledyne FLIR systems, simulating detector-limited to optics-limited performance, which results in a variation of the FLD metric between 0.339 and 7.98. The degraded test set was used to evaluate the performance of YOLO_v3 and YOLO_v10 for object classification and relate the FLD metric to the probability of detection. Results of YOLO_v3 and YOLO_v10 are presented for the varying levels of image degradation. A summary of the results is discussed along with recommendations for evaluating an algorithm's performance using a sensor's FLD metric value. (c) 2025 Optica Publishing Group. All rights, including for text and data mining (TDM), Artificial Intelligence (AI) training, and similar technologies, are reserved.
The early detection and classification of COvID-19 is crucial for disease diagnosis and control. To reduce the need for medical professionals, fast and accurate detection approaches for COvID-19 are required. Due to e...
详细信息
The early detection and classification of COvID-19 is crucial for disease diagnosis and control. To reduce the need for medical professionals, fast and accurate detection approaches for COvID-19 are required. Due to environmental concerns, the quality of the image gets degraded. Compared with reverse-transcription polymerase chain reaction (RT-PCR), chest computed tomography (CT) imaging may be a significantly more trustworthy, useful, and rapid technique to classify and evaluate COvID-19. Thus, the performance of the deep learning (DL) techniques is diminished. Therefore, a CT image-based hybrid DL technology is presented in this article for the classification of COvID-19 disease as COvID or non-COvID or pneumonia. Initially, in the pre-processing stage, the hybrid nonlocal moment bilateral filtering (Hybrid NMBF) technique is introduced for image de-noising and re-sizing. After pre-processing, the image is fed into the feature extraction phase. Gray-level covariance matrices (GLCM) technique is used to extract the relevant features and reduce feature dimensionality issues. For the feature selection process, the enhanced Archimedes optimization algorithm (EAOA) is introduced to select optimal features. The residual channel attention-generative adversarial network (RCA-GAN) technique is introduced for image classification. Here, the hyperparameter of the network is tuned using the Sandpiper optimization (SPO) algorithm to optimize the loss function. The data set used in this research is COvID-CT-machine learning deep learning (MD), and the performance is analyzed using the MATLAB tool. In the experimental scenario, the proposed system achieves 98.3% accuracy, 98.7% specificity, 99.4% sensitivity, 97.4% F-score, and 96.1% kappa. The attained results prove that the proposed system works better than the traditional techniques.
This paper introduces TranGDeepSC, a lightweight CNN-based deep semantic communication (DeepSC) system that leverages vision Transformer (viT) knowledge through co-training to enhance image transmission. Evaluated on ...
详细信息
This paper introduces TranGDeepSC, a lightweight CNN-based deep semantic communication (DeepSC) system that leverages vision Transformer (viT) knowledge through co-training to enhance image transmission. Evaluated on CIFAR-100 across various SNRs, TranGDeepSC demonstrates competitive performance with viTDeepSC, and outperforms SemviT and ADJSCC-v in imagequality, particularly in low-SNR environments. Notably, it offers substantial gains in efficiency: 92.8% fewer parameters than ADJSCC-v, 72.0% lower energy use, and 48% faster processing than viTDeepSC. These advantages make TranGDeepSC well-suited for resource-constrained applications in next-generation communication systems, including 6G, IoT, and real-time multimedia streaming.
The process of creating blood cells (hematopoiesis) occurs in the bone marrow, where Hematopoietic Stem Cells (HSCs) are located. The division and differentiation of hematopoietic stem cells are tightly regulated to e...
详细信息
The process of creating blood cells (hematopoiesis) occurs in the bone marrow, where Hematopoietic Stem Cells (HSCs) are located. The division and differentiation of hematopoietic stem cells are tightly regulated to ensure a balance between blood cell lineages. Disturbances in the process can lead to blood diseases such as anemia, high White Blood Cell (WBC) count, or thrombocytopenia. Detecting malignant leukemia cells based on images is crucial in diagnosing and treating leukemia, helping doctors make accurate diagnoses, and providing appropriate treatment. The author proposes a new method for recognizing and classifying WBC images using the Multi-hop Attention Graph Neural Networks method. The YOLO-v10 method is used for object detection and image preprocessing through the Centre Net network architecture. The Salp Swarm Optimization (SSO) method is deployed to select the features of the WBC images optimally and put the image features into each node in the architecture of the Graph Neural Network (GNN) model to perform classification. The dataset used has an imagequality of approximately 42 pixels per 1 mu m resolution with a total of 16,027 annotated White Blood Cell images classified into 9 types of WBC with characteristic images of clinically significant pathologies. The classification accuracy of the system of the YLSSOGNN model is 99.18%, and the classification accuracy of the system of the YLGNN model is 99.03 %. The WBC image recognition and classification model using the post-learning method has a GNN architecture with object recognition function using the YOLO-v10 method and feature extraction and optimization using the SSO method and performs WBC image classification using Multi-hop Attention Graph Neural Networks model, which helps to bring high performance and can apply the model to other types of image objects.
Unmanned aerial vehicles are increasingly utilised for monitoring and inspecting critical infrastructure such as power generation grids, oil and gas pipelines and roads. One key task in road maintenance is the detecti...
详细信息
Unmanned aerial vehicles are increasingly utilised for monitoring and inspecting critical infrastructure such as power generation grids, oil and gas pipelines and roads. One key task in road maintenance is the detection of cracks, which is crucial for ensuring road safety. Manual detection of cracks is time-consuming and prone to errors, highlighting the need for automated solutions. This study presents an automated method for road crack detection and classification using a hybrid deep learning technique. The proposed approach integrates a pyramid vision transformer and ConvMixer models for feature extraction, enabling the system to learn complex patterns in crack images. image pre-processing is first performed using a median filtering technique to enhance imagequality. The detection and classification of cracks are then carried out using an Elman neural network model, with its hyperparameters optimised through an improved black widow optimisation algorithm. Extensive simulations demonstrate that the proposed method outperforms other deep learning models in terms of performance, providing a reliable and efficient solution for automated crack detection.
This study proposes an innovative algorithm based on DCNN and multi-channel image fusion, aiming to improve the quality and efficiency of virtual scene image generation. The algorithm extracts depth information and te...
详细信息
Pneumonia causes a high rate of newborn morbidity and mortality. The challenge is accurately identifies respiratory disorders while overcoming the limitations of existing technologies such as low accuracy, delayed res...
详细信息
Pneumonia causes a high rate of newborn morbidity and mortality. The challenge is accurately identifies respiratory disorders while overcoming the limitations of existing technologies such as low accuracy, delayed response, and restricted scalability. To overcome this complication, variational Onsager Neural Network optimized with Golden search optimization algorithm fostered for Lung Disease Detection system in IoT (LDD-vONNCXR-IoT) is proposed. Initially, input CXR images are gathered from chest-X-ray Dataset. Then, pre-process the input CXR images using Two-way Recursive filtering (TWRF) for normalizing image and increasing the quality of the images. Afterwards, the preprocessed image is supplied to the feature extraction. Adaptive Synchro Extracting Transform (ASET) is employed to extract the statistical features. Finally, the extracted features are fed into variational Onsager Neural Networks (vONN) which classifies the input CXR image into normal and pneumonia. The Golden Search Optimization Algorithm (GSOA) is used to optimize vONN that accurately detects the Lung Disease. The proposed LDD-vONN-CXR-IoT method is implemented. The performance metrics, like precision, accuracy, F1-score, Sensitivity, specificity, Error rate, ROC, computational time are examined. The proposed LDD-vONN-CXR-IoT approach attains 99.57%, 98.46%, and 98.13% for accuracy, F1 score, and precision respectively. These outcomes prove that this method for the Lung Disease Detection system in IoT is effectual tool to assist in clinical diagnosis. This method allows expertise to acquire exact results, thus providing the proper treatment.
This paper presents a comparative study on the optical system design for a smartphone-based fundus camera to enhance retinal imaging quality. Three optical system configurations were evaluated using ANSYS Zemax OpticS...
详细信息
In agricultural settings, handling of soft fruit is critical to ensuring quality and safety. This study introduces a novel opto-tactile sensing approach designed to enhance the handling and assessment of soft fruit, w...
详细信息
In agricultural settings, handling of soft fruit is critical to ensuring quality and safety. This study introduces a novel opto-tactile sensing approach designed to enhance the handling and assessment of soft fruit, with a case example of strawberries. Our approach utilises a Robotiq 2F-85 gripper equipped with the DIGIT vision-Based Tactile Sensor (vBTS) and attached to a Universal Robot UR10e. In contrast to force-based approaches, we introduce a novel purely image-based processing software pipeline for quantifying localised surface deformations in soft fruit. The system integrates fast and explainable image processing techniques applying image differencing, denoising, K-means clustering for unsupervised classification, morphological operations, and connected components analysis (CCA) to quantify surface deformations accurately. A calibration of the image processing pipeline using a rubber ball showed that the system effectively captured and analysed the rubber ball's surface deformations, benefiting from its uniform elasticity and predictable response to compression. As a soft fruit case example, the image processing pipeline was subsequently applied to strawberries, blueberries, and raspberries, demonstrating that the calibration parameters derived from the rubber ball could effectively assess surface deformations in soft fruits. Despite the complex, nonlinear deformation characteristics inherent to strawberries, blueberries, and raspberries, the pipeline exhibited robust performance, capturing and quantifying subtle surface changes. These findings underscore the system's capacity for precise deformation analysis in delicate materials, offering major potential for further refinement and adaptation. This novel approach of proposing and testing an image processing pipeline lays the groundwork for enhancing the handling and assessment of materials with intricate mechanical properties, paving the way for broader applications in sensitive agricultural and industrial
Lensless imaging systems eliminate the need for lenses by employing an encoding element to multiplex incident light signals, which are then captured directly onto a bare camera sensor. They present a promising alterna...
详细信息
暂无评论