This letter presents an efficient visual anomaly detection framework designed for safe autonomous navigation in dynamic indoor environments, such as university hallways. The approach employs an unsupervised autoencode...
详细信息
This letter presents an efficient visual anomaly detection framework designed for safe autonomous navigation in dynamic indoor environments, such as university hallways. The approach employs an unsupervised autoencoder method within deeplearning to model regular environmental patterns and detect anomalies as deviations in the embedding space. To enhance reliability and safety, the system integrates a statistical framework, conformal prediction, that provides uncertainty quantification with probabilistic guarantees. The proposed solution has been deployed on a real-time robotic platform, demonstrating efficient performance under resource-constrained conditions. Extensive hyperparameter optimization ensures the model remains dynamic and adaptable to changes, while rigorous evaluations confirm its effectiveness in anomaly detection. By addressing challenges related to real-timeprocessing and hardware limitations, this work advances the state-of-the-art in autonomous anomaly detection. The probabilistic insights offered by this framework strengthen operational safety and pave the way for future developments, such as richer sensor fusion and advanced learning paradigms. This research highlights the potential of uncertainty-aware deeplearning to enhance safety monitoring frameworks, thereby enabling the development of more reliable and intelligent autonomous systems for real-world applications.
The Chinese Space Station Telescope (abbreviated as CSST) is a future advanced space telescope. real-time identification of galaxy and nebula/star cluster (abbreviated as NSC) images is of great value during CSST surv...
详细信息
The Chinese Space Station Telescope (abbreviated as CSST) is a future advanced space telescope. real-time identification of galaxy and nebula/star cluster (abbreviated as NSC) images is of great value during CSST survey. While recent research on celestial object recognition has progressed, the rapid and efficient identification of high-resolution local celestial images remains challenging. In this study, we conducted galaxy and NSC image classification research using deeplearning methods based on data from the Hubble Space Telescope. We built a local celestial image data set and designed a deeplearning model named HR-CelestialNet for classifying images of the galaxy and NSC. HR-CelestialNet achieved an accuracy of 89.09 per cent on the testing set, outperforming models such as AlexNet, VGGNet, and ResNet, while demonstrating faster recognition speeds. Furthermore, we investigated the factors influencing CSST image quality and evaluated the generalization ability of HR-CelestialNet on the blurry image data set, demonstrating its robustness to low image quality. The proposed method can enable real-time identification of celestial images during CSST survey mission.
Although the existing digital holographic technologies are effective in measuring particle concentration, the processes are cumbersome and time-consuming. The purpose of this study is to quickly and accurately measure...
详细信息
Although the existing digital holographic technologies are effective in measuring particle concentration, the processes are cumbersome and time-consuming. The purpose of this study is to quickly and accurately measure the particle number from a single hologram through deeplearning. The simulation and experimental results show that the prediction number are close to the ground truth without the complicated reconstructions and denoising processes, and the average relative error remains less than 10%. The prediction time of a hologram is at the millisecond level, which offers a new possibility for real-timeprocessing.
A synthetic aperture radar (SAR) system is a notable source of information, recognized for its capability to operate day and night and in all weather conditions, making it essential for various applications. SAR image...
详细信息
A synthetic aperture radar (SAR) system is a notable source of information, recognized for its capability to operate day and night and in all weather conditions, making it essential for various applications. SAR image formation is a pivotal step in radar imaging, essential for transforming complex raw radar data into interpretable and utilizable imagery. Nowadays, advancements in SAR sensor design, resulting in very wide swaths, generate a massive volume of data, necessitating extensive processing. Traditional methods of SAR image formation often involve resource-intensive and time-consuming postprocessing. There is a vital need to automate this process in near-real-time, enabling fast responses for various applications, including image classification and object detection. We present an SAR processing pipeline comprising a complex 2D autofocus SARNet, followed by a CNN-based classification model. The complex 2D autofocus SARNet is employed for image formation, utilizing an encoder-decoder architecture, such as U-Net and a modified version of ResU-Net. Meanwhile, the image classification task is accomplished using a CNN-based classification model. This framework allows us to obtain near real-time results, specifically for quick image viewing and scene classification. Several experiments were conducted using real-SAR raw data collected by the European remote sensing satellite to validate the proposed pipeline. The performance evaluation of the processing pipeline is conducted through visual assessment as well as quantitative assessment using standard metrics, such as the structural similarity index and the peak-signal-to-noise ratio. The experimental results demonstrate the processing pipeline's robustness, efficiency, reliability, and responsivity in providing an integrated neural network-based SAR processing pipeline.
An image-based real-time pantograph anomaly detection method is presented by combining unsupervised deeplearning and nearest neighbor classification. The proposed method includes the following key steps. First, an im...
详细信息
An image-based real-time pantograph anomaly detection method is presented by combining unsupervised deeplearning and nearest neighbor classification. The proposed method includes the following key steps. First, an improved DeblurGAN-v2 deblurring algorithm is applied to the input pantograph image if there exists motion blur. Next, deeplearning semantic segmentation with hybrid coding that combines lightweight convolutional neural network (CNN) and vision transformer (VIT) is employed to accurately segment the pantograph structure within the image. And multiscale feature-dense aggregation network based on an attentional feature fusion (AFF) module has been designed to efficiently integrate information from different feature layers. Finally, a K -nearest neighbor (KNN) classification algorithm with deep pretrained features from the segmented pantograph mask image has been utilized to detect anomalies in the pantograph. Experimental results demonstrate that the proposed pantograph segmentation network outperforms several general segmentation algorithms, achieving a high mean intersection over union (MIoU) of 95.86% with a parameter size of 7 M and FPS of 81.7. And nearest neighbor classification with deep pretrained features achieves excellent pantograph anomaly detection performance with area under the receiver operating characteristic (ROC) curve of 0.987 and area under a precision-recall (PR) curve of 0.998. It is verified that the proposed pantograph anomaly detection method does not rely on abnormal data, and can achieve a high anomaly detection accuracy of 98.75%.
By combining time-frequency images and deeplearning models, the nonlinear ultrasound signals can be classified, detected, and predicted, using the nonlinear coefficient as a fundamental label for training deep learni...
详细信息
By combining time-frequency images and deeplearning models, the nonlinear ultrasound signals can be classified, detected, and predicted, using the nonlinear coefficient as a fundamental label for training deeplearning models. This integrated approach enables quantitative identification and real-time monitoring of concrete damage, promoting the widespread adoption of nonlinear ultrasonic techniques in engineering applications. As a basis, the relationship between damage variations and nonlinear coefficients is discussed by performing nonlinear ultrasonic damage testing on concrete specimens with different crack lengths and angles. The testing signals are converted into time-frequency images using the short-time Fourier transform and the continuous wavelet transform, and both types of images are combined for data augmentation and input into the deeplearning model for training, with nonlinear coefficients serving as labels for the time-frequency images. The MobileNetV2, VGG16, and ResNet18 deeplearning models are trained separately on time-frequency image datasets for the length specimens, the angle specimens, and the length-angle specimens, and the performance of the different models is evaluated and compared. The results show that all three models have accuracy rates above 94%, indicating good identification performance. Finally, with the example, the nonlinear coefficients of the testing signals are compared with the labels of the nonlinear coefficients in the time-frequency images identified by the deeplearning model, which confirms the high accuracy of damage identification by the deeplearning model.
Applying affective computing techniques to recognize fear and combining them with portable signal monitors makes it possible to create real-time detection systems that could act as bodyguards when users are in danger....
详细信息
Applying affective computing techniques to recognize fear and combining them with portable signal monitors makes it possible to create real-time detection systems that could act as bodyguards when users are in danger. With this aim, this paper presents a fear recognition method based on physiological signals obtained from wearable devices. The procedure involves creating two-dimensional feature maps from the raw signals, using data augmentation and feature selection algorithms, followed by deeplearning-based classification models, taking inspiration from those used in imageprocessing. This proposal has been validated with two different datasets, achieving, in WEMAC, WESAD 3-classes, and WESAD 2-classes, F1-score results of 78.13%, 88.07%, and 99.60%, respectively, and 79.90%, 89.12%, and 99.60% in accuracy. Furthermore, the paper demonstrates the feasibility of implementing the proposed method on the Coral Edge TPU device, prepared to make inferences on the edge.
In response to the current challenges of numerous background influencing factors and low detection accuracy in the open railway foreign object detection, a real-time foreign object detection method based on deep learn...
详细信息
In response to the current challenges of numerous background influencing factors and low detection accuracy in the open railway foreign object detection, a real-time foreign object detection method based on deeplearning for open railways in complex environments is proposed. Firstly, the images of foreign objects invading the clearance collected by locomotives during long-term operation are used to create a railway foreign object dataset that fits the current situation. Then, to improve the performance of the target detection algorithm, certain improvements are made to the YOLOv7-tiny network structure. The improved algorithm enhances feature extraction capability and strengthens detection performance. By introducing a Simple, parameter-free Attention Module for convolutional neural network (SimAM) attention mechanism, the representation ability of ConvNets is improved without adding extra parameters. Additionally, drawing on the network structure of the weighted Bi-directional Feature Pyramid Network (BiFPN), the backbone network achieves cross-level feature fusion by adding edges and neck fusion. Subsequently, the feature fusion layer is improved by introducing the GhostNetV2 module, which enhances the fusion capability of different scale features and greatly reduces computational load. Furthermore, the original loss function is replaced with the Normalized Wasserstein Distance (NWD) loss function to enhance the recognition capability of small distant targets. Finally, the proposed algorithm is trained and validated, and compared with other mainstream detection algorithms based on the established railway foreign object dataset. Experimental results show that the proposed algorithm achieves applicability and real-time performance on embedded devices, with high accuracy, improved model performance, and provides precise data support for railway safety assurance.
Nowadays, the Internet is rapidly evolving toward the future of the Internet of Things (IoT), where billions or even trillions of edge devices may be interconnected. The proliferation of network cameras and the advanc...
详细信息
Nowadays, the Internet is rapidly evolving toward the future of the Internet of Things (IoT), where billions or even trillions of edge devices may be interconnected. The proliferation of network cameras and the advancement of IoT technologies have provided broader opportunities for data collection and utilization. In the past, the massive real-time videos generated by network cameras were mostly transmitted over the network to the cloud for analysis. However, due to network speed limitations, the latency incurred by uploading all videos to the cloud makes it difficult to meet the real-time requirements of video analysis. While edge computing significantly reduces latency, the computational capabilities of edge devices are limited, making it difficult to handle large amounts of real-time video data. In this article, we introduce a real-time video processing framework called deepVA, which utilizes cloud-edge collaboration technology to reduce latency in real-time video processing and enhance the accuracy of analysis. The deepVA framework incorporates the DRLVA video frame distribution algorithm based on deep reinforcement learning (DRL), which dynamically determines whether to distribute video frames for processing at the cloud or edge. To evaluate the performance of the proposed DRLVA algorithm, we first verify that it is superior to several other DRL-based distribution algorithms on the Gym environment. We also evaluate the performance of deepVA on the MOT2015 data set, MOTSynth data set, and real campus surveillance videos. The experiments show that our deepVA outperforms both cloud-only and edge-only solutions in terms of reducing latency and improving accuracy.
In this paper, we explore a new idea of using deeplearning representations as a principle for regularization in inverse problems for digital signal processing. Specifically, we consider the standard variational formu...
详细信息
In this paper, we explore a new idea of using deeplearning representations as a principle for regularization in inverse problems for digital signal processing. Specifically, we consider the standard variational formulation, where a composite function encodes a fidelity term that quantifies the proximity of the candidate solution to the observations (under a physical process), and a second regularization term that constrains the space of solutions according to some prior knowledge. In this work, we investigate deeplearning representations as a means of fulfilling the role of this second (regularization) term. Several numerical examples are presented for signal restoration under different degradation processes, showing successful recovery under the proposed methodology. Moreover, one of these examples uses real data on energy usage by households in London from 2012 to 2014.
暂无评论