Rail surface defect inspection is an essential task for the railway system. However, due to the similarity of the background and defect foreground pixels, uneven textures, irregular shapes and multiple scales of the e...
详细信息
ISBN:
(数字)9798331510138
ISBN:
(纸本)9798331510145
Rail surface defect inspection is an essential task for the railway system. However, due to the similarity of the background and defect foreground pixels, uneven textures, irregular shapes and multiple scales of the existing defects, the inspection accuracy is still required to be further improved. In this paper, we first expand our dataset by GAN-based method for the scarcity of the defect samples. Then, we propose a diffusion-based model to perform the classification and segmentation task. The diffusion model predicts the mask from the Gaussian noise broken ground truth by successive designed transformer decoder. Meanwhile, to better distinguish the pixels in the boundary or other illegible regions, a boundary-frequency feature enhancement module (BFM) is proposed to make the model pay more attention to the defect-related features. Experiments on the rail surface defect dataset revealed that the proposed diffusion-based transformer decoder and BFM can work well together and efficiently improve the rail surface defect segmentation accuracy.
The multitude of advantages offered by distributed power generation units are recognized as crucial factors in enhancing the security of distribution networks. Ensuring the optimal size and placement of distributed ge...
详细信息
The task of hand reconstruction aims to estimate the shape and pose of hands from the input images. Deep learning algorithms have shown remarkable performance in the field of computer vision in recent years, promoting...
详细信息
Currently, the advanced geological prediction for the main tunnel from the parallel tunnel often employs parallel deduction of detection conclusions, which faces issues such as a lack of diverse geological exploration...
详细信息
Accurate short-term power load forecasting is important to ensure the reliability and economy of the power grid. Convolutional neural networks are widely used, but they are difficult to capture the multi-scale tempora...
详细信息
Model-Agnostic Meta-Learning (MAML) and its variants have shown remarkable performance in scenarios characterized by a scarcity of labeled data during the training phase of machine learning models. Despite these succe...
Model-Agnostic Meta-Learning (MAML) and its variants have shown remarkable performance in scenarios characterized by a scarcity of labeled data during the training phase of machine learning models. Despite these successes, MAML-based approaches encounter significant challenges when there is a substantial discrepancy in the distribution of training and testing tasks, resulting in inefficient learning and limited generalization across domains. Inspired by classical proportional-integral-derivative (PID) control theory, this study introduces a Layer-Adaptive PID (LA-PID) Optimizer, a MAML-based optimizer that employs efficient parameter optimization methods to dynamically adjust task-specific PID control gains at each layer of the network, conducting a first-principles analysis of optimal convergence conditions. A series of experiments conducted on four standard benchmark datasets demonstrate the efficacy of the LA-PID optimizer, indicating that LA-PID achieves state-of-the-art performance in few-shot classification and cross-domain tasks, accomplishing these objectives with fewer training steps. Code is available on https://***/yuguopin/LA-PID. Copyright 2024 by the author(s)
Accurate ultra-short-term photovoltaic (PV) power forecasting is crucial for the real-time scheduling of grid systems. However, the inherent variability of solar energy makes this task extremely challenging. To enhanc...
详细信息
As an essential component of modern industry, steel strips play an indispensable role in various fields and serve as a crucial raw material in industrial production. However, due to various factors such as production ...
详细信息
Locating 3D objects from a single RGB image via perspective projection is a long-standing problem in computer vision. Inspired by differentiable geometry methods in the field of camera relocation, we apply this neural...
详细信息
Image feature matching is an important research field in computer vision that can be widely applied to advanced vision tasks, such as 3D reconstruction and visual tracking. Considering the low matching accuracy and po...
详细信息
暂无评论