Abstract: In order to improve the intellective level of water resources management, a real-time water level recognition method based on deep-learning algorithms and image-processing techniques is proposed in this pape...
详细信息
ISBN:
(纸本)9781450395687
Abstract: In order to improve the intellective level of water resources management, a real-time water level recognition method based on deep-learning algorithms and image-processing techniques is proposed in this paper. The recognition process is composed of four steps. Firstly, for the purpose of digit detection, YOLO-v3 model is deployed for extracting numbers from the water gauges. Then, the cropped number images are fed into the LSTM + CTC model as training samples so that digits can be recognized. In the third step, Hough transform are adopted to correct the tilt of water gauge in terms of the vertical edge feature. Morphological operation, associated with horizontal projection would position upper and lower edge of water gauge to recognize the scale lines correctly. Water level could be determined correspondingly. Model application shows that the recognition model has satisfying accuracy and efficiency, with potential being applied in practice.
Plasma arc welding can achieve high-quality welding joints in high-strength manufacturing fields, such as aviation and automotive, and improve production efficiency. It is important to observe the weld pool state in r...
详细信息
Plasma arc welding can achieve high-quality welding joints in high-strength manufacturing fields, such as aviation and automotive, and improve production efficiency. It is important to observe the weld pool state in real-time in robot automatic welding. However, the electrodes of the plasma welding torch cannot be observed from the outside. Teaching the weld line to torch in real-time to be observable to humans will be difficult. Also, it is difficult to process the image to obtain the position of the weld line in K-PAW. In this study, a camera was utilized to observe the weld pool. The authors estimate the weld line position in realtime by imageprocessing based on U-Net prediction. The U-Net model demonstrates sufficient prediction where the accuracy reached 99.5% for the training data and 96.5% for the test data recognition. Moreover, a control method utilized weld line position estimated from the boundary area to verify the effectiveness of this prediction model from 3 mm within the deviation of 1 mm, which is within the range of permissible welding errors. It could reduce imageprocessing errors in the weld pool image and provide higher recognition accuracy than imageprocessing. Combining vision sensing technologies and deeplearning methods will provide new technologies to enable higher welding precision and improve welding quality. It could also accelerate the development of welding technology in the intelligent manufacturing field.
Gas-solid fluidized beds are widely applied as chemical reactors, and the fluidization quality in the bed is closely related to bubble behavior. Digital imageprocessing is a commonly used non-invasive method for bubb...
详细信息
Gas-solid fluidized beds are widely applied as chemical reactors, and the fluidization quality in the bed is closely related to bubble behavior. Digital imageprocessing is a commonly used non-invasive method for bubble behavior analysis, but it is usually constrained by experimental conditions such as lighting, making identification of bubble and emulsion phases still challenging. Herein, deeplearning is applied in this study to optimize traditional digital imageprocessing techniques. By evaluating different deeplearning models (FCN, deepLab V3, U-Net), rapid and accurate identification and segmentation of bubble images can be achieved, and the U-Net model performs best, achieving an identification accuracy of 99.05 %. Further application of U-Net to analyze bubble behavior demonstrates that deeplearning methods enable efficient and accurate identification of bubbles and real-time analysis of bubble behavior, highlighting the significant potential application of deeplearning in the field of complex hydrodynamics in fluidized beds.
Sustainable transmission of electrical energy to consumers across regions relies heavily on the integrity of power transmission lines and continuous monitoring of assets is crucial for maintaining system reliability. ...
详细信息
Sustainable transmission of electrical energy to consumers across regions relies heavily on the integrity of power transmission lines and continuous monitoring of assets is crucial for maintaining system reliability. Unmanned aerial vehicles have revolutionized defect identification in real-time and accessibility, even in difficult-to-reach geographical landscapes, thereby improving image-based inspections. This work introduces semisupervised Yolo with focal loss function (SYFLo), a novel method that augments YOLO for real-time health monitoring of electric assets in power transmission lines. SYFLo integrates the focal loss function with semi-supervised learning to effectively address the lack of abundant labeled data, data imbalances and enhance detection accuracy. Additionally, it improves data generalizability across a wide range of images, ensuring robust performance despite varied image backgrounds. By leveraging YOLOv8, SYFLo significantly improves fault identification, achieving a detection accuracy of 96.5% and an FPS of 16.39. Experimental results demonstrate the impact of the proposed approach, highlighting its potential to enhance the reliability of power transmission line monitoring. These findings underscore the importance of integrating advanced deeplearning techniques with innovative loss functions to address common challenges in real-time health monitoring systems.
The fineness modulus(FM) represents the level of particle size of manufactured sand. real-time feedback of FM of manufactured sand is important for industrial sand production, but extracting the particle profile from ...
详细信息
The fineness modulus(FM) represents the level of particle size of manufactured sand. real-time feedback of FM of manufactured sand is important for industrial sand production, but extracting the particle profile from densely stacked images is a great challenge. In this study, a deeplearning and regression analysis -based online measurement method for FM of manufactured sand is proposed. Firstly, the real fineness modulus of the sand produced by the sand -making machine in realtime was obtained by the vibration -screening method(VSM). Then, the particle size fraction of larger particles (0.6-4.75 mm) was obtained based on machine vision combined with a convolutional neural network and imageprocessing. Secondly, a multiple linear regression model was developed for the percentage of particle size and FM. Finally, the percentage of particle size was substituted into the regression model as the independent variable to achieve a fast prediction of the unknown FM. The experimental results show that the maximum repeatability errors for FM of different manufactured sands are 0.09 and 0.13 respectively, and the maximum absolute errors of the FM prediction results are 0.18 and 0.17 respectively. The calculation efficiency and error level of this research method can meet the online testing at sand making sites.
Context. Large aperture ground-based solar telescopes allow the solar atmosphere to be resolved in unprecedented detail. However, ground-based observations are inherently limited due to Earth's turbulent atmospher...
详细信息
Context. Large aperture ground-based solar telescopes allow the solar atmosphere to be resolved in unprecedented detail. However, ground-based observations are inherently limited due to Earth's turbulent atmosphere, requiring image correction techniques. Aims. Recent post-image reconstruction techniques are based on using information from bursts of short-exposure images. Shortcomings of such approaches are the limited success, in case of stronger atmospheric seeing conditions, and computational demand. real-time post-image reconstruction is of high importance to enabling automatic processing pipelines and accelerating scientific research. In an attempt to overcome these limitations, we provide a deeplearning approach to reconstruct an original image burst into a single high-resolution high-quality image in realtime. Methods. We present a novel deeplearning tool for image burst reconstruction based on image stacking methods. Here, an image burst of 100 short-exposure observations is reconstructed to obtain a single high-resolution image. Our approach builds on unpaired image-to-image translation. We trained our neural network with seeing degraded image bursts and used speckle reconstructed observations as a reference. With the unpaired image translation, we aim to achieve a better generalization and increased robustness in case of increased image degradations. Results. We demonstrate that our deeplearning model has the ability to effectively reconstruct an image burst in realtime with an average of 0.5 s of processingtime while providing similar results to standard reconstruction methods. We evaluated the results on an independent test set consisting of high- and low-quality speckle reconstructions. Our method shows an improved robustness in terms of perceptual quality, especially when speckle reconstruction methods show artifacts. An evaluation with a varying number of images per burst demonstrates that our method makes efficient use of the combined image info
This paper executes a criminal face affirmation using deeplearning Algorithm. The data is used to get ready and test using imageprocessing count. The longing of this paper is to execute using a pretrained significan...
详细信息
As the need for on-device artificial intelligence (AI) has increased in recent years, mobile devices tend to be equipped with multiple heterogeneous processors, including CPU, GPU, and Neural processing Unit (NPU). Wh...
详细信息
As the need for on-device artificial intelligence (AI) has increased in recent years, mobile devices tend to be equipped with multiple heterogeneous processors, including CPU, GPU, and Neural processing Unit (NPU). While NPUs can offer low-cost and real-time AI processing capabilities for deep Neural Network (DNN) inference, its limited resources often lead to a trade-off between performance and accuracy, potentially resulting in a non-trivial accuracy drop. To address this problem, we propose a new NPU-GPU Scheduling (NGS) framework for DNN-based video analytics. The main challenge lies in determining when and how to execute inference on the NPU/GPU to satisfy the performance objectives. To make more precise scheduling decisions, we first propose a new image complexity assessment model to replace the existing normalized edge density metric. Then, we formulate the scheduling problem with the objective of maximizing inference accuracy under the given latency constraint, and introduce an adaptive solution based on dynamic programming to determine which frames should be processed on the GPU and when to exit from inference for each of them. Extensive experiments conducted on a real mobile device show that our NGS framework substantially outperforms other solutions, and achieves a close-to-oracle performance.
deeplearning-based scene recognition algorithms have been developed for real-time application in indoor localization systems. However, owing to the slow calculation time resulting from the deep structure of convoluti...
详细信息
deeplearning-based scene recognition algorithms have been developed for real-time application in indoor localization systems. However, owing to the slow calculation time resulting from the deep structure of convolutional neural networks, deeplearning-based algorithms have limitations in the usage of real-time applications, despite their high accuracy in classification tasks. To significantly reduce the computation time of these algorithms and slightly improve their accuracy, we thus propose a path-selective deeplearning network, denoted as Selective Optimal Network (SoN). The SoN selectively uses the depth-variable networks depending on anew indicator, denoted as the classification-complexity of a source image. The SoN reduces the prediction time by selecting optimal depth for the baseline networks corresponding to the input samples. The network was evaluated using two public datasets and two custom datasets for indoor localization and scene classification, respectively. The experimental results indicated that, compared to other deeplearning models, the SoN exhibited improved accuracy and enhanced the processing speed by up to 78.59%. Additionally, the SoN was applied to a smartphone-based indoor positioning system in real-time. The results indicated that the SoN shows excellent performance for rapid and accurate classification in real-time applications of indoor localization systems.
Underwater imaging suffers from significant quality degradation due to light scattering and absorption by water molecules, leading to color cast and reduced visibility. This hinders the ability to analyze and interpre...
详细信息
Underwater imaging suffers from significant quality degradation due to light scattering and absorption by water molecules, leading to color cast and reduced visibility. This hinders the ability to analyze and interpret the underwater world. image dehazing techniques have emerged as a crucial component for underwater image enhancement (UIE). This review comprehensively examines both traditional methods, rooted in the physics of light transmission in water, and recent advances in learning-based approaches, particularly deeplearning architectures like Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs), and Transformers. We conduct a comparative analysis across various metrics, including visual quality, color fidelity, robustness to noise, and computational efficiency, to highlight the strengths and weaknesses of each approach. Furthermore, we address key challenges and future directions for traditional and learning-based methods, focusing on domain adaptation, real-timeprocessing, and integrating physical priors into deeplearning models. This review provides valuable insights and recommendations for researchers and practitioners in underwater image enhancement.
暂无评论