Denoising(DN) and demosaicing(DM) are the first crucial stages in the image signal processing pipeline. Recently, researches pay more attention to solve DN and DM in a joint manner, which is an extremely undetermined ...
详细信息
Denoising(DN) and demosaicing(DM) are the first crucial stages in the image signal processing pipeline. Recently, researches pay more attention to solve DN and DM in a joint manner, which is an extremely undetermined inverse problem. Existing deeplearning methods learn the desired prior on synthetic dataset, which limits the generalization of learned network to the real world data. Moreover, existing methods mainly focus on the raw data property of high green information sampling rate for DM, but occasionally exploit the high intensity and signalto-noise(SNR) of green channel. In this work, a deep guided attention network(DGAN) is presented for realimage joint DN and DM(JDD), which considers both high SNR and high sampling rate of green information for DN and DM, respectively. To ease the training and fully exploit the data property of green channel, we first train DN and DM sub-networks sequentially and then learn them jointly, which can alleviate the error accumulation. Besides, in order to support the realimage JDD, we collect paired raw clean RGB and noisy mosaic images to conduct a realistic dataset. The experimental results on real JDD dataset show the presented approach performs better than the state-of-the-art methods, in terms of both quantitative metrics and qualitative visualization.
Modern enterprise resource planning (ERP) systems face the challenge of handling massive amounts of data and supporting real-time decision-making. With the rapid changes in the market environment, traditional ERP syst...
详细信息
Modern enterprise resource planning (ERP) systems face the challenge of handling massive amounts of data and supporting real-time decision-making. With the rapid changes in the market environment, traditional ERP systems are limited in their ability to make adaptive decisions. This study aims to address this issue by integrating deeplearning techniques to enhance the management effectiveness of ERP systems. The study uses RNNs, CNNs and DRL models for time series prediction, image recognition and resource optimisation, respectively. The experimental results show that RNN achieves 95% accuracy in demand forecasting, CNN 98% accuracy in image recognition, and DRL achieves more than 10% cost savings in resource optimisation. The integrated ERP system achieved a 42.86% reduction in order processingtime, a 25% improvement in inventory turnover, an 8% reduction in operating costs, and a 15% improvement in employee satisfaction. This study demonstrates the effectiveness of deeplearning to enhance decision support in ERP systems and provides suggestions for future directions of improvement.
Sustainable transmission of electrical energy to consumers across regions relies heavily on the integrity of power transmission lines and continuous monitoring of assets is crucial for maintaining system reliability. ...
详细信息
Sustainable transmission of electrical energy to consumers across regions relies heavily on the integrity of power transmission lines and continuous monitoring of assets is crucial for maintaining system reliability. Unmanned aerial vehicles have revolutionized defect identification in real-time and accessibility, even in difficult-to-reach geographical landscapes, thereby improving image-based inspections. This work introduces semisupervised Yolo with focal loss function (SYFLo), a novel method that augments YOLO for real-time health monitoring of electric assets in power transmission lines. SYFLo integrates the focal loss function with semi-supervised learning to effectively address the lack of abundant labeled data, data imbalances and enhance detection accuracy. Additionally, it improves data generalizability across a wide range of images, ensuring robust performance despite varied image backgrounds. By leveraging YOLOv8, SYFLo significantly improves fault identification, achieving a detection accuracy of 96.5% and an FPS of 16.39. Experimental results demonstrate the impact of the proposed approach, highlighting its potential to enhance the reliability of power transmission line monitoring. These findings underscore the importance of integrating advanced deeplearning techniques with innovative loss functions to address common challenges in real-time health monitoring systems.
During the converter process, it is crucial to automatically identify and record ladle numbers to track steel product quality and enhance automation levels. However, the steelmaking environment presents several challe...
详细信息
During the converter process, it is crucial to automatically identify and record ladle numbers to track steel product quality and enhance automation levels. However, the steelmaking environment presents several challenges, including intricate ladle scheduling, varying lighting conditions, severe background interference, and significant disparities between manually spray-printed ladle number characteristics and publicly available datasets. The combination of these problems makes it challenging to perform accurate and real-time ladle number identification. In response, this article suggests an automatic ladle number recognition approach based on deeplearning and imageprocessing. First, a double-region object detection model based on YOLOv5 is employed to capture keyframe images of the ladle to be identified from the video stream. Then, a method that can enable the acquisition of an accurate region of ladle numbers in sophisticated industrial settings is proposed to address the distortion of numerical features caused by lighting variations and background interference in industrial environments. Last, leveraging the proprietary dataset found and a ladle number recognition model integrating CNN and multiframe image fusion is designed, developing multithreading design and image queue management to ensure real-time and accurate ladle number recognition. In this study, the video data of a steel plant is used for testing. Through testing 176 steelmaking production cycles, all ladle numbers are accurately identified prior to finishing charging molten iron, indicating the high accuracy and real-time capability of the recognition system.
Hypercomplex algebras have recently been gaining prominence in the field of deeplearning owing to the advantages of their division algebras over real vector spaces and their superior results when dealing with multidi...
详细信息
Hypercomplex algebras have recently been gaining prominence in the field of deeplearning owing to the advantages of their division algebras over real vector spaces and their superior results when dealing with multidimensional signals in real-world 3D and 4D paradigms. This article provides a foundational framework that serves as a road map for understanding why hypercomplex deeplearning methods are so successful and how their potential can be exploited. Such a theoretical framework is described in terms of inductive bias, i.e., a collection of assumptions, properties, and constraints that are built into training algorithms to guide their learning process toward more efficient and accurate solutions. We show that it is possible to derive specific inductive biases in the hypercomplex domains, which extend complex numbers to encompass diverse numbers and data structures. These biases prove effective in managing the distinctive properties of these domains as well as the complex structures of multidimensional and multimodal signals. This novel perspective for hypercomplex deeplearning promises to both demystify this class of methods and clarify their potential, under a unifying framework, and in this way, promotes hypercomplex models as viable alternatives to traditional real-valued deeplearning for multidimensional signal processing.
作者:
Meenakshisundaram, N.Sajiv, G.
Saveetha University Department of Electronics and Communication Engineering Chennai India
Malaria remains a significant global health challenge, particularly in resource-limited regions, necessitating accurate and rapid diagnostic tools. This study introduces deepMalariaNet, a deeplearning model developed...
详细信息
Underwater imaging suffers from significant quality degradation due to light scattering and absorption by water molecules, leading to color cast and reduced visibility. This hinders the ability to analyze and interpre...
详细信息
Underwater imaging suffers from significant quality degradation due to light scattering and absorption by water molecules, leading to color cast and reduced visibility. This hinders the ability to analyze and interpret the underwater world. image dehazing techniques have emerged as a crucial component for underwater image enhancement (UIE). This review comprehensively examines both traditional methods, rooted in the physics of light transmission in water, and recent advances in learning-based approaches, particularly deeplearning architectures like Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs), and Transformers. We conduct a comparative analysis across various metrics, including visual quality, color fidelity, robustness to noise, and computational efficiency, to highlight the strengths and weaknesses of each approach. Furthermore, we address key challenges and future directions for traditional and learning-based methods, focusing on domain adaptation, real-timeprocessing, and integrating physical priors into deeplearning models. This review provides valuable insights and recommendations for researchers and practitioners in underwater image enhancement.
This paper presents a systematic analysis for tomato plant leaf disease detection by adapting traditional Convolutional Neural Network (CNN) and pre-trained RESNET50 model. To understand the problem, first a tradition...
详细信息
ISBN:
(数字)9798331506452
ISBN:
(纸本)9798331506469
This paper presents a systematic analysis for tomato plant leaf disease detection by adapting traditional Convolutional Neural Network (CNN) and pre-trained RESNET50 model. To understand the problem, first a traditional CNN model has been considered. However, the performance of the model was both space and time consuming & provided inadequate predictions. Therefore, to resolve this issue RESNET50 model has been employed. Further, comparing the yielded performance from both CNN and RESNET50, it can be observed RESNET50 exhibits better performance as compared to CNN in terms of accuracy, prediction and complexity.
Context. Large aperture ground-based solar telescopes allow the solar atmosphere to be resolved in unprecedented detail. However, ground-based observations are inherently limited due to Earth's turbulent atmospher...
详细信息
Context. Large aperture ground-based solar telescopes allow the solar atmosphere to be resolved in unprecedented detail. However, ground-based observations are inherently limited due to Earth's turbulent atmosphere, requiring image correction techniques. Aims. Recent post-image reconstruction techniques are based on using information from bursts of short-exposure images. Shortcomings of such approaches are the limited success, in case of stronger atmospheric seeing conditions, and computational demand. real-time post-image reconstruction is of high importance to enabling automatic processing pipelines and accelerating scientific research. In an attempt to overcome these limitations, we provide a deeplearning approach to reconstruct an original image burst into a single high-resolution high-quality image in realtime. Methods. We present a novel deeplearning tool for image burst reconstruction based on image stacking methods. Here, an image burst of 100 short-exposure observations is reconstructed to obtain a single high-resolution image. Our approach builds on unpaired image-to-image translation. We trained our neural network with seeing degraded image bursts and used speckle reconstructed observations as a reference. With the unpaired image translation, we aim to achieve a better generalization and increased robustness in case of increased image degradations. Results. We demonstrate that our deeplearning model has the ability to effectively reconstruct an image burst in realtime with an average of 0.5 s of processingtime while providing similar results to standard reconstruction methods. We evaluated the results on an independent test set consisting of high- and low-quality speckle reconstructions. Our method shows an improved robustness in terms of perceptual quality, especially when speckle reconstruction methods show artifacts. An evaluation with a varying number of images per burst demonstrates that our method makes efficient use of the combined image info
In ophthalmic diagnostics, achieving precise segmentation of retinal blood vessels is a critical yet challenging task, primarily due to the complex nature of retinal images. The intricacies of these images often hinde...
In ophthalmic diagnostics, achieving precise segmentation of retinal blood vessels is a critical yet challenging task, primarily due to the complex nature of retinal images. The intricacies of these images often hinder the accuracy and efficiency of segmentation processes. To overcome these challenges, we introduce the cognitive DL retinal blood vessel segmentation (CoDLRBVS), a novel hybrid model that synergistically combines the deeplearning capabilities of the U-Net architecture with a suite of advanced imageprocessing techniques. This model uniquely integrates a preprocessing phase using a matched filter (MF) for feature enhancement and a post-processing phase employing morphological techniques (MT) for refining the segmentation output. Also, the model incorporates multi-scale line detection and scale space methods to enhance its segmentation capabilities. Hence, CoDLRBVS leverages the strengths of these combined approaches within the cognitive computing framework, endowing the system with human-like adaptability and reasoning. This strategic integration enables the model to emphasize blood vessels, accurately segment effectively, and proficiently detect vessels of varying sizes. CoDLRBVS achieves a notable mean accuracy of 96.7%, precision of 96.9%, sensitivity of 99.3%, and specificity of 80.4% across all of the studied datasets, including DRIVE, STARE, HRF, retinal blood vessel and Chase-DB1. CoDLRBVS has been compared with different models, and the resulting metrics surpass the compared models and establish a new benchmark in retinal vessel segmentation. The success of CoDLRBVS underscores its significant potential in advancing medical imageprocessing, particularly in the realm of retinal blood vessel segmentation.
暂无评论