Abstract: In order to improve the intellective level of water resources management, a real-time water level recognition method based on deep-learning algorithms and image-processing techniques is proposed in this pape...
详细信息
ISBN:
(纸本)9781450395687
Abstract: In order to improve the intellective level of water resources management, a real-time water level recognition method based on deep-learning algorithms and image-processing techniques is proposed in this paper. The recognition process is composed of four steps. Firstly, for the purpose of digit detection, YOLO-v3 model is deployed for extracting numbers from the water gauges. Then, the cropped number images are fed into the LSTM + CTC model as training samples so that digits can be recognized. In the third step, Hough transform are adopted to correct the tilt of water gauge in terms of the vertical edge feature. Morphological operation, associated with horizontal projection would position upper and lower edge of water gauge to recognize the scale lines correctly. Water level could be determined correspondingly. Model application shows that the recognition model has satisfying accuracy and efficiency, with potential being applied in practice.
India is home to 10% of all traffic deaths worldwide and has the second-largest road network in the world. Moreover, in smart cities, traffic congestion, pollutants, and noise pollution have increased due to a constan...
详细信息
The real-world medical datasets are often inherently challenged by imbalanced classes, which impact the performance of deeplearning models, leading to overfitting and limited effectiveness. These limitations are part...
详细信息
The real-world medical datasets are often inherently challenged by imbalanced classes, which impact the performance of deeplearning models, leading to overfitting and limited effectiveness. These limitations are particularly pronounced in image segmentation tasks, where accurate delineation of anatomical structures is essential to support clinical decision-making. In order to match the recent advancements and enhance the model's generalizability and its ability to classify correctly the minor class, specifically the foreground pixels, we applied the generalized dice loss in conjunction with transfer learning, avoiding the redundancy provided by traditional data augmentation techniques and heavy computational data generation strategies. In this paper, we demonstrated that the choice of the loss function plays a pivotal role in optimizing the learning landscape and guiding the model's training process. The proposed approach generated the highest Dice Coefficient value of 98.44% compared with the existing works and augmentation of 5.24% compared with the network that employed the cross-entropy Loss function. Experimental results indicate that the proposed hybrid approach can accurately identify and segment different shapes of the fetal head, enabling real-timeprocessing and providing a significant potential to assist clinical diagnosis for further circumference measurement.
In order to improve the intellective level of water resources management, a real-time water level recognition method based on deep-learning algorithms and image-processing techniques is proposed in this paper. The rec...
详细信息
Road separations, intersections, and crosswalks, which are important components of highways, are seen as significant areas for autonomous vehicles and advanced driver assistance systems because traffic accident occurr...
详细信息
Road separations, intersections, and crosswalks, which are important components of highways, are seen as significant areas for autonomous vehicles and advanced driver assistance systems because traffic accident occurrence rate is considerably high in these areas. In this study, an imageprocessing method and a deeplearning based approach on realimages has been proposed in order to provide instant information for drivers and autonomous vehicles, or to develop warning systems as part of advanced driver assistance systems to prevent or minimize traffic accidents. The information is obtained from the classification of images belonging to the separations, intersections and crosswalks on the road using a new model and VggNet, AlexNet, LeNet based on Convolutional Neural Network(CNN). We have obtained high classification accuracy with our model based on CNN. The result of the study performed on different datasets showed that the proposed method is usable for driver assistance systems and an effective structure that can be used in many areas such as warning both vehicles and drivers. (C) 2019 Elsevier B.V. All rights reserved.
Although deeplearning techniques have made significant advances in the field of images, existing methods still face challenges in processing complex, noisy images. In view of the limitation that most denoising models...
详细信息
Although deeplearning techniques have made significant advances in the field of images, existing methods still face challenges in processing complex, noisy images. In view of the limitation that most denoising models only focus on extracting single scale features, a new denoising network structure is proposed in this paper. Firstly, the channel attention mechanism and convolutional neural network are combined to construct a realimage denoising model, and then the parallel multi-scale convolutional neural network is constructed by combining the adaptive dense connected residual block and parallel multi-scale feature extraction module. The results showed that the designed model can reach the stable state only after 121 and 86 iterations on the training set and the test set, and the denoising accuracy of the model is as high as 0.96. In addition, the research model has high computational efficiency and short denoising time when processing noisy images, and the processingtime of an image is as low as 0.09s. Therefore, the proposed denoising structure has good denoising performance under different noise levels and types, and this study also provides a new idea for the application of deeplearning in image denoising and other imageprocessing tasks.
Intelligent Visitor Management System (IVMS) is crucial for enhancing security and operational efficiency in smart factories and intelligent office buildings. Leveraging AIoT-driven image analysis will facilitate real...
详细信息
Intelligent Visitor Management System (IVMS) is crucial for enhancing security and operational efficiency in smart factories and intelligent office buildings. Leveraging AIoT-driven image analysis will facilitate realtime visitor authentication and access control. However, the growing volume of interactions and the limited processing power of local terminals complicate the delivery of timely and accurate image analysis. To address these challenges, we propose an edge-terminal collaborative AIoT framework for real-time visitor management. The framework solves the limitations of traditional approaches, where local terminals are unable to handle the computational load and edge solutions experience high latency due to transmission delays. Specifically, it integrates three key components to improve system performance: a local analysis module for initial processing, an image communication module for efficient data transmission, and an edge analysis module for advanced processing. Moreover, the framework jointly optimizes image task offloading, wireless channel allocation, and image compression, all formulated as an optimization problem to ensure fast and accurate analysis. Additionally, a novel multi-level deep Reinforcement learning (DRL) method is further designed to dynamically refine the selection of compression and offloading strategies. By learning in real-time, the DRL model adapts to network variations, addressing the scalability and adaptability limitations of existing methods. Simulation results show that our proposed edge-terminal collaborative AIoT framework significantly outperforms both edge-only and terminal-only methods in terms of latency and accuracy.
Underwater remote sensing image enhancement is complicated by low illumination, color bias, and blurriness, affecting deep-sea monitoring and marine resource development. This study compares a multi-scale fusion-enhan...
详细信息
Underwater remote sensing image enhancement is complicated by low illumination, color bias, and blurriness, affecting deep-sea monitoring and marine resource development. This study compares a multi-scale fusion-enhanced physical model and deeplearning algorithms to optimize intelligent processing. The physical model, based on the Jaffe-McGlamery model, integrates multi-scale histogram equalization, wavelength compensation, and Laplacian sharpening, using cluster analysis to target enhancements. It performs well in shallow, stable waters (turbidity < 20 NTU, depth < 10 m, PSNR = 12.2) but struggles in complex environments (turbidity > 30 NTU). deeplearning models, including water-net, UWCNN, UWCycleGAN, and U-shape Transformer, excel in dynamic conditions, achieving UIQM = 0.24, though requiring GPU support for real-time use. Evaluated on the UIEB dataset (890 images), the physical model suits specific scenarios, while deeplearning adapts better to variable underwater settings. These findings offer a theoretical and technical basis for underwater image enhancement and support sustainable marine resource use.
The COVID-19 pandemic has highlighted the need for efficient and non-contact health screening methods. Signal-based infrared imaging is an emerging field in biomedical engineering that enables remote monitoring of vit...
详细信息
The COVID-19 pandemic has highlighted the need for efficient and non-contact health screening methods. Signal-based infrared imaging is an emerging field in biomedical engineering that enables remote monitoring of vital signs. While fever is a common symptom, respiratory abnormalities often appear earlier, necessitating advanced screening systems that monitor both body temperature and respiratory patterns. This research presents an artificial intelligence-based screening device for health that identifies human respiratory patterns based on a deeplearning model. The device is built with a Convolutional Neural Network (CNN) to extract features and a Long Short-Term Memory (LSTM) network to classify time-series patterns. The Softmax classifier accurately classifies respiratory patterns. It is learned on a specialized dataset of six breathing signal patterns, making it an effective model for real-time public health surveillance. The experimental result demonstrates that the proposed CNN-LSTM model achieves 91% accuracy, 90% precision, 93% recall, and an F1-score of 91%. It can be scaled up even further for medical real-time applications, paves the way to even greater future advancements in automated health surveillance.
The utilization of smartphone cameras to capture photographs is immensely popular in the world. Smartphone image signal processors are used to produce high-quality images. The field of image Signal processing (ISP) in...
详细信息
The utilization of smartphone cameras to capture photographs is immensely popular in the world. Smartphone image signal processors are used to produce high-quality images. The field of image Signal processing (ISP) in smartphone cameras involves the application of various techniques and algorithms that process the raw image data acquired by the smartphone camera sensor into a high-quality Red, Green, and Blue (RGB) image. The visual difference between images captured by smartphone cameras and Digital Single Lens Reflex (DSLR) cameras can be attributed to the constrained sizes of smartphone camera sensors and lenses. To address the existing disparity in visual differences of these devices, there is a need to redesign the smartphone ISP with the aim of reconstructing the good quality of the captured images in realtime. This work proposes a single-stage end-to-end deep-learning model that can replace most complex ISP pipelines of smartphone cameras. The training of the proposed model is independent of the sensor and optics employed in a specific device. The proposed single-stage ISP pipeline for smartphone cameras uses the Global Context Residual Dense (GCRD) module, the Multiple Convolution Block (MCB) module, and the Residual Channel Attention (RCA) module. The GCRD module is used to learn the residual information which helps in color mapping of the raw image to the corresponding RGB image. At the same time, the MCB module with multiple convolution blocks consisting of layers of different kernel sizes focuses on the fine-grained details of the image. Further, the RCA module used in the proposed work consists of a very high deep trainable network that adaptively learns more beneficial channel-wise features simultaneously. By combining these modules, the pipeline achieves a synergistic effect that helps in balancing global context, local refinement, and feature prioritization, enabling superior performance in complex ISP operations. This work evaluates the proposed mo
暂无评论