Maintaining road pavement integrity is crucial for ensuring safe and efficient transportation. Conventional methods for assessing pavement condition are often laborious and susceptible to human error. This paper propo...
详细信息
Maintaining road pavement integrity is crucial for ensuring safe and efficient transportation. Conventional methods for assessing pavement condition are often laborious and susceptible to human error. This paper proposes YOLO9tr, a novel lightweight object detection model for pavement damage detection, leveraging the advancements of deeplearning. YOLO9tr is based on the YOLOv9 architecture, incorporating a partial attention block that enhances feature extraction and attention mechanisms, leading to improved detection performance in complex scenarios. The model is trained on a comprehensive dataset comprising road damage images from multiple countries. This dataset includes an expanded set of damage categories beyond the standard four types (longitudinal cracks, transverse cracks, alligator cracks, and potholes), providing a more nuanced classification of road damage. This broadened classification range allows for a more accurate and realistic assessment of pavement conditions. Comparative analysis demonstrates YOLO9tr's superior precision and inference speed compared to state-of-the-art models like YOLOv8, YOLOv9 and YOLOv10, achieving a balance between computational efficiency and detection accuracy. The model achieves a high frame rate of up to 136 FPS, making it suitable for real-time applications such as video surveillance and automated inspection systems. The research presents an ablation study to analyze the impact of architectural modifications and hyperparameter variations on model performance, further validating the effectiveness of the partial attention block. The results highlight YOLO9tr's potential for practical deployment in real-time pavement condition monitoring, contributing to the development of robust and efficient solutions for maintaining safe and functional road infrastructure.
Dentists judge that the quality of dental treatment for each patient is very time-consuming and inefficient, lacks quantitative evaluation criteria, and is easy to cause errors. At the same time, the traditional metho...
详细信息
Dentists judge that the quality of dental treatment for each patient is very time-consuming and inefficient, lacks quantitative evaluation criteria, and is easy to cause errors. At the same time, the traditional method of extracting tooth and root canal image features based on experience is difficult to accurately extract the tooth area and root canal filling area, resulting in low accuracy of tooth and root canal segmentation, which in turn affects the accuracy of tooth treatment quality evaluation. In this paper, a deeplearning convolutional neural network is used to segment the root canal filling area, tooth boundary, and the boundary between tooth and soft tissue for the real patient 's root canal treatment and filling image. Finally, the segmented image is quantitatively evaluated according to the multi-evaluation index of professional doctors. The experimental results show that the intelligent evaluation method of dental treatment quality combined with deeplearning and multi-index decomposition proposed in this paper not only unifies the evaluation criteria of dental treatment quality but also the therapeutic effect of quantitative scoring can effectively improve the work efficiency of doctors, which has reference significance for the application of artificial intelligence in the medical field.
This paper addresses the problem of lossy image compression, a fundamental problem in imageprocessing and information theory that is involved in many real-world applications. We start by reviewing the framework of va...
详细信息
This paper addresses the problem of lossy image compression, a fundamental problem in imageprocessing and information theory that is involved in many real-world applications. We start by reviewing the framework of variational autoencoders (VAEs), a powerful class of generative probabilistic models that has a deep connection to lossy compression. Based on VAEs, we develop a new scheme for lossy image compression, which we name quantization-aware ResNet VAE (QARV). Our method incorporates a hierarchical VAE architecture integrated with test-time quantization and quantization-aware training, without which efficient entropy coding would not be possible. In addition, we design the neural network architecture of QARV specifically for fast decoding and propose an adaptive normalization operation for variable-rate compression. Extensive experiments are conducted, and results show that QARV achieves variable-rate compression, high-speed decoding, and better rate-distortion performance than existing baseline methods.
The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the ...
详细信息
The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the continuously advancing level of *** resolve this problem,efficient and flexible malware detection tools are *** work examines the possibility of employing deep CNNs to detect Android malware by transforming network traffic into image data ***,the dataset used in this study is the CIC-AndMal2017,which contains 20,000 instances of network traffic across five distinct malware categories:***,***,***,***,*** network traffic features are then converted to image formats for deeplearning,which is applied in a CNN framework,including the VGG16 pre-trained *** addition,our approach yielded high performance,yielding an accuracy of 0.92,accuracy of 99.1%,precision of 98.2%,recall of 99.5%,and F1 score of 98.7%.Subsequent improvements to the classification model through changes within the VGG19 framework improved the classification rate to 99.25%.Through the results obtained,it is clear that CNNs are a very effective way to classify Android malware,providing greater accuracy than conventional *** success of this approach also shows the applicability of deeplearning in mobile security along with the direction for the future advancement of the real-time detection system and other deeper learning techniques to counter the increasing number of threats emerging in the future.
This work presents an innovative method for scheduling tasks in a fog computing environments by combining the fuzzy logic with deep reinforcement learning. In Internet of Things there has been a significant raising th...
详细信息
This work presents an innovative method for scheduling tasks in a fog computing environments by combining the fuzzy logic with deep reinforcement learning. In Internet of Things there has been a significant raising the amount of data produced by different devices. This has created a need for more effective methods of processing and managing this data. Conventional cloud computing often fails to fulfill the need of IoT usage in terms of high bandwidth, low makespan, and real-timeprocessing. Fog computing presents available solution by placing the processing resources near the data source but the issue of efficient task scheduling remains a major obstacle. We proposed a technique that combines an Hybrid task scheduling technique in fog computing using fuzzy logic and deep reinforcement learning (HTSFFDRL) algorithm with a Takagi-Sugeno fuzzy inference system. By continuously interacting with the environment, this hybrid technique allows for the dynamic prioritization of tasks and the real-time change of scheduling rules. The technique seeks to maximize many crucial performance measures, such as makespan, energy consumption, cost, and fault tolerance. Simulations extensively validate the suggested strategy, demonstrating significant enhancements compared to current approaches like LSTM, DQN, and A2C. The results indicate that combining fuzzy logic with reinforcement learning may greatly improve the effectiveness and dependability of task scheduling in fog computing, opening up possibilities for more resilient IoT applications.
Date In response to the imperative need for mitigating criminal activities and ensuring public safety, this research proposes a novel approach leveraging deeplearning techniques for real-time weapon detection. In con...
详细信息
Dorsal hand vein (DHV) recognition is a burgeoning biometric technology that has recently garnered considerable attention. This article uses imageprocessing and deeplearning to present a novel DHV recognition approa...
详细信息
Dorsal hand vein (DHV) recognition is a burgeoning biometric technology that has recently garnered considerable attention. This article uses imageprocessing and deeplearning to present a novel DHV recognition approach. It involves detecting and identifying the unique patterns present in the DHV. The proposed system begins with the preprocessing mechanism that is applied to enhance the quality of the acquired images, including contrast enhancement and noise reduction, by using some filters such as Median and Contrast Limited Adaptive Histogram Equalization (CLAHE). Next, a deeplearning model, such as a convolutional neural network (CNN), is employed to automatically abstract discriminative features from the preprocessed vein images. The empirical outcomes prove the influence and reliability of the proposed technique for vein recognition, making it a promising solution for biometric authentication systems. Compared with traditional CNN, the proposed approach shows good accuracy and classification rate results. The suggested model achieved a high recognition rate accuracy, recall, precious, and f-score of 99.7%,97%,96%, and 96%, respectively, and a recognition time of about 1283.45 s. To enrich the model's capability for feature recognition and reduce recognition time, decrease the intricacy of learning and the connectivity CNN structure, an alternative approach based on Restricted Boltzmann Machines (RBM) was assessed. This strategy exhibits superior accuracy in comparison to other contemporary algorithms. The proposed RBM achieved a high recognition rate accuracy, recall, precious, and f-score of 99.9%,99%,99%, and 99%, respectively, and a recognition time of about 137.235s.
In this paper, we propose a face detection and recognition system using deeplearning method. It can be used as an access control system that performs face detection and recognition in real-timeprocessing. Our goal i...
详细信息
In this paper, we propose a face detection and recognition system using deeplearning method. It can be used as an access control system that performs face detection and recognition in real-timeprocessing. Our goal is to achieve a one-shot recognition instead of traditional two-step methods. We use SSD as the main model for face detection and VGG-Face as the main model for face recognition. We perform the deeplearning method through the collection of datasets. Moreover, we use some techniques, such as data augmentation, preprocessing of the image, and post-processing of the image to train the robust face detection and recognition subsystems. We use continuous frames as input to avoid false-positive cases and make the system output without wrong results. A real demonstration system is constructed to determine the identification of the laboratory members. We use 1280 x 960 resolution video for experimental testing and achieve about 30 fps speed under GPU acceleration.
Hyperspectral imaging (HSI) has recently emerged as a promising tool for many agricultural applications;however, the technology cannot be directly used in real-time for immediate decision-making and actions due to the...
详细信息
Hyperspectral imaging (HSI) has recently emerged as a promising tool for many agricultural applications;however, the technology cannot be directly used in real-time for immediate decision-making and actions due to the extensive time needed to capture, process, and analyze large volumes of data. Consequently, the development of a simple, compact, and cost-effective imaging system is not possible with the current HSI systems. Therefore, the overall goal of this study was to reconstruct hyperspectral images from RGB images through deeplearning for agricultural applications. Specifically, this study used Hyperspectral Convolutional Neural Network - Dense (HSCNN-D) to reconstruct hyperspectral images from RGB images for predicting soluble solid content (SSC) in sweet potatoes. The algorithm reconstructed the hyperspectral images from RGB images, with the resulting spectra closely matching the ground-truth. The partial least squares regression (PLSR) model based on reconstructed spectra outperformed the model using the full spectral range, demonstrating its potential for SSC prediction in sweet potatoes. These findings highlight the potential of deeplearning-based hyperspectral image reconstruction as a low-cost, efficient tool for various agricultural uses.
High dynamic range (HDR) images capture real-world luminance values which cannot be directly displayed on the screen and require tone mapping to be shown on low dynamic range (LDR) hardware. During this transformation...
详细信息
High dynamic range (HDR) images capture real-world luminance values which cannot be directly displayed on the screen and require tone mapping to be shown on low dynamic range (LDR) hardware. During this transformation, tone mapping algorithms are expected to preserve the naturalness and structural details of the image. In this regard, the performance of atone mapping algorithm can be evaluated through a subjective study where participants rank or score tone mapped images based on their preferences. However, such subjective evaluations can be time-consuming and cannot be repeated for every tone mapped image. To address this issue, numerous quantitative metrics have been proposed for objective evaluation. This paper presents a robust objective metric based on deeplearning to quantify image quality. We assess the performance of our proposed metric by comparing it to 20 existing state-of-theart metrics using two subjective datasets, including one benchmark dataset and a novel proposed dataset of 666 tone mapped images comprising a variety of scenes and labeled by 20 users. Our approach exhibits the highest correlation with subjective scores in both evaluations, confirming its effectiveness and potential to be a reliable alternative to laborious subjective studies.
暂无评论