To enhance the safety and comfort of vehicle travel, detecting pavement cracks is a critical task in road management. This article introduces an advanced single-stage target detection method utilizing the YOLOv5s algo...
详细信息
To enhance the safety and comfort of vehicle travel, detecting pavement cracks is a critical task in road management. This article introduces an advanced single-stage target detection method utilizing the YOLOv5s algorithm to enhance real-time performance and accuracy. Initially, Squeeze-and-Excitation Networks are integrated into the model to facilitate self-learning for improved crack characterization. Subsequently, anchors computed through the K-means clustering algorithm are closely aligned with the fracture dataset, achieving an adaptation rate of 99.9 % and enhancing the recall rate of the model. Furthermore, the inclusion of the SimSPPF module from YOLOv6 diminishes memory usage and expedites detection speed. By replacing the original nearest up- sampling method with transposed convolution, optimization of up-sampling for crack datasets is achieved. Performance assessments reveal that the refined YOLOv5s algorithm attains an F1 score of 91 %, a mean Average Precision (mAP) of 93.6 %, and a 1.54 % increase in frames per second (fps) for pavement crack detection. This enhancement in detection technology signifies a substantial advancement in the maintenance and longevity of road infrastructure.
With the exponential growth of video data, individuals, particularly scholars in the fields of history and sociology, are increasingly reliant on video materials. However, the task of locating specific frames within v...
详细信息
ISBN:
(纸本)9789464593617;9798331519773
With the exponential growth of video data, individuals, particularly scholars in the fields of history and sociology, are increasingly reliant on video materials. However, the task of locating specific frames within videos remains a laborious and time-consuming endeavor. Advanced machine learning-assisted video processing techniques have emerged, including text-based video searches, video summarization, real-time object detection, and person re-identification. However, distinct from these, the main challenge of retrieving video frames based on given visual content is how to efficiently and accurately pinpoint the instance occurrences. To expedite the process while maintaining retrieval performance, we propose a two-stage approach, combining KeyFrame Extraction (KFE) and Content-based image Retrieval (CBIR), underpinned a DNN-empowered framework called MoReSo. Our innovations include 1) the integration of improved statistical features with dynamic clustering in the KFE stage and 2) the development of the MoReSo framework, which consists of MobileNet and ResNet backbones with SOA layer to jointly represent video frames, achieving 2.67x increase in efficiency compared to existing solutions. Our framework is evaluated on two datasets: the annotated EHM Historical Database provided by digital history researchers and the widely-used image retrieval benchmark datasets, the Oxford and Paris datasets. The experimental results showcase that the proposed framework and scheme excel among other models in the CBVIR task. We make our code available for further exploration through our GitHub repository. This repository contains the implementation of our model and CBVIR system with a GUI prototype.
We present a novel color video compression method using the greatest solution of a system of bilinear fuzzy relation equations to assess the similarity between frames. The frames in each band are treated separately an...
详细信息
The integration of deeplearning into visual communication design offers transformative possibilities for style transfer and automation. This paper proposes a framework that combines neural style transfer (NST) techni...
详细信息
Attention Deficit Hyperactivity Disorder (ADHD) causes significant impairment in various domains. It is known in the Medical Statistical Manual of Mental Disorders, Fifth Edition (DSM-V) that the symptoms of ADHD may ...
详细信息
ISBN:
(纸本)9798350374520;9798350374513
Attention Deficit Hyperactivity Disorder (ADHD) causes significant impairment in various domains. It is known in the Medical Statistical Manual of Mental Disorders, Fifth Edition (DSM-V) that the symptoms of ADHD may manifest in actions and daily behaviors. deeplearning methods based on fMRI and EEG have improved the efficiency of the ADHD detection process. However, the cost of the specialized equipment and trained staff required by the existing methods are generally huge. Therefore, we introduce the action recognition network based on raw RGB videos to ADHD detection for the first time. We also extract corresponding action characteristics with two proposed novel measurements: Attention Deficit Ratio (RAD) and Stationary Ratio (RS) based on the action features of ADHD. The two-stage final ADHD detection is decided with RAD and RS fusion and achieves a high accuracy of 95.5% in our real multimodal dataset. The dataset recorded in our Intelligent Sensing Laboratory has been processed and reported to CNTW-NHS Foundation Trust, which will be reviewed by medical consultants/professionals and to be made public in due course.
This paper presents a comprehensive approach to address the challenge of dehazing on-road images by synthesising datasets and training advanced deeplearning models. Leveraging Pix2Pix GAN and introducing a novel Tira...
详细信息
Freshwater fish is one of the commodities experiencing an increasing growth rate from 1990 to 2018. Many efforts have been made to meet market needs, through both fisheries technology and applied technology, one of wh...
详细信息
Medical image segmentation benefits from machine learning advancements, offering potential automation. Yet, accuracy depends on substantial annotated data and significant computing resources. Transfer learning address...
详细信息
ISBN:
(纸本)9781510671577;9781510671560
Medical image segmentation benefits from machine learning advancements, offering potential automation. Yet, accuracy depends on substantial annotated data and significant computing resources. Transfer learning addresses these challenges by leveraging a model's knowledge from one task for another with minor adjustments. The idea is to adapt learned features to new tasks, even with differing datasets but shared characteristics. Studies explore the impact of using large source datasets for limited target datasets. This investigation focuses on transferring knowledge from a limited source to enhance model versatility across various tasks. Our goal involved transferring knowledge from an advanced model trained on T2 weighted MR images related to Autosomal Dominant Polycystic Kidney Disease (ADPKD) for kidney and cyst segmentation (referred to as "Lsource"). This transfer was directed towards five distinct target datasets: CT liver, CT kidneys, CT spleen, MRI kidneys, and CT multimodal data (target datasets 1 through 5). The primary objective was to achieve accurate segmentation on these target datasets while saving time and computational resources. This approach is especially valuable when obtaining a substantial, labeled mouse PKD MRI target dataset is challenging, and the source dataset itself is resource- intensive. Using transfer learning from source 1 onto target sets 1 to 5 resulted in mean Dice Similarity Coefficients (DSCs) of 0.94 +/- 0.04, 0.97 +/- 0.02, 0.95 +/- 0.03, 0.96 +/- 0.01, 0.93 +/- 0.02, respectively. Similarly, employing source 2 yielded mean DSCs of 0.95 +/- 0.04, 0.96 +/- 0.02, 0.95 +/- 0.02, 0.96 +/- 0.02, and 0.93 +/- 0.02 for the same target sets. Despite variations in pathological conditions, image characteristics, and imaging modalities, the transfer learning approach produced DSC values comparable to the initial published outcomes. This accomplishment was achieved with reduced training requirements, faster convergence times, and decreased co
The proceedings contain 70 papers. The special focus in this conference is on deeplearning, Artificial Intelligence and Robotics. The topics include: A Short Survey on Comparative Study of Modern Cryptograp...
ISBN:
(纸本)9783031609343
The proceedings contain 70 papers. The special focus in this conference is on deeplearning, Artificial Intelligence and Robotics. The topics include: A Short Survey on Comparative Study of Modern Cryptography Approach;advances in Computer-Aided Detection and Diagnosis of Retinal Diseases: A Comprehensive Survey of Fundal image Analysis;driver Safety and Drowsiness Detection in Internet of Vehicles with Federated learning;privacy Preserving Fingerprint Classification Using Federated learning;comparative Study of Ensemble learning Models for Smart Meter Load;social-Media Video Summarization Using Convolutional Neural Network and Kohnen’s Self Organizing Map;machine learning and deep Leaning in Predicting Coronary Heart Disease;Augmented Super Resolution GAN (ASRGAN) for image Enhancement Through Reinforced Discriminator;Convolutional Block Attention Assisted Dense Stacked Bi-LSTM for the Generation of RDF Statements;real-time Permanent Change Proposals for Abandoned Object Detection;an Excursion to Ontology-Based Non-functional Requirements Specification;a Review of Traditional and Neural Network Methods for Protecting Privacy in Big Data Analytics;a Long Short-Term Memory learning Based Malicious Node Detection for Clustering in Wireless Sensor Networks;experimental Analysis for Sensor Reduction to Depict real-time Applications Through Regression Techniques;multi-resolution Neural Network for Road Scene Segmentation;A CNN-Based Road Accident Detection and Comparison of Classification Techniques;football Match Result Prediction Using Twitter Statistical/Historical Data;safeguarding Ecosystems and Efficiency in Peer-to-Peer File Sharing Systems: An IoT-Inspired Approach to Pollution Mitigation;a Heuristic for Minimizing Resource Requirement for Quantum Graph Neural Networks;Light-Gated Recurrent Unit Based Acoustic Modeling for Improved Hindi ASR;Detecting Phishing URLs Using Machine learning: A Review;comparative Analysis of Pneumonia Detection from Chest X-ray Using
Oral cancer can become non-fatal if promptly detected and treated with medication. However, failure to diagnose cancer at an early stage poses a significant risk to lives. Therefore, early detection of oral cancer pla...
详细信息
暂无评论