Fused Deposition Modeling (FDM), an 3D printing technique being popular for rapidly fabricating polymeric prototypes as well as functional components with gradient structures such as scaffolds still faces significant ...
详细信息
Fused Deposition Modeling (FDM), an 3D printing technique being popular for rapidly fabricating polymeric prototypes as well as functional components with gradient structures such as scaffolds still faces significant hurdles in quality control and defect management. To overcome these limitations, a comprehensive approach has been proposed integrating advanced deeplearning models with an Internet of Things (IoT) based quality control system. The research proposes a framework using Data-efficient image Transformer (DeiT) model, engineered to identify and classify three high-impact FDM defects: warping, layer delamination, and gaps in raster lines. The model has been fine-tuned on a curated dataset of original images, enhanced through pre-processing techniques. The DeiT model combined with a proposed Weighted Classification Accuracy (WCA) approach achieves an accuracy of 99.3%. Furthermore, the response time of the entire system is calculated to be 0.1121 s, providing realtime monitoring and control. The research represents a significant step towards intelligent and optimized manufacturing systems in the context of Industry 4.0, addressing current challenges in FDM printing while paving the way for more autonomous and efficient 3D printing processes in the future.
Featured Application This work can be applied to enhance the robustness of image filtering systems in large-scale content platforms, specifically for detecting unauthorized images and their transformed versions, preve...
详细信息
Featured Application This work can be applied to enhance the robustness of image filtering systems in large-scale content platforms, specifically for detecting unauthorized images and their transformed versions, preventing the dissemination of manipulated *** image filtering systems have become essential in large-scale content platforms to prevent the dissemination of unauthorized data. While extensive research has focused on identifying images based on categories or visual similarity, the filtering problem addressed in this study presents distinct challenges. Specifically, it involves a predefined set of filtering images and requires real-time detection of whether a distributed image is derived from an unauthorized source. Although three major approaches-bitmap-based, imageprocessing-based, and deeplearning-based techniques-have been explored, no comprehensive comparison has been conducted. To bridge this gap, we formalize the concept of image equivalence and introduce performance metrics tailored for fair evaluation. Through extensive experiments, we derive the following key findings. First, bitmap-based methods are practically viable in real-world scenarios, offering reasonable detection rates and fast search speeds even under resource constraints. Second, despite their success in tasks such as image classification, deeplearning-based methods underperform in our problem domain, highlighting the need for customized models and architectures. Third, imageprocessing-based techniques demonstrate superior performance across all key metrics, including execution time and detection rates. These findings provide valuable insights into designing efficient image filtering systems for diverse content platforms, particularly for detecting unauthorized images and their transformations effectively.
In recent years, the proliferation of deepfake images has posed a substantial threat to media credibility, security, and privacy. Contemporary detection techniques, predominantly reliant on deeplearning algorithms, f...
详细信息
In recent years, the proliferation of deepfake images has posed a substantial threat to media credibility, security, and privacy. Contemporary detection techniques, predominantly reliant on deeplearning algorithms, fail to identify the nuanced pixel-level discrepancies inherent in deepfake material. This study introduces PlasmoVision, an innovative quantum-enhanced plasmonic imaging technology that incorporates AI-driven deeplearning for highly sensitive real-timedeepfake detection. deepfakes alter digital images and videos to produce very persuasive fraudulent content, rendering traditional detection methods ineffective. Plasmonic surface resonance technology, in conjunction with quantum dots, has the capacity to capture intricate image features that can disclose such alterations. Integrating deeplearning into this detection system improves the accuracy and velocity of analysis. The PlasmoVision technology employs quantum dot-enhanced plasmonic arrays to detect sub-pixel-level resonance shifts resulting from light interaction with the image surface. The optical signals are analyzed with a sophisticated convolutional neural network (CNN) that categorizes images according to the plasmonic resonance data. The AI model is trained on a varied dataset of genuine and deepfake photos, attaining an ideal equilibrium between detection sensitivity and speed. real-time picture analysis is accomplished by swift plasmonic scanning and AI-driven classification. The suggested device attained an accuracy rate of 98.6% in identifying deepfakes within a test dataset, exhibiting a false positive rate of 1.2% and a false negative rate of 0.5%. The quantum-enhanced plasmonic system identified pixel abnormalities with a sensitivity of up to 10 nm, markedly surpassing conventional deepfake detection technologies. PlasmoVision real-time analysis capacity decreased processingtime by 35% relative to traditional approaches, rendering it exceptionally appropriate for extensive and real-ti
Unmanned aerial vehicles (UAVs) have become essential in disaster management due to their ability to provide real-time situational awareness and support decision-making processes. Visual servoing, a technique that use...
详细信息
Unmanned aerial vehicles (UAVs) have become essential in disaster management due to their ability to provide real-time situational awareness and support decision-making processes. Visual servoing, a technique that uses visual feedback to control the motion of a robotic system, has been used to improve the precision and accuracy of UAVs in disaster scenarios. The study integrates visual servoing to enhance UAV precision while exploring recent advancements in deeplearning. This integration enhances the precision and efficiency of disaster response by enabling UAVs to navigate complex environments, identify critical areas for intervention, and provide actionable insights to decision-makers in realtime. It discusses disaster management aspects like search and rescue, damage assessment, and situational awareness, while also analyzing the challenges associated with integrating visual servoing and deeplearning into UAVs. This review article provides a comprehensive analysis to offer real-time situational awareness and decision support in disaster management. It highlights that deeplearning along with visual servoing enhances precision and accuracy in disaster scenarios. The analysis also summarizes the challenges and the need for high computational power, data processing, and communication capabilities. UAVs, especially when combined with visual servoing and deeplearning, play a crucial role in disaster management. The review underscores the potential benefits and challenges of integrating these technologies, emphasizing their significance in improving disaster response and recovery, with possible means of enhanced situational awareness and decision-making.
One of the deadliest forms of skin cancer is malignant melanoma, developed by aberrant melanocyte cell development. Efficient diagnostic procedures are essential due to the rising prevalence of skin illnesses resultin...
详细信息
Hypercomplex numbers, such as quaternions and octonions, have recently gained attention because of their advantageous properties over real numbers, e.g., in the development of parameter-efficient neural networks. For ...
详细信息
Hypercomplex numbers, such as quaternions and octonions, have recently gained attention because of their advantageous properties over real numbers, e.g., in the development of parameter-efficient neural networks. For instance, the 16-component sedenion has the capacity to reduce the number of network parameters by a factor of 16. Moreover, hypercomplex neural networks offer advantages in the processing of spatiotemporal data as they are able to represent variable temporal data divisions through the hypercomplex components. Similarly, they support multimodal learning, with each component representing an individual modality. In this article, the key components of deeplearning in the hypercomplex domain are introduced, encompassing concatenation, activation functions, convolution, and batch normalization. The use of the backpropagation algorithm for training hypercomplex networks is discussed in the context of hypercomplex algebra. These concepts are brought together in the design of a ResNet backbone using hypercomplex convolution, which is integrated within a U-Net configuration and applied in weather and traffic forecasting problems. The results demonstrate the superior performance of hypercomplex networks compared to their real-valued counterparts, given a fixed parameter budget, highlighting their potential in spatiotemporal data processing.
During urban fire incidents, real-time videos and images are vital for emergency responders and decision-makers, facilitating efficient decision-making and resource allocation in smart city fire monitoring systems. Ho...
详细信息
During urban fire incidents, real-time videos and images are vital for emergency responders and decision-makers, facilitating efficient decision-making and resource allocation in smart city fire monitoring systems. However, real-time videos and images require simple and embeddable models in small computer systems with highly accurate fire detection ratios. YOLOv5s has a relatively small model size and fast processingtime with limited accuracy. The aim of this study is to propose a method that employs a YOLOv5s network with a squeeze-and-excitation module for image filtering and classification to meet the urgent need for rapid and accurate real-time screening of irrelevant data. In this study, over 3000 internet images were used for crawling and annotating to construct a dataset. Furthermore, the YOLOv5, YOLOv5x and YOLOv5s models were developed to train and test the dataset. Comparative analysis revealed that the proposed YOLOv5s model achieved 98.2% accuracy, 92.5% recall, and 95.4% average accuracy, with a remarkable processing speed of 0.009 s per image and 0.19 s for a 35 frames-per-second video. This surpasses the performance of other models, demonstrating the efficacy of the proposed YOLOv5s for real-time screening and classification in smart city fire monitoring systems.
Non-Destructive Evaluation/Testing (NDE/NDT) is comprised of advanced sensor technologies that can evaluate structures, materials and components for defects and analyze their properties. In recent years, researchers h...
详细信息
Non-Destructive Evaluation/Testing (NDE/NDT) is comprised of advanced sensor technologies that can evaluate structures, materials and components for defects and analyze their properties. In recent years, researchers have been applying deeplearning algorithms on NDT technologies to improve the capability of detecting and classifying complex sensor data. However, deeplearning models often require large computational resources including specialized hardware accelerators, dedicated memory blocks and increased power consumption. It is very challenging to implement these deeplearning algorithms in real-time testing scenarios in the field due to limited access to aforementioned computational resources. To address this issue, we introduce a model compression algorithm and the corresponding Field Programmable Gate Array (FPGA) accelerators for a novel deeplearning model targeting ultrasonic NDT techniques. The ultrasonic deeplearning algorithm which is based on Meta learning is capable of detecting and classifying different flaw types (e.g. cracks, holes) within the specimen. The results have shown that the model compression has significantly reduced the required operations with minimal accuracy loss, and the low-cost FPGA hardware platform is able to accelerate the inference using compressed model with high efficiency.
The AdaMax algorithm provides enhanced convergence properties for stochastic optimization problems. In this paper, we present a regret bound for the AdaMax algorithm, offering a tighter and more refined analysis compa...
详细信息
The AdaMax algorithm provides enhanced convergence properties for stochastic optimization problems. In this paper, we present a regret bound for the AdaMax algorithm, offering a tighter and more refined analysis compared to existing bounds. This theoretical advancement provides deeper insights into the optimization landscape of machine learning algorithms. Specifically, the You Only Look Once (YOLO) framework has become well-known as an extremely effective object segmentation tool, mostly because of its extraordinary accuracy in real-timeprocessing, which makes it a preferred option for many computer vision applications. Finally, we used this algorithm for image segmentation.
real-world images captured in remote sensing, image or video retrieval, and outdoor surveillance are often degraded due to poor weather conditions, such as rain and mist. These conditions introduce artifacts that make...
详细信息
real-world images captured in remote sensing, image or video retrieval, and outdoor surveillance are often degraded due to poor weather conditions, such as rain and mist. These conditions introduce artifacts that make visual analysis challenging and limit the performance of high-level computer vision methods. In time-critical applications, it is vital to develop algorithms that automatically remove rain without compromising the quality of the image contents. This article proposes a novel approach called QSAM-Net, a quaternion multi-stage multiscale neural network with a self-attention module. The algorithm requires significantly fewer parameters by a factor of 3.98 than the real-valued counterpart and state-of-the-art methods while improving the visual quality of the images. The extensive evaluation and benchmarking on synthetic and real-world rainy images demonstrate the effectiveness of QSAM-Net. This feature makes the network suitable for edge devices and applications requiring near real-time performance. Furthermore, the experiments show that the improved visual quality of images also leads to better object detection accuracy and training speed.
暂无评论