The proceedings contain 20 papers. The topics discussed include: automated detection of common IED components on resource constrained computing devices;closed-loop active object recognition with constrained illuminati...
ISBN:
(纸本)9781510650800
The proceedings contain 20 papers. The topics discussed include: automated detection of common IED components on resource constrained computing devices;closed-loop active object recognition with constrained illumination power;deeplearning techniques to identify and classify COVID-19 abnormalities on chest x-ray images;deeplearning architecture search for real-timeimage denoising;self-supervised learning in medical imaging: anomaly detection in MRI using autoencoders;benchmarking the MAX78000 artificial intelligence microcontroller for deeplearning applications;high efficiency sensing in realtime;a local real-time bar detector based on the multiscale radon transform;object detection on resource-constrained platforms using a configurable ensemble of detectors;comparison of onboard processors for rapid target identification in unmanned aircraft systems;and toward a hardware implementation of lidar-based real-time insect detection.
image segmentation in total knee arthroplasty is crucial for precise preoperative planning and accurate implant positioning, leading to improved surgical outcomes and patient satisfaction. The biggest challenges of im...
详细信息
Due to the rapidly increasing number of vehicles and urbanization, the use of parking spaces on the streets has increased significantly. Many studies have been carried out on the determination of parking spaces by usi...
详细信息
Due to the rapidly increasing number of vehicles and urbanization, the use of parking spaces on the streets has increased significantly. Many studies have been carried out on the determination of parking spaces by using the lines in the parking areas. However, the usage areas of this method are very limited since these lines are not found in every parking area. In this research, a unique study has been presented to determine the empty and occupied parking spaces in the parking area by processing the images from the cameras located at high points on the streets with depth calculation, perspective transformation and certain imageprocessing techniques within the framework of specific features. Empty and full parking lots were determined by utilizing perspective transformation and depth measurement techniques, and the data obtained were transferred to the real-time Database environment. In addition to determining the parking spaces, the study also aims to inform users through the mobile application and to prevent traffic congestion, extra fuel consumption, waste of time and air pollution caused by fuel consumption.
Breakwater construction in Indonesia still relies on divers to direct the placement of rock armour units, which is risky and time-constrained. This research aims to replace the diver's task with a deeplearning-ba...
详细信息
ISBN:
(数字)9798350391992
ISBN:
(纸本)9798350392005
Breakwater construction in Indonesia still relies on divers to direct the placement of rock armour units, which is risky and time-constrained. This research aims to replace the diver's task with a deeplearning-based vision system using YOLO-based deeplearning models. The system utilizes image pre-processing technology by applying histogram equalization (HE) techniques to improve image quality before the detection process. This research evaluates the performance of the YOLO-based deeplearning models in detecting armour units in real-time with a focus on various environmental conditions, which are clear and murky water. The analysis reveals clear water consistently supports higher average frame rates (FPS) compared to murky water, maintaining efficient frame processing across all models. In murky water, histogram equalization significantly enhances detection accuracy from 60% to 80% for YOLOv4-tiny and YOLOv7-tiny, demonstrating its effectiveness in challenging conditions. Notably, accuracy remains at 100% for all models in clear water, underscoring their robust performance under optimal visibility conditions.
Given the abundance of images related to operations that are being captured and stored, it behooves firms to innovate systems using imageprocessing to improve operational performance that refers to any activity that ...
详细信息
Given the abundance of images related to operations that are being captured and stored, it behooves firms to innovate systems using imageprocessing to improve operational performance that refers to any activity that can save labor cost. In this paper, we use deeplearning techniques, combined with classic image/signal processing methods, to propose a pipeline to solve certain types of object counting and layer characterization problems in firm operations. Using data obtained by us through a collaborative effort with real manufacturers, we demonstrate that the proposed pipeline method is able to achieve higher than 93% accuracy in layer and log counting. Theoretically, our study conceives, constructs, and evaluates proof of concept of a novel pipeline method in characterizing and quantifying the number of defined items with images, which overcomes the limitations of methods based only on deeplearning or signal processing. Practically, our proposed method can help firms significantly reduce labor costs and/or improve quality and inventory control by recording the number of products in realtime, more accurately and with minimal up-front technological investment. The codes and data are made publicly available online through the INFORMS Journal on Computing GitHub site.
In the rapidly evolving sphere of infrastructure management, early detection of road damage stands paramount for ensuring both safety and longevity. This research introduces an innovative technique for real-time road ...
详细信息
In the rapidly evolving sphere of infrastructure management, early detection of road damage stands paramount for ensuring both safety and longevity. This research introduces an innovative technique for real-time road damage detection by leveraging the Mask R-CNN (Region-based Convolutional Neural Networks) approach. The primary objective was to discern varied forms of damages – from cracks to potholes, ensuring timely interventions and repairs. Utilizing a robust dataset comprising images of multiple road surfaces under different environmental conditions, the Mask R-CNN model was trained exhaustively. Results reveal a commendable accuracy rate, with the model distinguishing between minor aberrations and significant damages adeptly. A distinctive feature was the model's capability to operate in real-time, aiding in instant damage reporting. Furthermore, a comparative analysis with existing methods demonstrated a marked improvement in terms of both detection speed and precision. The findings suggest promising implications for urban planning and road maintenance. The integration of such an approach can revolutionize the manner in which road monitoring is traditionally undertaken, potentially resulting in substantial economic savings and enhanced safety measures.
In this research, we delve into advanced image segmentation techniques applied to drone imagery for various environmental and surveillance applications. By leveraging state-of-the-art models such as UNet, deepLabV3, M...
详细信息
ISBN:
(数字)9798331505073
ISBN:
(纸本)9798331505080
In this research, we delve into advanced image segmentation techniques applied to drone imagery for various environmental and surveillance applications. By leveraging state-of-the-art models such as UNet, deepLabV3, Manet, and Feature Pyramid Network (FPN), our goal is to achieve high precision in segmenting complex aerial scenes. Each of these models possesses unique strengths and weaknesses; hence, we employ an ensemble technique, weighted averaging, to harness their combined capabilities for superior results. Additionally, we incorporate image augmentation techniques to simulate various weather conditions such as haze and raindrops, enhancing the robustness of our models. To manage real-time data efficiently, we implement a streaming pipeline using Apache Kafka and Apache Spark, ensuring scalable and effective processing. Our methods demonstrate significant performance improvements when trained on the original dataset and the combination of original dataset and augmented dataset compared to conventional methods.
The vibration signal is a typical non-stationary signal, making it challenging to use traditional time-frequency analysis techniques for fault diagnosis. Therefore, this work investigates the processing of vibration s...
详细信息
The vibration signal is a typical non-stationary signal, making it challenging to use traditional time-frequency analysis techniques for fault diagnosis. Therefore, this work investigates the processing of vibration signals and proposes a deeplearning method based on processed signals for the fault diagnosis of ball bearings. In this work, the fault diagnosis is formulated as an image classification problem and solved with deeplearning networks. The intrinsic mode functions (IMFs), converted from the vibration signals in the time domain, are then transformed into symmetrized dot pattern (SDP) images. In order to increase classification accuracy, the SDP parameters in this study are chosen by optimizing image similarity. The feasibility and accuracy of the proposed approach are examined experimentally.
With the continuous advancement of deeplearning and imageprocessing technologies, consumer emotion recognition has emerged as a significant area of research in advertising and marketing. Emotional responses from con...
详细信息
With the continuous advancement of deeplearning and imageprocessing technologies, consumer emotion recognition has emerged as a significant area of research in advertising and marketing. Emotional responses from consumers playAa crucial role in optimizing advertising effectiveness and marketing strategies. Among these, micro-expressions- subtle and involuntary facial movements-offer rich emotional cues that can enhance understanding of consumer sentiment. However, existing studies predominantly focus on conventional facial expressions or single-dimensional emotion classification, lacking indepth exploration and accurate detection of micro-expressions. Additionally, current approaches often overlook individual differences and the dynamic nature of emotional changes, resulting in limited accuracy and real-time performance. Effectively leveraging deeplearning and imageprocessing for precise emotion recognition thus presents a critical challenge in modern advertising. Traditional methods-based on facial expressions, speech, or physiological signals-face various limitations in practical applications. Facial expression-based models are sensitive to individual variations and rely heavily on the quality of facial feature extraction. Although speech and physiological signal-based techniques can offer valuable emotional insights, constraints in data acquisition and processing hinder their effectiveness in recognizing complex emotional states. This study aims to enhance the precision and real-time capability of consumer emotion recognition by utilizing deeplearning and imageprocessing techniques. The key research contributions include: (1) proposing an improved preprocessing method for micro-expression images to enhance emotional feature extraction;(2) designing a deeplearning model tailored for micro-expression recognition to optimize emotion classification accuracy;and (3) developing adaptive advertising strategies based on emotion recognition results to maximize adve
Machine vision has been increasingly used to address agricultural issues. One such case is corn field harvest losses and image-based object detection approaches, namely imageprocessing, machine learning, and deep lea...
详细信息
Machine vision has been increasingly used to address agricultural issues. One such case is corn field harvest losses and image-based object detection approaches, namely imageprocessing, machine learning, and deeplearning were investigated to detect and count infield corn kernels, immediately after harvest for combine harvester performance evaluation. A hand-held low-cost RGB camera was used to collect images with kernels of different backgrounds, based on which a 420 images dataset (200, 40, and 180 for training, validation, and testing, respectively) was generated. Three different models for kernel detection were constructed based on imageprocessing, machine learning, and deeplearning. For the imaging processing method, the images were preprocessed (color thresholding, graying, and erosion), followed by Hough circle detection to identify kernels. For the machine learning (cascade detector) and deeplearning (Mask R-CNN, EfficientDet, YOLOv5, and YOLOX), models were trained, validated, and tested. Experimental results showed the overall performance of the deeplearning network YOLOv5 was superior to the other approaches, with a small model size (89.3 MB) and a high model average precision (78.3 %) for object detection. The detection accuracy, undetection rate and F1 value were 90.7 %, 9.3 %, and 91.1 %, respectively, and the average detection rate was 55 fps. This study demonstrates that the YOLOv5 model has the potential to be used as a real-time, reliable, and robust method for infield corn kernel detection.
暂无评论