Aerial search and response plays an important role in finding and rescuing persons in need. Unmanned Aerial Vehicle (UAV) -acquired aerial images provide an intensive profile search area and facilitate identification ...
详细信息
ISBN:
(数字)9798350372748
ISBN:
(纸本)9798350372755
Aerial search and response plays an important role in finding and rescuing persons in need. Unmanned Aerial Vehicle (UAV) -acquired aerial images provide an intensive profile search area and facilitate identification of potential targets. To improve the effectiveness of aircraft Search and Rescue missions, this study compares many object detection algorithms and imageprocessing methods. The investigated object detection methods are Mask R-CNN, SSD, and YOLOv8, and imageprocessing techniques are contrast improvement and image brightness control. The effectiveness of these strategies has been evaluated by using real aerial images that include different targets such as humans, vehicles, and fire. The results show how well the object detection algorithms work to accurately distinguish targets, and how an attractive imageprocessing approach improves the image performance of the best object detection model, as well as how the performance of the best object detection model with imageprocessing can improve the performance of model. This research shows that combining imageprocessing (contrast and brightness) with an optimal object detection algorithm (Mask R-CNN) significantly improves target detection in aerial search and rescue. While only Mask R-CNN performs best, adding imageprocessing increases its accuracy to 0.92, mAP to 0.89, and makes it more reliable in finding people in need. This shows the potential of this joint approach to save people in air rescue missions.
Despite the rapid advance of 3D-aware image synthesis, existing studies usually adopt a mixture of techniques and tricks, leaving it unclear how each part contributes to the final performance in terms of generality. F...
详细信息
Automatic License Plate Recognition (ALPR) is an embedded real-time technology that automatically recognizes a vehicle's license plate. There are numerous uses, ranging from complex security to shared spaces, park...
详细信息
The acquisition of 3D data using indirect 3D Time-of-Flight (3D ToF) sensors is a well-established technology. However, Multi-Path Interference (MPI) is a key issue and causes significant errors in the distance measur...
详细信息
ISBN:
(纸本)9798350376371;9798350376364
The acquisition of 3D data using indirect 3D Time-of-Flight (3D ToF) sensors is a well-established technology. However, Multi-Path Interference (MPI) is a key issue and causes significant errors in the distance measurements. This paper presents a novel compact 3D ToF sensor with real-time multi-path separation. The sensor combines multiple measurements at different modulation frequencies and employs Particle Swarm Optimization (PSO) in combination with Sequential Quadratic Programming (SQP) to separate the different paths. The sensor is used to demonstrate the separation of two different paths in each pixel of the entire image in real-time. Evaluation in various laboratory and real-world scenarios reveals a significant improvement in distance accuracy compared to a standard 3D ToF sensor. Furthermore, the proposed algorithm exhibits processing speeds that are orders of magnitude faster than previously reported separation algorithms with comparable accuracy.
Deep Learning comes under Machine Learning that accomplishes more power and flexibility by learning to present different concepts or relations of real world to simpler concepts. We use Deep learning fundaments in this...
详细信息
This article addresses a critical problem in the field of target interception using a new foldable quadrotor with rotating arms. The primary challenge is maximizing linear acceleration to enhance the quadrotor’s abil...
详细信息
ISBN:
(数字)9798350309249
ISBN:
(纸本)9798350309256
This article addresses a critical problem in the field of target interception using a new foldable quadrotor with rotating arms. The primary challenge is maximizing linear acceleration to enhance the quadrotor’s ability to intercept targets effectively. To address this challenge, we introduce a novel optimization framework. Our study employs two distinct optimization algorithms, namely the genetic algorithms and the whales optimization algorithm, to ascertain the maximum attainable linear acceleration for the foldable quadrotor. The results obtained are confirmed through the use of our innovative foldable quadrotor.
With the development of deep learning, super-resolution image synthesis techniques for enhancing low-resolution images have advanced remarkably. However, mainstream algorithms focus on improving the quality of the ent...
详细信息
ISBN:
(纸本)9781665462198
With the development of deep learning, super-resolution image synthesis techniques for enhancing low-resolution images have advanced remarkably. However, mainstream algorithms focus on improving the quality of the entire image on average and this may result in blurring. In this paper, we propose three key components for synthesizing super-resolution images that can reflect the fine details of an image. We synthesize super-resolution images by image classification. First, the neural network weights learned using the images in the same image category were utilized in synthesizing super-resolution images. For this purpose, image classification was performed using a transfer-trained ResNet. Second, SENet was applied to the generators in our proposed method to obtain detailed information about the images. Finally, the feature extraction network was changed from VGG to ResNet in order to get more important features. As a result, we achieved better image evaluation values (PSNR, NIQE) for the super-resolution images of dogs and cats compared to the previous studies. Furthermore, the images were generated more naturally on the benchmark dataset.
Although AI systems have been applied in various fields and achieved impressive performance, their safety and reliability are still a big concern. This is especially important for safety-critical tasks. One shared cha...
详细信息
ISBN:
(纸本)9783031282409;9783031282416
Although AI systems have been applied in various fields and achieved impressive performance, their safety and reliability are still a big concern. This is especially important for safety-critical tasks. One shared characteristic of these critical tasks is their risk sensitivity, where small mistakes can cause big consequences and even endanger life. There are several factors that could be guidelines for the successful deployment of AI systems in sensitive tasks: (i) failure detection and out-ofdistribution (OOD) detection;(ii) overfitting identification;(iii) uncertainty quantification for predictions;(iv) robustness to data perturbations. These factors are also challenges of current AI systems, which are major blocks for building safe and reliable AI. Specifically, the current AI algorithms are unable to identify common causes for failure detection. Furthermore, additional techniques are required to quantify the quality of predictions. All these contribute to inaccurate uncertainty quantification, which lowers trust in predictions. Hence obtaining accurate model uncertainty quantification and its further improvement are challenging. To address these issues, many techniques have been proposed, such as regularization methods and learning strategies. As vision and language are the most typical data type and have many open source benchmark datasets, this thesis will focus on vision-language data processing for tasks like classification, image captioning, and vision question answering. In this thesis, we aim to build a safeguard by further developing current techniques to ensure the accurate model uncertainty for safety-critical tasks.
This work introduces the first prior case retrieval models for Turkish courts. We investigated the rulings of the Court of Cassation of Turkey. Since law professionals have to find texts they are interested in from en...
详细信息
ISBN:
(纸本)9781665464956
This work introduces the first prior case retrieval models for Turkish courts. We investigated the rulings of the Court of Cassation of Turkey. Since law professionals have to find texts they are interested in from enormous legal databases and computers can process large amounts of text swiftly, information retrieval algorithms are helpful for law professionals. The information retrieval algorithms utilized in this work are recurrent neural network autoencoders, and the combinations of recurrent neural network autoencoders with BM25 algorithms. The combination of the long-short term memory autoencoder with ATIRE BM25 achieves the best scores on our dataset.
With the advent of technology and algorithms in imageprocessing, many wearable aides are available in the market, and researchers across the globe are developing new solutions. The existing solutions and products fai...
详细信息
暂无评论