The continuous growth in the scale of unmanned aerial vehicle (UAV) applications in transmission line inspection has resulted in a corresponding increase in the demand for UAV inspection imageprocessing. Owing to its...
详细信息
The continuous growth in the scale of unmanned aerial vehicle (UAV) applications in transmission line inspection has resulted in a corresponding increase in the demand for UAV inspection imageprocessing. Owing to its excellent performance in computer vision, deep learning has been applied to UAV inspection imageprocessing tasks such as power line identification and insulator defect detection. Despite their excellent performance, electric power UAV inspection imageprocessing models based on deep learning face several problems such as a small application scope, the need for constant retraining and optimization, and high R&D monetary and time costs due to the black-box and scene data-driven characteristics of deep learning. In this study, an automated deep learning system for electric power UAV inspection image analysis and processing is proposed as a solution to the aforementioned problems. This system design is based on the three critical design principles of generalizability, extensibility, and automation. Pre-trained models, fine-tuning (downstream task adaptation), and automated machine learning, which are closely related to these design principles, are reviewed. In addition, an automated deep learning system architecture for electric power UAV inspection image analysis and processing is presented. A prototype system was constructed and experiments were conducted on the two electric power UAV inspection image analysis and processing tasks of insulator self-detonation and bird nest recognition. The models constructed using the prototype system achieved 91.36% and 86.13% mAP for insulator self-detonation and bird nest recognition, respectively. This demonstrates that the system design concept is reasonable and the system architecture feasible .
Recent advances in both Artificial Intelligent (AI) and the Internet of Things (loT) make it possible to implement surveillance systems that can detect and recognize objects in an automatic manner. It is still challen...
详细信息
Massive image datasets are often required for the proper functioning of machine Learning (ML) and Computer vision (CV) applications. This paper offers a solution to computational challenges in the imageprocessing of ...
详细信息
Massive image datasets are often required for the proper functioning of machine Learning (ML) and Computer vision (CV) applications. This paper offers a solution to computational challenges in the imageprocessing of satellite imagery, by proposing an optimization procedure. The presented approach is verified by an exemplary Python implementation, constituting a standalone tool for automating the dataset creation and labeling, including the extraction of road network data from the national satellite cartography provider. The collected data include detailed road maps along with the parcel information obtained via WebMapService endpoints. The method presented in this paper involves three basic steps: road segmentation (using the Shapely module) to facilitate handling high-resolution orthoimagery, and then a modified Region-of-Interest approach, i.e., removing irrelevant areas, with only roads remaining. This results in obtaining file sizes that are significantly smaller. The presented algorithm also involves asynchronous tile downloading, which, combined with the masking of irrelevant areas, improves not only the efficiency but surprisingly also the accuracy of subsequent ML/CV procedures. The research results of the paper reveal substantial file size reduction, and improved processing efficiency, thus making the optimized geospatial graphical data more practical for ML/CV applications, while still maintaining the original data quality and relevance of the analyzed parcels or infrastructure.
The performance of Convolutional Neural Networks (CNNs) is critically dependent on the underlying computational configurations, particularly in terms of processing capabilities and data handling techniques. As deep le...
详细信息
Sign language is a form of communication prominently used by the deaf-mute community to convey their ideas and thoughts. In the Philippines, local signers use Filipino Sign Language (FSL) derived from the well-known A...
详细信息
ISBN:
(数字)9783031477249
ISBN:
(纸本)9783031477232;9783031477249
Sign language is a form of communication prominently used by the deaf-mute community to convey their ideas and thoughts. In the Philippines, local signers use Filipino Sign Language (FSL) derived from the well-known American Sign Language (ASL). Despite the recent formalization of FSL as the country's official sign language, there is still minimal familiarity among the public. That said, Sign Language Recognition (SLR) systems integrated with machine learning applications have been developed to understand FSL better. However, the prevalent limitations of most of these systems are that it only involves static signs and asynchronous recognition. This study aimed to take this solution further and overcome existing limitations by developing a model capable of recognizing FSL gestures in real-time usable for applications such as in government service centers. To this end, the study proposes the deep learning algorithm Convolutional and Long Short-Term Memory Neural Networks in system capturing of real-time signs from a signer. The proponents considered 15 signs related to common greetings and business transactions. A total of 450 video recordings were collected for the signs with each having an equal number of samples. The collected data underwent cleaning, preprocessing, and augmentation before training. The proposed model's performance was analyzed with the following classification metrics: Accuracy, Precision, Recall, and F1-Score, and was able to achieve 95% accuracy and a macro-average of 0.95 precision, 0.95 Recall, and 0.95 F1-Score. Furthermore, the model had a comparable accuracy and loss between validation and test data-a 95.18% accuracy and 0.13629 loss on validation while 95.93% accuracy and loss of 0.1478 on the test. With that said, the proposed model was well-fit for classifying the 15 signs that involve upper body movements.
Most of the visual information presented to humans doesn’t have a description associated with it, but humans can generally understand the visual information without any detailed description accompanying them. But for...
详细信息
The rapidly growing world population challenges farmers to meet the rising food demand. Monitoring crop phenotypes, or the physical plant traits, is useful in tracking plant development, maintaining plant health, and ...
详细信息
The rapidly growing world population challenges farmers to meet the rising food demand. Monitoring crop phenotypes, or the physical plant traits, is useful in tracking plant development, maintaining plant health, and increasing yield. However, phenotyping efforts are traditionally manual and become tedious for large scale farms. Thus, it is imperative to develop autonomous solutions to monitor plants accurately, remotely, and timely. To meet this objective, computer vision techniques have been used by researchers to perform automatic plant phenotyping on video and image data collected from either indoor, controlled environments or from the field. Furthermore, these methods have focused on using traditional pixel-based processing, machine learning, and deep learning for plant phenotyping. In this study, various modern computer vision techniques are implemented to automatically phenotype plants for agriculture applications, thereby reducing manual labor while accurately detecting important traits to help increase yield.
Fashion is the way we present ourselves which mainly focuses on vision, has attracted great interest from computer vision researchers. It is generally used to search fashion products in online shopping malls to know t...
详细信息
The importance of speech emotion recognition has increased as a result of the acceptance of intelligent conversational assistant services. The communication between humans and machines may be made better via emotion r...
详细信息
As a kind of noise, speckle seriously affects the imaging quality of optical imaging system. However, the speckle image carries a large amount of information related to the physical characteristics of the object surfa...
详细信息
暂无评论