Agriculture is one of the most important industries in any economy since it plays a big role in the food supply chain. Agricultural fields, on the other hand, confront a number of issues, including animal encroachment...
详细信息
ISBN:
(数字)9798350361186
ISBN:
(纸本)9798350361193
Agriculture is one of the most important industries in any economy since it plays a big role in the food supply chain. Agricultural fields, on the other hand, confront a number of issues, including animal encroachment, which can cause severe crop damage and loss. Traditional animal control tactics, such as electrical fences, physical barriers, and scarecrows, can be inefficient, time-consuming and a serious threat to animal lives. The animals either become entangled in the fence's wire mesh or were electrocuted by the electric lines. To overcome these problems we propose a unique method that involves imageprocessing-based animal incursion detection system in agricultural fields using Raspberry Pi and deeplearning technique, mainly the YOLOv7. This technology captures live video feeds of agricultural fields using a camera and analyses them using deeplearning algorithm to detect any animal invasions. If an intrusion is detected, the system emits specific repellent sounds for specific animal via speakers in order to scare them away and alerts the farmers by sending SMS. This method provides an efficient and practical alternative for crop damage prevention and human-wildlife conflict reduction in agricultural settings.
The traditional imageprocessing method of power grid facilities map is based on iconography, which can alleviate the artificial pressure to a certain extent. However, due to the slow speed and low accuracy of the tra...
详细信息
ISBN:
(数字)9798350393682
ISBN:
(纸本)9798350393699
The traditional imageprocessing method of power grid facilities map is based on iconography, which can alleviate the artificial pressure to a certain extent. However, due to the slow speed and low accuracy of the traditional iconography method, it is difficult to be applied in the field of fault inspection. In order to realize intelligent power inspection more quickly and accurately, an image recognition and processing algorithm of power grid facilities map based on deeplearning is proposed to solve the problems of occlusion, inaccurate classification and insufficient feature extraction in the actually collected power grid facilities map images. The convolution operation module and residual module in YOLOv5 algorithm are improved, and the learning depth of the algorithm is deepened by increasing the number of convolution layers. At the same time, the SENet attention mechanism is added to the basic convolution module. The research results show that the accuracy of this model for power equipment identification has reached more than $99 \%$. And the recognition accuracy of fault defects can reach $\mathbf{9 2. 7 4 6} \%$. This model improves the detection accuracy and speed of power grid facilities map images, and also provides a novel and feasible scheme for intelligent detection of power grid facilities map images.
deeplearning models have been a huge success in image recognition which hence can be used for the purpose of text generation. In the field of imaging science, captioning images and videos is regarded as an intellectu...
详细信息
ISBN:
(数字)9798350391770
ISBN:
(纸本)9798350391787
deeplearning models have been a huge success in image recognition which hence can be used for the purpose of text generation. In the field of imaging science, captioning images and videos is regarded as an intellectually difficult job. Visual Geometry Group (VGG); is a standard deep Convolutional Neural Network (CNN) architecture with multiple layers, specifically focusing on the integration of CNN for image feature extraction. Exploring this underlying method, the use of another model is essential for caption generation. Here the Recurrent Neural Network (RNN) comes in use for caption generation from the extracted features. Models named Long Short-Term Memory (LSTM) based on RNN and Bidirectional encoder representation transformer (BERT) based on Transformers have been prominent in ensuring accurate results. The Flicker8k dataset is used which provides a variety of information useful for model training. By testing validation data along with evaluation metrics, we analyze the effectiveness of different models to create consistent and descriptive headlines. Extending our inquiry to encompass title generation using transformer models, while also exploring learning techniques for real-time title generation and delivery using the Open-CV library available in Python to get the output from the camera and display it on screen. The result shows that the LSTM is the best model for captioning, with an accuracy of 65.07% at the epochs of 300 and the BERT model has an accuracy of 31% at the epochs of 2. The findings of this study not only contribute to advancing subtitle enhancement methodologies but also broaden the potential applications of deeplearning techniques in this domain.
Static and dynamic posture analysis was a critical clinical examination in physiotherapy and rehabilitation. It was a time-consuming task for clinicians, so a semi-automatic method can facilitate this process as well ...
详细信息
Static and dynamic posture analysis was a critical clinical examination in physiotherapy and rehabilitation. It was a time-consuming task for clinicians, so a semi-automatic method can facilitate this process as well as provide well-documented medical records and strong infrastructure for deeplearning scenarios. The current research presents a mechatronics platform for static and real-time dynamic posture analysis, which consisted of hybrid computational modules. Our study was a developmental and applied research according to a system development life cycle. The designed modules are as follows: (1) a mechanical structure includes patient place, 360-degree engine, mirror, laser, distance meter, and cams;(2) a software module includes data collection, electronic medical record, semi-automatic image analysis, annotation, and reporting, and (3) a network to exchange raw data with deeplearning server. Patients were informed about the research by their healthcare provider and all data were transformed into a Fourier format, in which the patients remained autonomous without a bit of information. The results show acceptable reliability and validity of the instruments. Also, a telerehabilitation application was designed to cover the patients after diagnosis. We suggest a longer time for data acquisition. It will lead to a more accurate and fully automated dynamic posture analysis. The result of this study suggest that the designed mechatronics device used in conjunction with smartphone application is a valid tool that can be used to obtain reliable measurements.
Statistical Charts contain a wealth of information. As an important way to visualize data presentation, statistical charts allow viewers to obtain a complete and intuitive understanding of the content shown in a very ...
详细信息
ISBN:
(数字)9798331541729
ISBN:
(纸本)9798331541736
Statistical Charts contain a wealth of information. As an important way to visualize data presentation, statistical charts allow viewers to obtain a complete and intuitive understanding of the content shown in a very short time. At present, the research on automatic extraction and understanding of a large amount of text information has been relatively mature. However, even the latest big artificial intelligence models cannot accurately extract statistical graphs, which are personalized and contain a large amount of information. We propose an automatic bar chart data extraction process by combining deeplearning and imageprocessing technology, and construct an intelligent bar chart decoding system. The system is divided into three parts: the classification of statistical chart types, the text detection in the image, the classification of text roles and the image extraction. The original data used to create the chart in the pan-bar graph image is extracted for downstream applications. We evaluate and compare our system on public datasets. The results show that our system has better accuracy.
The advancement of artificial intelligence (AI) has bought many advances to human society as a whole. By using daily activities and integrating the technology from the fruits of AI, we can manage to gain further acces...
详细信息
Intelligent transportation systems rely heavily on accurate traffic sign detection (TSD) to enhance road safety and traffic management. Various methods have been explored in the literature for this purpose, with deep ...
详细信息
Intelligent transportation systems rely heavily on accurate traffic sign detection (TSD) to enhance road safety and traffic management. Various methods have been explored in the literature for this purpose, with deeplearning methods consistently demonstrating superior accuracy. However, existing research highlights the persistent challenge of achieving high accuracy rates while maintaining non-destructive and real-time requirements. In this study, we propose a deeplearning model based on the YOLOv8 architecture to address this challenge. The model is trained and evaluated using a custom dataset, and extensive experiments and performance analysis demonstrate its ability to achieve precise results, thus offering a promising solution to the current research challenge in deeplearning-based TSD.
As the need for on-device artificial intelligence (AI) has increased in recent years, mobile devices tend to be equipped with multiple heterogeneous processors, including CPU, GPU, and Neural processing Unit (NPU). Wh...
详细信息
As the need for on-device artificial intelligence (AI) has increased in recent years, mobile devices tend to be equipped with multiple heterogeneous processors, including CPU, GPU, and Neural processing Unit (NPU). While NPUs can offer low-cost and real-time AI processing capabilities for deep Neural Network (DNN) inference, its limited resources often lead to a trade-off between performance and accuracy, potentially resulting in a non-trivial accuracy drop. To address this problem, we propose a new NPU-GPU Scheduling (NGS) framework for DNN-based video analytics. The main challenge lies in determining when and how to execute inference on the NPU/GPU to satisfy the performance objectives. To make more precise scheduling decisions, we first propose a new image complexity assessment model to replace the existing normalized edge density metric. Then, we formulate the scheduling problem with the objective of maximizing inference accuracy under the given latency constraint, and introduce an adaptive solution based on dynamic programming to determine which frames should be processed on the GPU and when to exit from inference for each of them. Extensive experiments conducted on a real mobile device show that our NGS framework substantially outperforms other solutions, and achieves a close-to-oracle performance.
Underwater images often suffer from serious color bias and blurred features because of the effect of the water bodies on the light. To enhance underwater images, we present SU-DDPM, a method of real-time underwater im...
详细信息
Underwater images often suffer from serious color bias and blurred features because of the effect of the water bodies on the light. To enhance underwater images, we present SU-DDPM, a method of real-time underwater image enhancement (UIE) based on a denoising diffusion probabilistic model (DDPM). SU-DDPM outperforms other baseline and generative adversarial network models in underwater image enhancement, thus establishing a new state-of-the-art baseline. SU-DDPM processes images more rapidly than the diffusion model, which makes it competitive with other deeplearning-based methods. We demonstrate that if conditional DDPM is used directly for the UIE task, the processing speed is slow, and the enhanced images are of poor quality and show color bias. The quality of the enhanced image is improved by combining the degraded image with the reference image in the diffusion stage to create a fusion-DDPM model. The specificity of the UIE task allows us to accelerate the inference process by changing the initial sampling distribution and reducing the number of iterations in the denoising stage of the model. We evaluate SU-DDPM on the UIE task using challenging real underwater image datasets and a synthetic image dataset and compare it to state-of-the-art models. SU-DDPM ensures increased enhancement quality, and enhancement processing speed is comparable to the speed of real-time enhancement models.
Correct identification of the most recent case of pneumonia fever determines successful therapy and management of the condition. Computable tomography (CT) scans can rapidly and precisely classify and evaluate cases o...
详细信息
暂无评论