Machine learning and deeplearning are subsets of Artificial Intelligence that have revolutionized object detection and classification in images or *** technology plays a crucial role in facilitating the transition fr...
详细信息
Machine learning and deeplearning are subsets of Artificial Intelligence that have revolutionized object detection and classification in images or *** technology plays a crucial role in facilitating the transition from conventional to precision agriculture,particularly in the context of weed *** agriculture,which previously relied on manual efforts,has now embraced the use of smart devices for more efficient weed ***,several challenges are associated with weed detection,including the visual similarity between weed and crop,occlusion and lighting effects,as well as the need for early-stage weed ***,this study aimed to provide a comprehensive review of the application of both traditional machine learning and deeplearning,as well as the combination of the two methods,for weed detection across different crop *** results of this review show the advantages and disadvantages of using machine learning and deep ***,deeplearning produced superior accuracy compared to machine learning under various *** learning required the selection of the right combination of features to achieve high accuracy in classifyingweed and crop,particularly under conditions consisting of lighting and early growth ***,a precise segmentation stage would be required in cases of *** learning had the advantage of achieving real-timeprocessing by producing smaller models than deeplearning,thereby eliminating the need for additional ***,the development of GPU technology is currently rapid,so researchers are more often using deeplearning for more accurate weed identification.
Detection of abnormalities is important for the security and reliability of computer systems as they heavily rely on logs to detect anomalies. The logs provide general information, errors, warnings, and debugging info...
详细信息
Detection of abnormalities is important for the security and reliability of computer systems as they heavily rely on logs to detect anomalies. The logs provide general information, errors, warnings, and debugging information. Existing approaches for detecting anomalies are sometimes inaccurate due to their limitations related to log-processing leading to loss of semantic significance. Existing approaches, like deeplog and LogAnomaly, have restrictions in detecting irregularities in log frameworks mainly in large dynamic systems. In this paper, we propose a hybrid anomaly detection technique that combines unsupervised approaches such as Self-Organizing Maps, Bert Encoders, and Autoencoders to handle these issues. The approach improves anomaly identification accuracy by employing semantic vectors obtained by the Bert Encoder to recognize patterns with autoencoders. The evaluation results show that the proposed strategy outperforms the existing methods for various types of data including system logs, network traffic, and financial transactions.
Modeling non-euclidean data is drawing extensive attention along with the unprecedented successes of deep neural networks in diverse fields. Particularly, a symmetric positive definite matrix is being actively studied...
详细信息
Modeling non-euclidean data is drawing extensive attention along with the unprecedented successes of deep neural networks in diverse fields. Particularly, a symmetric positive definite matrix is being actively studied in computer vision, signal processing, and medical image analysis, due to its ability to learn beneficial statistical representations. However, owing to its rigid constraints, it remains challenging to optimization problems and inefficient computational costs, especially, when incorporating it with a deeplearning framework. In this paper, we propose a framework to exploit a diffeomorphism mapping between Riemannian manifolds and a Cholesky space, by which it becomes feasible not only to efficiently solve optimization problems but also to greatly reduce computation costs. Further, for dynamic modeling of time-series data, we devise a continuous manifold learning method by systematically integrating a manifold ordinary differential equation and a gated recurrent neural network. It is worth noting that due to the nice parameterization of matrices in a Cholesky space, training our proposed network equipped with Riemannian geometric metrics is straightforward. We demonstrate through experiments over regular and irregular time-series datasets that our proposed model can be efficiently and reliably trained and outperforms existing manifold methods and state-of-the-art methods in various time-series tasks.
Underwater imaging plays a critical role in various fields such as marine biology, environmental monitoring, underwater archaeology, and defense. However, it faces unique challenges including light absorption and scat...
详细信息
Low-power edge devices equipped with Graphics processing Units (GPUs) are a popular target platform for real-time scheduling of inference pipelines. Such application-architecture combinations are popular in Advanced D...
详细信息
Low-power edge devices equipped with Graphics processing Units (GPUs) are a popular target platform for real-time scheduling of inference pipelines. Such application-architecture combinations are popular in Advanced Driver-assistance Systems for aiding in the real-time decision-making of automotive controllers. However, the real-time throughput sustainable by such inference pipelines is limited by resource constraints of the target edge devices. Modern GPUs, both in edge devices and workstation variants, support the facility of concurrent execution of computation kernels and data transfers using the primitive of streams, also allowing for the assignment of priority to these streams. This opens up the possibility of executing computation layers of inference pipelines within a multi-priority, multi-stream environment on the GPU. However, manually co-scheduling such applications while satisfying their throughput requirement and platform memory budget may require an unmanageable number of profiling runs. In this work, we propose a deep Reinforcement learning (DRL)-based method for deciding the start time of various operations in each pipeline layer while optimizing the latency of execution of inference pipelines as well as memory consumption. Experimental results demonstrate the promising efficacy of the proposed DRL approach in comparison with the baseline methods, particularly in terms of real-time performance enhancements, schedulability ratio, and memory savings. We have additionally assessed the effectiveness of the proposed DRL approach using a real-time traffic simulation tool IPG CarMaker.
Aiming at the problem that it is difficult for teachers to obtain accurate classroom status in daily teaching process, which is not conducive to making targeted adjustments to teaching methods, this paper proposes a d...
详细信息
Aiming at the problem that it is difficult for teachers to obtain accurate classroom status in daily teaching process, which is not conducive to making targeted adjustments to teaching methods, this paper proposes a deeplearning-based student class status analysis system. The system uses camera to capture classroom video, and uses image recognition, target detection, deeplearning and other technologies to detect the behavioral state of students in the classroom in realtime and concentration information, and through the statistical analysis of the collected data, it helps the teacher to get timely feedback in the classroom, and better judge the learning state and concentration of students. In order to realize the real-time and accuracy of the system design, the system in the paper introduces the OpenPose model into the YOLOv5 network to identify the students' skeletal keypoints, and synthesizes the results of the processing of the YOLOv5 model and the OpenPose model to make an analysis of the students' classroom behaviors and concentration. The experimental results show that the loss curve can achieve good convergence, and the AP and mAP can reach 95.1% and 88.0%, respectively.
License plate recognition is crucial in Intelligent Transportation Systems (ITS) for vehicle management, traffic monitoring, and security inspection. In highway scenarios, this task faces challenges such as diversity,...
详细信息
License plate recognition is crucial in Intelligent Transportation Systems (ITS) for vehicle management, traffic monitoring, and security inspection. In highway scenarios, this task faces challenges such as diversity, blurriness, occlusion, and illumination variation of license plates. This article explores Recurrent Neural Networks based on Connectionist Temporal Classification (RNN-CTC) for license plate recognition in challenging highway conditions. Four neural network models: ResNet50, ResNeXt, InceptionV3, and SENet, all combined with RNN-CTC are comparatively evaluated. Furthermore, a novel architecture named ResNet50 deep Fusion Network using Connectionist Temporal Classification (ResNet50-DFN-CTC) is proposed. Comparative and ablation experiments are conducted using the Highway License Plate Dataset of Southeast University (HLPD-SU). Results demonstrate the superior performance of ResNet50-DFN-CTC in challenging highway conditions, achieving 93.158% accuracy with a processingtime of 7.91 ms, outperforming other tested models. This research contributes to advancing license plate recognition technology for real-world highway applications under adverse conditions. We propose a novel architecture named ResNet50 deep Fusion Network using Connectionist Temporal Classification (ResNet50-DFN-CTC). Comparative and ablation experiments are conducted using the Highway License Plate Dataset of Southeast University (HLPD-SU). Results demonstrate the superior performance of ResNet50-DFN-CTC in challenging highway conditions, achieving 93.158% accuracy with a processingtime of 7.91 ms, outperforming other tested models. This research contributes to advancing license plate recognition technology for real-world highway applications under adverse conditions. image
Direction-of-arrival (DOA) estimation is a fundamental task in audio signal processing that becomes difficult in real-world environments due to the presence of reverberation. To address this difficulty, Direct-Path Do...
详细信息
Direction-of-arrival (DOA) estimation is a fundamental task in audio signal processing that becomes difficult in real-world environments due to the presence of reverberation. To address this difficulty, Direct-Path Dominance (DPD) tests have been proposed as an effective approach for detecting time-frequency (TF) bins dominated by direct sound, which contain accurate DOA information. These have been found to be particularly efficient when working with spherical arrays. While methods based on neural networks (NNs) have been developed to estimate the DOA, they have limitations such as the need for a large training database, and often understanding of the system's operation is lacking. This work proposes two novel DPD-test methods based on a model-based deeplearning approach that combines the original DPD-test model with a data-driven system. Thus, it is possible to preserve the robustness of the original DPD-test across acoustic environments, while using a data-driven approach to better extract useful information about the direct sound, thereby enhancing the original method's performance. In particular, the paper investigates how energetic, temporal and spatial information contribute to the identification of TF-bins dominated by the direct signal. The proposed methods are trained on simulated data of a single sound source in a room, and evaluated on simulated and real data. The results show that energetic and temporal information provide new information about direct sound, which has not been considered in previous works and can improve its performance.
A lightweight seedling detection model with improved YOLOv8s is proposed to address the seedling identification problem in the replenishment process in industrial vegetable seedling *** CBS module for feature extracti...
详细信息
A lightweight seedling detection model with improved YOLOv8s is proposed to address the seedling identification problem in the replenishment process in industrial vegetable seedling *** CBS module for feature extraction in Backbone and Neck has been replaced with a lightweight depthwise separable convolution (DSC) in order to reduce the number of model parameters and increase the speed of detection. Furthermore, the fifth layer of Backbone has been augmented with efficient multiscale attention (EMA), which can aggregate multi-scale spatial structure information more rapidly through the two branches of the parallel structure, thereby enhancing the extraction of multi-scale features. Ultimately, the computational complexity of the model is further reduced by enhancing the structure of the bottleneck to form the VoVGSCSP module, which replaces the C2f module in Neck. The mAP of the improved model on the test set is 96.2%, its parameters, GFLOPS, and model size are 7.88 M, 20.9, and 16.1 MB, respectively. The detection speed of the algorithm is 116.3 frames per second (FPS), which is higher than that of the original model (107.5 FPS). The results indicate that the improved model can accurately identify empty cell and unqualified seedling in the plug tray in realtime and has a smaller number of parameters and GFLOPS, making it suitable for use on embedded or mobile devices for seedling replenishment and contributing to the realization of automated and unmanned seedling replenishment.
Forest ecosystems are of paramount importance to the sustainable existence of life on earth. Unique natural and artificial phenomena pose severe threats to the perseverance of such ecosystems. With the advancement of ...
详细信息
Forest ecosystems are of paramount importance to the sustainable existence of life on earth. Unique natural and artificial phenomena pose severe threats to the perseverance of such ecosystems. With the advancement of artificial intelligence technologies, the effectiveness of implementing forest monitoring systems based on acoustic surveillance has been established due to the practicality of the approach. It can be identified that with the support of transfer learning, deeplearning algorithms outperform conventional machine learning algorithms for forest acoustic classification. Further, a clear requirement to move the conventional cloud-based sound classification to the edge is raised among the research community to ensure real-time identification of acoustic incidents. This article presents a comprehensive survey on the state-of-the-art forest sound classification approaches, publicly available datasets for forest acoustics, and the associated infrastructure. Further, we discuss the open challenges and future research aspects that govern forest acoustic classification.
暂无评论