Combining Dirichlet Mixture Models (DMM) with deep learning models for road extraction is an attractive study topic. Benefiting from DMM, the manually labeling work is alleviated. However, DMM suffers from high comput...
详细信息
Combining Dirichlet Mixture Models (DMM) with deep learning models for road extraction is an attractive study topic. Benefiting from DMM, the manually labeling work is alleviated. However, DMM suffers from high computational complexity due to pixel by pixel computations. Also, traditional constant parameter settings of DMM may not be suitable for different target images. To address the above problems, we propose an improved DMM which embeds superpixel strategy and sparse representation into DMM. In our road extraction framework, we first use improved DMM to filter out most backgrounds. Then, a trained deep CNN model is used for further precise road area recognition. To further promote the processing speed, we also apply the superpixel scanning strategy for CNN models. We tested our method on a Shaoshan dataset and proved that our method not only can achieve better results than other compared state-of-the-art image segmentation methods, but the processing speed and accuracy of DMM are also improved.
— imageprocessing is an important task in data processing systems for applications such as medical sectors, remotesensing, and microscopy tomography. Edge recognition is a sort of image division method that is used...
详细信息
The automatic detection and tracking of appearance and behavior anomalies in video surveillance systems is one of the promising areas for the development and implementation of artificial intelligence. In this paper, w...
详细信息
The automatic detection and tracking of appearance and behavior anomalies in video surveillance systems is one of the promising areas for the development and implementation of artificial intelligence. In this paper, we present a formalization of these problems. Based on the proposed generalization, a detection and tracking algorithm that uses the tracking-by-detection paradigm and convolutional neural networks (CNNs) is developed. At the first stage, people are detected using the YOLOv5 CNN and are marked with bounding boxes. Then, their faces in the selected regions are detected and the presence or absence of face masks is determined. Our approach to face-mask detection also uses YOLOv5 as a detector and classifier. For this problem, we generate a training dataset by combining the Kaggle dataset and a modified Wider Face dataset, in which face masks were superimposed on half of the images. To ensure a high accuracy of tracking and trajectory construction, the CNN features of the images are included in a composite descriptor, which also contains geometric and color features, to describe each person detected in the current frame and compare this person with all people detected in the next frame. The results of the experiments are presented, including some examples of frames from processed video sequences with visualized trajectories for loitering and falls.
Hyperspectral imaging is a widely used method in remotesensing, particularly for use in airborne and satellite-based land surveillance. Its versatility is, however, much larger and has also seen usage in everything r...
详细信息
ISBN:
(数字)9798350365474
ISBN:
(纸本)9798350365481
Hyperspectral imaging is a widely used method in remotesensing, particularly for use in airborne and satellite-based land surveillance. Its versatility is, however, much larger and has also seen usage in everything ranging from food processing and surveillance to astronomy and waste sorting. It is also gaining inroads with agricultural research. With most available datasets focusing on per-pixel classification, there is, however, a potential for hyperspectral whole-image analysis, but there is a severe lack of datasets for whole-image analysis. To help fill this gap and facilitate methodological development in whole-image hyperspectral image analysis, we introduce the Hy-perLeaf2024 dataset. The dataset consists of 2410 hyper-spectral images of wheat leaves, along with associated classification and regression targets at both the leaf level and the plot level. In addition to the dataset, we also provide experiments showing the importance of pretraining and highlighting the future research direction in whole-image hyper-spectral image analysis.
This paper introduces a custom-built low-cost camera ring device designed for automatic cast synthesis, able to accurately and instantly scan body parts. The scanned mesh will be used as a backbone model for the cast ...
详细信息
This paper introduces a novel method for object recognition and automatic labeling in large-area remotesensingimages, called LRSAA. The proposed method integrates the YOLOv11 and MobileNetV3-SSD object detection alg...
详细信息
Purpose The overall goal of this research is to develop algorithms for feature-based recognition of 2D parts from intensity images. Most present industrial vision systems are custom-designed systems, which can only ha...
详细信息
Purpose The overall goal of this research is to develop algorithms for feature-based recognition of 2D parts from intensity images. Most present industrial vision systems are custom-designed systems, which can only handle a specific application. This is not surprising, since different applications have different geometry, different reflectance properties of the parts. Design/methodology/approach Computer vision recognition has attracted the attention of researchers in many application areas and has been used to solve many ranges of problems. Object recognition is a type of patternrecognition. Object recognition is widely used in the manufacturing industry for the purpose of inspection. Machine vision techniques are being applied in areas ranging from medical imaging to remotesensing, industrial inspection to document processing and nanotechnology to multimedia databases. In this work, recognition of objects manufactured in mechanical industry is considered. Mechanically manufactured parts have recognition difficulties due to manufacturing process including machine malfunctioning, tool wear and variations in raw material. This paper considers the problem of recognizing and classifying the objects of such mechanical part. Red, green and blue RGB images of five objects are used as an input. The Fourier descriptor technique is used for recognition of objects. Artificial neural network (ANN) is used for classification of five different objects. These objects are kept in different orientations for invariant rotation, translation and scaling. The feed forward neural network with back-propagation learning algorithm is used to train the network. This paper shows the effect of different network architecture and numbers of hidden nodes on the classification accuracy of objects as well as the effect of learning rate and momentum. Findings One important finding is that there is not any considerable change in the network performances after 500 iterations. It has been found that
Robustness of different patternrecognition methods is one of the key challenges in autonomous driving, especially when driving in the high variety of road environments and weather conditions, such as gravel roads and...
详细信息
ISBN:
(数字)9783031064272
ISBN:
(纸本)9783031064272;9783031064265
Robustness of different patternrecognition methods is one of the key challenges in autonomous driving, especially when driving in the high variety of road environments and weather conditions, such as gravel roads and snowfall. Although one can collect data from these adverse conditions using cars equipped with sensors, it is quite tedious to annotate the data for training. In this work, we address this limitation and propose a CNN-based method that can leverage the steering wheel angle information to improve the road area semantic segmentation. As the steering wheel angle data can be easily acquired with the associated images, one could improve the accuracy of road area semantic segmentation by collecting data in new road environments without manual data annotation. We demonstrate the effectiveness of the proposed approach on two challenging data sets for autonomous driving and show that when the steering task is used in our segmentation model training, it leads to a 0.1-2.9% gain in the road area mIoU (mean Intersection over Union) compared to the corresponding reference transfer learning model.
Event cameras are novel bio-inspired sensors, which asynchronously capture pixel-level intensity changes in the form of "events". Due to their sensing mechanism, event cameras have little to no motion blur, ...
详细信息
ISBN:
(数字)9781665469463
ISBN:
(纸本)9781665469463
Event cameras are novel bio-inspired sensors, which asynchronously capture pixel-level intensity changes in the form of "events". Due to their sensing mechanism, event cameras have little to no motion blur, a very high temporal resolution and require significantly less power and memory than traditional frame-based cameras. These characteristics make them a perfect fit to several real-world applications such as egocentric action recognition on wearable devices, where fast camera motion and limited power challenge traditional vision sensors. However, the ever-growing field of event-based vision has, to date, overlooked the potential of event cameras in such applications. In this paper, we show that event data is a very valuable modality for egocentric action recognition. To do so, we introduce N-EPIC-Kitchens, the first event-based camera extension of the large-scale EPIC-Kitchens dataset. In this context, we propose two strategies: (i) directly processing eventcamera data with traditional video processing architectures (E-2(GO)) and (ii) using event-data to distill optical flow information (E-2(GO)MO). On our proposed benchmark, we show that event data provides a comparable performance to RGB and optical flow, yet without any additional flow computation at deploy time, and an improved performance of up to 49' with respect to RGB only information. The NEPIC-Kitchens dataset is available at https:/EgocentricVision/N-EPIC-Kitchens.
The fast landslide segmentation algorithm based on deep learning technology for remotesensingimages can play an important role in disaster analysis and risk assessment. At this stage, deep learning-based landslide s...
详细信息
暂无评论