Semantic segmentation on LiDAR imaging is increasingly gaining attention, as it can provide useful knowledge for perception systems and potential for autonomous driving. However, collecting and labeling real LiDAR dat...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Semantic segmentation on LiDAR imaging is increasingly gaining attention, as it can provide useful knowledge for perception systems and potential for autonomous driving. However, collecting and labeling real LiDAR data is an expensive and time-consuming task. While datasets such as SemanticKITTI [1] have been manually collected and labeled, the introduction of simulation tools such as CARLA [2], has enabled the creation of synthetic datasets on demand. In this work, we present a modified CARLA simulator designed with LiDAR semantic segmentation in mind, with new classes, more consistent object labeling with their counterparts from real datasets such as SemanticKITTI, and the possibility to adjust the object class distribution. Using this tool, we have generated SynthmanticLiDAR, a synthetic dataset for semantic segmentation on LiDAR imaging, designed to be similar to SemanticKITTI, and we evaluate its contribution to the training process of different semantic segmentation algorithms by using a naive transfer learning approach. Our results show that incorporating SynthmanticLiDAR into the training process improves the overall performance of tested algorithms, proving the usefulness of our dataset, and therefore, our adapted CARLA simulator. The dataset and simulator are available in https://***/vpulab/SynthmanticLiDAR.
Object detection technology is an important research content in the field of computer vision, and it is one of the important basic technologies for understanding image content. real-time operating system refers to the...
详细信息
ISBN:
(纸本)9798350374315
Object detection technology is an important research content in the field of computer vision, and it is one of the important basic technologies for understanding image content. real-time operating system refers to the operating system that can complete the processing of system request tasks within a specified time, and can provide timely response and high reliability are its main characteristics. Since the video-based target detection algorithm has high requirements on computing power and real-time performance, this paper proposes to deploy the target detection algorithm on the real-time operating system. The experiment verifies that the characteristics of the real-time operating system can improve the real-time performance of the target detection algorithm. Aiming at the hardware system of the industrial computer used in this paper, the principle and construction process of Xenomai real-time operating system are analyzed, and the scheme of building Linux+Xenomai real-time operating system on the industrial computer is proposed. Aiming at the application scenario with stable background and single target, the object detection algorithm based on imageprocessing is studied. Based on background difference method and three-frame difference method, an improved algorithm based on adaptive detection window of target region is proposed. Experimental results show that the improved algorithm has better real-time performance than the basic algorithm. Aiming at the application scenario with complex background and multiple targets, the object detection algorithm based on deep learning is studied, and the full convolutional neural network in Dlib machine learning library is selected for research and implementation. According to the hardware and system environment of this paper, a computational scale estimation method of the total convolutional neural network is proposed, and a method of deploying the network model trained in the GPU environment in the real-time operating system e
The execution of complex functions and high computational tasks remains a major obstacle to efficient and transparent real-timeprocessing. Overcoming these challenges is essential to unlocking the full potential of i...
详细信息
In surveillance video, target tracking is an important part. Based on imageprocessing technology, this paper studies a real-time and effective method to collect and recognize camera motion information. Firstly, the i...
详细信息
ISBN:
(纸本)9798350310801
In surveillance video, target tracking is an important part. Based on imageprocessing technology, this paper studies a real-time and effective method to collect and recognize camera motion information. Firstly, the influence of visual dead angle and illumination on recognition is analyzed. Secondly, according to the characteristic of background light intensity, the corresponding algorithm is designed to realize the positioning and tracking control strategy of the target and surrounding environment scenery. Finally, the correctness of the method is verified by MATLAB simulation software, so as to obtain a better and scalable scheme, which is more economical and feasible after the occlusion rate is minimized.
In order to monitor the video state changes of safe human settlements in realtime, a Mean Shift algorithm is proposed. The video monitoring images collected in realtime were preprocessed with enhancement and denoisi...
详细信息
Foggy weather conditions frequently reduce visibility and degrade video stream quality, affecting a variety of real-world applications such as surveillance, autonomous navigation, and outdoor video recording. This pap...
详细信息
Human image animation aims to generate a human motion video from the inputs of a reference human image and a target motion video. Current diffusion-based image animation systems exhibit high precision in transferring ...
Moving object detection plays a significant role in video surveillance. However, existing moving object detection methods often rely on software implementations, which means low real-time performance and high power co...
详细信息
ISBN:
(纸本)9798331540050;9798331540043
Moving object detection plays a significant role in video surveillance. However, existing moving object detection methods often rely on software implementations, which means low real-time performance and high power consumption. This paper's core detection algorithm employs a frame difference method that is enhanced by morphological filtering. Additionally, we propose an architecture that integrates FPGA(Field Programmable Gate Array) and ARM(Advanced RISC Machine), fully leveraging the parallel computing advantages of FPGA and the high processing efficiency of ARM. The system utilizes a ZYNQ7000 SoC, coupled with an OV7725 camera for image capture and DDR3 SDRAM for data caching, to address the challenges of high-speed data processing and low power consumption. Experimental results show that the system meets the requirements for high real-time performance and low power consumption with a frame rate of 85.9375 frames per second and a total power consumption of 1.101 W.
The video object segmentation (VOS) task involves the segmentation of an object over time based on a single initial mask. Current state-of-the-art approaches use a memory of previously processed frames and rely on mat...
详细信息
ISBN:
(纸本)9781713899921
The video object segmentation (VOS) task involves the segmentation of an object over time based on a single initial mask. Current state-of-the-art approaches use a memory of previously processed frames and rely on matching to estimate segmentation masks of subsequent frames. Lacking any adaptation mechanism, such methods are prone to test-time distribution shifts. This work focuses on matching-based VOS under distribution shifts such as video corruptions, stylization, and sim-to-real transfer. We explore test-time training strategies that are agnostic to the specific task as well as strategies that are designed specifically for VOS. This includes a variant based on mask cycle consistency tailored to matching-based VOS methods. The experimental results on common benchmarks demonstrate that the proposed test-time training yields significant improvements in performance. In particular for the sim-to-real scenario and despite using only a single test video, our approach manages to recover a substantial portion of the performance gain achieved through training on realvideos. Additionally, we introduce DAVIS-C, an augmented version of the popular DAVIS test set, featuring extreme distribution shifts like image-/video-level corruptions and stylizations. Our results illustrate that test-time training enhances performance even in these challenging cases. Project page: https://***/test-time-training-vos/
image recognition and processing technology is an important application direction of artificial intelligence technology. With the growth of demand for various types of video intelligent analysis, the importance of usi...
详细信息
暂无评论