Light field image (LFI) quality assessment is becoming more and more important, which helps to better guide the acquisition, processing and application of immersive media. However, due to the inherent high dimensional...
详细信息
Rapid growing intelligent applications require optimized bit allocation in image/video coding to support specific task-driven scenarios such as detection, classification, segmentation, etc. Some learning-based framewo...
详细信息
Semantic segmentation is a fundamental task in indoor scene understanding. Most previous supervised approaches rely on densely annotated image data sets. Due to the limited amount of images with segmentation labels, t...
ISBN:
(数字)9781728123455
ISBN:
(纸本)9781728123462
Semantic segmentation is a fundamental task in indoor scene understanding. Most previous supervised approaches rely on densely annotated image data sets. Due to the limited amount of images with segmentation labels, the performance of existing networks is greatly limited. In this paper, we exploit temporal correlation in video frames to improve the performance and robustness of segmentation networks. Two effective learning strategies are proposed to propagate the information from a few labeled frames to their immediate neighbor frames. First, we scale up training dataset for supervised semantic segmentation networks by generating pseudo ground-truth for neighboring frames from a labeled frame using filtered homography transformation. Furthermore, we introduce a self-supervised loss function to ensure temporal consistency between the segmentation results of adjacent frames. The experimental results demonstrate that our proposed method outperforms state-of-the-art techniques for semantic segmentation on NYU-Depth V2 dataset.
Video stitching remains a challenging problem in computer vision. In this paper, we propose a novel edge-guided method to stitch multiple videos that have small overlapped regions. Our algorithm consists of three step...
ISBN:
(数字)9781728123455
ISBN:
(纸本)9781728123462
Video stitching remains a challenging problem in computer vision. In this paper, we propose a novel edge-guided method to stitch multiple videos that have small overlapped regions. Our algorithm consists of three steps: (1) spherical projection of the input video frames based on camera calibration, (2) edge detection and edge-guided feature matching for video registration, and (3) seam optimization to eliminate distortions and ghosts in the composited panoramic videos. The experimental results and user studies demonstrate that our method is robust to videos that have small overlapped regions and produces more visually pleasing panoramic videos than state-of-the-art techniques.
The following topics are dealt with: video coding; data compression; image coding; convolutional neural nets; decoding; learning (artificial intelligence); motion compensation; video codecs; image reconstruction; filt...
The following topics are dealt with: video coding; data compression; image coding; convolutional neural nets; decoding; learning (artificial intelligence); motion compensation; video codecs; image reconstruction; filtering theory.
—Photo-realistic point cloud capture and transmission are the fundamental enablers for immersive visual communication. The coding process of dynamic point clouds, especially video-based point cloud compression (V-PCC...
详细信息
The spaceborne SAR is required to fulfill the increasing demands for improved spatial resolution and wider swath coverage in recent years. The azimuth multi-channel SAR system is a typical technique adopted for realiz...
详细信息
ISBN:
(数字)9781728119465
ISBN:
(纸本)9781728119472
The spaceborne SAR is required to fulfill the increasing demands for improved spatial resolution and wider swath coverage in recent years. The azimuth multi-channel SAR system is a typical technique adopted for realizing high-resolution and wide-swath (HRWS) simultaneously. The flexibility of this system also provides favorable conditions for moving target detection. Aiming at the problems of moving target detection, velocity estimation, and imaging of SAR maritime scenes in the spaceborne multi-channel system, this paper proposes a set of processes based on coarse imaging, detection, velocity estimation and refocusing. The experimental results of the simulation verify the effectiveness of the method.
The Gaofen-3 (GF3) data processor was developed as a workstation-based GF3 synthetic aperture radar (SAR) data processing system. The processor consists of two subsystems of the GF3 ground segment, which are referred ...
详细信息
Segmentation of multiple anatomical structures is of great importance in medical image analysis. In this study, we proposed a W-net to simultaneously segment both the optic disc (OD) and the exudates in retinal images...
详细信息
Synthetic aperture radar (SAR) has a good ability to detect the microwave scattering characteristics of the target and has a good capability of slant range Doppler positioning. Using multi-view SAR images in combinati...
详细信息
暂无评论