In this paper, hardware implementation of edge detection at real time video signals using Sobel, Robert, Prewitt and Laplacian filters based on FPGA is explained. Besides, filters are compared in many ways. Edge detec...
详细信息
ISBN:
(纸本)9781509064946
In this paper, hardware implementation of edge detection at real time video signals using Sobel, Robert, Prewitt and Laplacian filters based on FPGA is explained. Besides, filters are compared in many ways. Edge detection is an elemantary and fundamental tool for image segmentation and feature extraction. Very high speed hardware like FPGA's are used to implement the image and videoprocessing algorithms for improving the performance of processing systems. Algorithms are implemented on the Xilinx Zynq 7000. The video input signals come from a laptop's HDMI interface to FPGA in order to filter and the detected edges arc displayed on a HDMI display screen.
Banding, a common video quality artifact, it refers to the noticeable gradient steps or abrupt transitions between adjacent colors or shades within an image or video, often as a resulting from low color depth or compr...
详细信息
ISBN:
(纸本)9798331529543;9798331529550
Banding, a common video quality artifact, it refers to the noticeable gradient steps or abrupt transitions between adjacent colors or shades within an image or video, often as a resulting from low color depth or compression algorithms. Assessing banded video quality is vital, especially with the rise of high-quality screens. Several banding metrics have been proposed in recent years to quantify and evaluate the presence of banding artifacts in videos. However, the visibility of banding strongly depends on viewing conditions such as screen brightness, contrast, ambient lighting, and others. This results in inconsistencies in the correlation between banding metrics and the subjective quality perceived across various screens. In this study, we conduct subjective experiments under varied viewing conditions and demonstrate the limited reliability of banding metrics.
We propose an inter mode decision scheme for P slices in the H264 video coding standard. Our scheme initially exploits neighbourhood information jointly with a set of skip mode conditions for enhanced skip mode decisi...
详细信息
ISBN:
(纸本)0819456586
We propose an inter mode decision scheme for P slices in the H264 video coding standard. Our scheme initially exploits neighbourhood information jointly with a set of skip mode conditions for enhanced skip mode decision. It subsequently performs inter mode decision for the remaining macroblocks by using a gentle set of smoothness constraints. For RD performance very close to the standard we achieve 35-58% reduction in run times and 33-55% reduction in CPU cycles for both the rate controlled and the non rate controlled versions of H264. Compared to other work that has been proposed as input to the standard, gains of 9-23% in run times and 7-22% in CPU cycles are also reported.
With the deployment of 2.5G/3G cellular network infrastructure and large number of camera equipped cell phones, the demand for video enabled applications are high. However, for an uplink wireless channel, both the ban...
详细信息
ISBN:
(纸本)0819456586
With the deployment of 2.5G/3G cellular network infrastructure and large number of camera equipped cell phones, the demand for video enabled applications are high. However, for an uplink wireless channel, both the bandwidth and battery energy capability are limited in a mobile phone for the video communication. These technical problems need to be effectively addressed before the practical and affordable video applications can be made available to consumers. In this paper we investigate the energy efficient video communication solution through joint video summarization and transmission adaptation over a slow fading channel. Coding, and modulation schemes, as well as packet transmission strategy are optimized and adapted to the unique packet arrival and delay characteristics of the video summaries. Operational energy efficiency - summary distortion performance is characterized under an optimal summarization setting.
This paper presents a demonstration setup for our open-source intra encoder called uvgVPCCenc, which is optimized for real-time video-based Point Cloud Compression (V-PCC). uvgVPCCenc achieves an average encoding spee...
详细信息
ISBN:
(纸本)9798331529543;9798331529550
This paper presents a demonstration setup for our open-source intra encoder called uvgVPCCenc, which is optimized for real-time video-based Point Cloud Compression (V-PCC). uvgVPCCenc achieves an average encoding speed of 26 frames per second (fps) on an Intel i7-12700 CPU when encoding volumetric video sequences with up to 185 000 points per frame. It is shown to be 700 times as fast as TMC2 reference implementation for V-PCC. Our work is the first to demonstrate real-time intra V-PCC encoding on a consumer-grade desktop computer. It indicates that even the immense computational complexity of intra V-PCC encoding can be tackled for practical applications with effective design and optimization techniques.
A novel approach is proposed to increase the throughput rate and reduce the hardware cost requirement of the linear systolic matrix-vector multiplication architecture for 2-D Fourier transform implementation. This is ...
详细信息
ISBN:
(纸本)0852965222
A novel approach is proposed to increase the throughput rate and reduce the hardware cost requirement of the linear systolic matrix-vector multiplication architecture for 2-D Fourier transform implementation. This is achieved by coding video signal/images more efficiently using 6-bit 1-D DPCM coding system prior to processing. It is shown that using 6-bit DPCM processing results in 64% improvement in speed and a significant saving in the hardware cost. The effect of quantisation errors on DPCM video signal/image Fourier transform is also presented.
With the rapid development of video-on-demand (VOD) and real-time streaming video technologies, the accurate objective assessment of streaming video Quality of Experience (QoE) has become a focal point for optimizing ...
详细信息
ISBN:
(纸本)9798331529543;9798331529550
With the rapid development of video-on-demand (VOD) and real-time streaming video technologies, the accurate objective assessment of streaming video Quality of Experience (QoE) has become a focal point for optimizing streaming-related technologies. However, due to the inherent transmission distortions caused by poor Quality of Service (QoS) conditions in streaming videos, such as intermittent stalling, rebuffering, and drastic changes in video sharpness due to bitrate fluctuations, evaluating streaming video QoE presents numerous challenges. This paper introduces a large and diverse in-the-wild streaming video QoE evaluation dataset - the SJLIVE-1k dataset. This work addresses the limitations of corresponding datasets, which lack in-the-wild video sequences under real network conditions and whose amount of video content is insufficient. Furthermore, we propose an end-to-end objective QoE evaluation strategy that extracts video content and QoS features from the video itself without using any extra information. By implementing self-supervised contrastive learning as the "reminder" to bridge the gap between the different types of features, our approach achieves state-of-the-art results across three datasets. Our proposed dataset will be released to facilitate further research.
A biometric scheme based on the silhouettes and/or textures of the hands is developed. The crucial part of the algorithm is the accurate registration of the deformable shape of the hands since subjects are not constra...
详细信息
ISBN:
(纸本)0819456586
A biometric scheme based on the silhouettes and/or textures of the hands is developed. The crucial part of the algorithm is the accurate registration of the deformable shape of the hands since subjects are not constrained in pose or posture during acquisition. A host of shape and texture features are comparatively evaluated, such as Independent component features (ICA features), Principal Component Analysis (PCA features), Angular Radial Transform (ART features) and the distance transform (DT) based features. Even with a limited number of training data it is shown that this biometric scheme can perform reliably for populations up to several hundreds.
作者:
Wee, SHP Labs
Multimedia Commun & Networking Dept Palo Alto CA USA
Multimedia communication and streaming media services will become mainstream network infrastructure applications in the coming decade. However, there are many challenges that must be overcome. These challenges include...
详细信息
ISBN:
(纸本)0819456586
Multimedia communication and streaming media services will become mainstream network infrastructure applications in the coming decade. However, there are many challenges that must be overcome. These challenges include the Internet's limited ability to handle real-time, low-latency media streams, the need for media security, and an uncertainty of the killer app. The nature of these challenges lends itself to enabling technology innovations in the media delivery and media processing space. Specifically, we envision an overlay infrastructure that supports networked media services that couple media delivery with in-network media processing. The media overlay should be programmable to allow rapid deployment of new applications and services and manageable so as to support the evolving requirements of the resulting usage models. Furthermore, the media overlay should allow for the delivery of protected media content for applications that have security requirements. A properly architected infrastructure can enable real-time multimedia communication and streaming media services in light of the inherent challenges.
As a main video transmission mode for digital media networks, the capability to predict VBR video traffic can significantly improve the effectiveness of quality of services. Therefore, aiming at the complex characteri...
详细信息
ISBN:
(纸本)0819459763
As a main video transmission mode for digital media networks, the capability to predict VBR video traffic can significantly improve the effectiveness of quality of services. Therefore, aiming at the complex characteristics of VBR MPEG videos, a novel intelligent integrated traffic prediction model is proposed based on fuzzy and neural network. The fuzzy predictor reduces the prediction error, and the implementation of neural network is used to lower the computational complexity for real-time operation. Experimental results show that the prediction errors of the proposed model are significantly smaller than the conventional AR models and provide an improved video traffic prediction technique.
暂无评论