The recent advances in image based rendering (IBR) have pioneered freely determining the viewing position and angle in a scene from multi-view video. Remembering that a person could also record a personal video for th...
详细信息
The recent advances in image based rendering (IBR) have pioneered freely determining the viewing position and angle in a scene from multi-view video. Remembering that a person could also record a personal video for this arbitrarily selected view and misuse this content, it is apparent that copyright and copy protection problems also exist and should be solved for IBR applications, as well. In our recent work (Alper Koz, 2006), we propose a watermarking method, which embeds the watermark pattern into every frame of multi-view video and extracts this watermark from a rendered image, generated by the nearest-interpolation based light-field rendering (LFR) and watermark detection is achieved for the cases in which the virtual camera could be arbitrarily located on the camera plane only. This paper presents an extension to the previous formulation for the rendered images, which are generated by using bilinear interpolation, namely the most attractive and promising interpolation method in LFR-based applications. Moreover, the location of the virtual camera could be completely arbitrary in this new formulation. The results show that the watermark could be extracted successfully for LFR via bilinear interpolation for any imagery camera location and rotation, as long as the visual quality of the rendered image is preserved.
This paper presents an audio-visual emotion database that can be used as a reference database for testing and evaluating video, audio or joint audio-visual emotion recognition algorithms. Additional uses may include t...
详细信息
ISBN:
(纸本)9781424430109
This paper presents an audio-visual emotion database that can be used as a reference database for testing and evaluating video, audio or joint audio-visual emotion recognition algorithms. Additional uses may include the evaluation of algorithms performing other multimodal signal processing tasks, such as multimodal person identification or audio-visual speech recognition. This paper presents the difficulties involved in the construction of such a multimodal emotion database and the different protocols that have been used to cope with these difficulties. It describes the experimental setup used for the experiments and includes a section related to the segmentation and selection of the video samples, in such a way that the database contains only video sequences carrying the desired affective information. This database is made publicly available for scientific research purposes.
Based on the analysis of the integer transform and quantization in H.264, a guideline was provided considering the human visual characteristics. With this guideline, a new approach according to visual characteristics ...
详细信息
Based on the analysis of the integer transform and quantization in H.264, a guideline was provided considering the human visual characteristics. With this guideline, a new approach according to visual characteristics to pre-determine all-zero blocks with high efficiency was proposed. The experimental results show that the proposed approach reduces the coding computation significantly compared with other detection methods.
暂无评论