版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:COLUMBIA UNIVDEPT ELECT ENGNNEW YORKNY 10027
出 版 物:《SIGNAL PROCESSING-IMAGE COMMUNICATION》 (信号处理:图像通信)
年 卷 期:1995年第7卷第3期
页 面:231-248页
核心收录:
主 题:FACE TRACKING MODEL-BASED CODING TELECONFERENCING VIDEO CODING
摘 要:We present a novel and practical way to integrate techniques from computer vision to low bit-rate coding systems for video teleconferencing applications. Our focus is to locate and track the faces of persons in typical head-and-shoulders video sequences, and to exploit the face location information in a classical video coding/decoding system, The motivation is to enable the system to selectively encode various image areas and to produce psychologically pleasing coded images where faces are sharper, We refer to this approach as model-assisted coding. We propose a totally automatic, low-complexity algorithm, which robustly performs face detection and tracking. A priori assumptions regarding sequence content are minimal and the algorithm operates accurately even in cases of partial occlusion by moving objects. Face location information is exploited by a low bit-rate 3D subband-based video coder which uses both a novel model-assisted pixel-based motion compensation scheme, as well as model-assisted dynamic bit allocation with object-selective quantization. By transferring a small fraction of the total available bit-rate from the non-facial to the facial area, the coder produces images with better-rendered facial features. The improvement was found to be perceptually significant on video sequences coded at 96 kbps for an input luminance signal in CIF format, The technique is applicable to any video coding scheme that allows for fine-grain quantizer selection (e.g. MPEG, H.261), and can maintain full decoder compatibility.