We present a model based coding (MBC) scheme which employs 3D scene information resulting in more efficient image coding. We propose a balancing between texture and difference, such that MBC is a dynamic enhancement t...
详细信息
ISBN:
(纸本)9781457720482
We present a model based coding (MBC) scheme which employs 3D scene information resulting in more efficient image coding. We propose a balancing between texture and difference, such that MBC is a dynamic enhancement to current standard image encoders: if no usable model data can be produced, all information remains in the difference, corresponding to standard encoding schemes. The higher the model quality, the more information can be transmitted. The result is an encoding scheme which offers at least the code rate of current encoders and additionally produces a 3D model.
Pattern discovery is a potential boon for data compression, but current approaches are inefficient and produce cumbersome pattern descriptions. The clustered causal state algorithm is a new pattern-discovery algorithm...
详细信息
Pattern discovery is a potential boon for data compression, but current approaches are inefficient and produce cumbersome pattern descriptions. The clustered causal state algorithm is a new pattern-discovery algorithm that incorporates recent clustering technology.
Emerging 3-D displays show several views of the scene simultaneously. A direct transmission of a selection of these views is impractical., because various types of displays support a different number of views and the ...
详细信息
ISBN:
(纸本)0819460958
Emerging 3-D displays show several views of the scene simultaneously. A direct transmission of a selection of these views is impractical., because various types of displays support a different number of views and the decoder has to interpolate the intermediate views. The transmission of multiview image information can be simplified by only transmitting the texture data for the central view and a corresponding depth map. Additional to the coding of the texture data, this technique requires the efficient coding of depth maps. Since the depth map represents the scene geometry and thereby covers the 3-D perception of the scene. sharp edges corresponding to object boundaries, should be preserved. We propose an algorithm that models depth maps using pieceivise-linear functions (platelets). To adapt to varying scene detail, we employ a quadtree decomposition that divides the image into blocks of variable size, each block being approximated by one platelet. In order to preserve sharp object boundaries, the support area of each platelet is adapted to the object boundary. The subdivision of the quadtree and the selection of the platelet type are optimized such that a global rate-distortion trade-off is realized. Experimental results show that the described method can improve the resulting picture quality after compression of depth maps by 1 - 3 dB when compared to a JPEG-2000 encoder.
A novel modelbased and waveform based hybrid coding system aimed at very low bit rate coding of face images in video sequences is presented. The well known Principal Component Analysis (PCA) concept which is widely u...
详细信息
ISBN:
(纸本)0819459763
A novel modelbased and waveform based hybrid coding system aimed at very low bit rate coding of face images in video sequences is presented. The well known Principal Component Analysis (PCA) concept which is widely used for face recognition systems is adopted for model based coding, with modifications to cope with video compression. model failures and unknown objects are handled by waveform basedcoding employing conventional prediction/transform block-basedcoding scheme. The two coding modes are combined under rate-distortion framework, where Lagrangian cost function is used to determine the most efficient prediction for each block. Simulations show that the system can achieve high compression ratios while maintaining the robustness and generality, which indicates its potential use for videophone application.
Three dimensional model based coding methods are proposed as next generation image coding methods. These new representations need 3D reconstruction techniques. This paper presents a method that extracts the surfaces o...
详细信息
Three dimensional model based coding methods are proposed as next generation image coding methods. These new representations need 3D reconstruction techniques. This paper presents a method that extracts the surfaces of static objects that occlude other objects from a spatiotemporal image captured with straight-line camera motion. We propose the concept of occlusion types and show that the occlusion types are restricted to only eight patterns. Furthermore, we show occlusion type pairs contain information that confirms the existence of surfaces. Occlusion information gives strong cues for segmentation and representation. The method can estimate not only the 3D positions of edge points but also the surfaces bounded by the edge points. We show that combinations of occlusion types contain information that can confirm surface existence. The method was tested successfully on real images by reconstructing flat and curved surfaces. Videos can be hierarchically structured with the method. The method makes various applications possible, such as object selective image communication and object selective video editing.
3D model-basedcoding methods that need 3D reconstruction techniques are proposed for nest-generation image coding methods. A method is presented that reconstructs 3D shapes of dynamic objects from image sequences cap...
详细信息
3D model-basedcoding methods that need 3D reconstruction techniques are proposed for nest-generation image coding methods. A method is presented that reconstructs 3D shapes of dynamic objects from image sequences captured using two cameras, thus avoiding the stereo correspondence problem. A coaxial camera system consisting of one moving and one static camera was developed. The optical axes of both cameras are precisely adjusted and have the same orientation using an optical system with true and half mirrors. The moving camera is moved alone a straight horizontal line. This method can reconstruct 3D shapes of static objects as well as dynamic objects using motion vectors calculated from the moving camera images and revised using the static camera image. The method was tested successfully on real images by reconstructing a moving human shape.
The Moving Pictures Experts Group (MPEG) of the International Standardization Organization has initiated a standardization effort, known as MPEG-4, addressing generic audiovisual coding at very low bit-rates (up to 64...
详细信息
ISBN:
(纸本)081941638X
The Moving Pictures Experts Group (MPEG) of the International Standardization Organization has initiated a standardization effort, known as MPEG-4, addressing generic audiovisual coding at very low bit-rates (up to 64 kbits/s) with applications in videotelephony, mobile audiovisual communications, video database retrieval, computer games, video over Internet, remote sensing, etc. This paper gives a survey of the status of MPEG-4, including its planned schedule, and initial ideas about requirements and applications. A significant part of this paper is summarizing an incomplete draft version of a `requirements document' which presents specifications of desirable features on the video, audio, and system level of the forthcoming standard. Very low bit-rate coding algorithms are not described, because no endorsement of any particular algorithm, or class of algorithms, has yet been made by MPEG-4, and several seminars held concurrently with MPEG-4 meetings have not so far provided evidence that such high performance coding schemes are achievable.
暂无评论