In video database systems, one of the most important methods for discriminating the videos is by using the objects and the perception of spatial and temporal relations that exist between objects in the desired videos....
详细信息
In video database systems, one of the most important methods for discriminating the videos is by using the objects and the perception of spatial and temporal relations that exist between objects in the desired videos. In this paper, we propose a new spatio-temporal knowledge representation called3d c-string. The knowledge structure of 3d c-string, extended from the 2dc+-string, uses the projections of objects to represent spatial and temporal relations between the objects in a video. Moreover, it can keep track of the motions and size changes of the objects in a video. The string generation and video reconstruction algorithms for the 3d c-string representation of video objects are also developed. By introducing the concept of the template objects and nearest former objects, the string generated by the string generation algorithm is unique for a given video and the video reconstructed from a given 3d c-string is unique too. This approach can provide us an easy and efficient way to retrieve, visualize and manipulate video objects in video database systems. Finally, some experiments are performed to show the performance of the proposed algorithms. (c) 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
We have proposed a new spatio-temporal knowledge structure called3d c-string to represent symbolic videos accompanying with the string generation and video reconstruction algorithms. In this paper, we extend the idea...
详细信息
We have proposed a new spatio-temporal knowledge structure called3d c-string to represent symbolic videos accompanying with the string generation and video reconstruction algorithms. In this paper, we extend the idea behind the similarity retrieval of images in 2dc-string to 3d c-string. Our extended approach consists of two phases. First, we infer the spatial relation sequence and temporal relations for each pair of objects in a video. Second, we use the inferred relations to define various types of similarity measures and propose the similarity retrieval algorithm. By providing various types of similarity between videos, our proposed similarity retrieval algorithm has discrimination power about different criteria. Finally, some experiments are performed to show the efficiency of the proposed approach. (c) 2004 Elsevier Inc. All rights reserved.
The video content management has attracted increasing attention in recent years. We have proposed a new spatio-temporal knowledge structure, called3d c-string, to represent the spatio-temporal relations between the o...
详细信息
The video content management has attracted increasing attention in recent years. We have proposed a new spatio-temporal knowledge structure, called3d c-string, to represent the spatio-temporal relations between the objects in a video and to keep track of the motions and size changes of the objects. In this paper, we propose a video algebra to infer the spatio-temporal relations between the objects in a video represented by the 3d c-string. The algebra contains four kinds of rules, namely, transitive, distributive, manipulation, and integration rules. By using those rules, all the binary relations between the objects in a video can be derived from a given 3d c-string. The algebra provides the theoretic basis for spatio-temporal reasoning and video query inference.
In this paper, we propose a new knowledge structure called3d Z-string, extended from the 2d Z-string, to represent the spatial and temporal relations between objects in a video and to keep track of the motions and si...
详细信息
In this paper, we propose a new knowledge structure called3d Z-string, extended from the 2d Z-string, to represent the spatial and temporal relations between objects in a video and to keep track of the motions and size changes of the objects. Since there are no cuttings between objects in the 3d Z-string, the integrity of objects is preserved. The string generation and video reconstruction algorithms for the 3d Z-string representation of video objects are also developed. The string generated by the string generation algorithm is unique for a given video and the video reconstructed from a given 3d Z-string is unique too. The experimental results show that the 3d Z-string is more compact and efficient than the 3d c-string in terms of storage space and execution time. (c) 2005 Elsevier B.V. All rights reserved.
暂无评论