In this paper we describe a novel interactive imageviewer incorporating a range of image processing techniques that allows inexperienced users to quickly and easily delineate objects or shapes from a wide range of re...
详细信息
ISBN:
(纸本)0819427527
In this paper we describe a novel interactive imageviewer incorporating a range of image processing techniques that allows inexperienced users to quickly and easily delineate objects or shapes from a wide range of real world images. The viewer is specifically designed to be easily extensible, and this extensibility is demonstrated with the implementation of an iterative user guided segmentation tool. Using this tool objects can be efficiently extracted from images and used as the basis for navigation and retrieval within MAviS, the Multimedia Architecture for video, image, and Sound.
In this paper we consider the problem of similarity between video sequences. Three basic questions are raised and (partially) answered. Firstly, at what temporal duration can video sequences be compared? The frame, sh...
详细信息
ISBN:
(纸本)0819427527
In this paper we consider the problem of similarity between video sequences. Three basic questions are raised and (partially) answered. Firstly, at what temporal duration can video sequences be compared? The frame, shot, scene and video levels are identified. Secondly, given some image or video feature, what are the requirements on its distance measure and how can it be "easily" transformed into the visual similarity desired by the inquirer? Thirdly how can video sequences be compared at different levels? A general approach based on either a set or sequence representation with variable degrees of aggregation is proposed and applied recursively over the different levels of temporal resolution. It allows the inquirer to fully control the importance of temporal ordering and duration. Promising experimental results are presented.
We present a technique for indexing the directional detail and smoothness present in an image. By directional detail we imply strong directional activity in the horizontal, vertical and diagonal direction present in a...
详细信息
ISBN:
(纸本)0819427527
We present a technique for indexing the directional detail and smoothness present in an image. By directional detail we imply strong directional activity in the horizontal, vertical and diagonal direction present in areas of detail and texture. By smoothness we refer to the smooth or low frequency areas of the image which do not contain prominent edge or texture activity. We map the directional information into 3-d vectors, which are then used to build N-d histograms. These histograms can then be used as database indices which can be queried using histogram techniques.
The use of video data in the multimedia environment is increasing rapidly, and so tools to handle large volumes of video data are required. One of the first steps towards creating more versatile video tools is to segm...
详细信息
ISBN:
(纸本)0819427527
The use of video data in the multimedia environment is increasing rapidly, and so tools to handle large volumes of video data are required. One of the first steps towards creating more versatile video tools is to segment the video data, i.e. partition it into its component shots. This paper presents a novel combination of algorithms for video segmentation, utilising histogram comparison, motion vector information (focus of expansion), and edge information to detect transitions between shots. Our method can reliably detect transitions such as camera breaks, fades, dissolves and wipes, in video compressed to the MPEG-I standard.
In this paper videos are analyzed to get a content-based decription of the video. The structure of a given video is useful to index long videos efficiently and automatically. A comparison between shots gives an overvi...
详细信息
ISBN:
(纸本)0819427527
In this paper videos are analyzed to get a content-based decription of the video. The structure of a given video is useful to index long videos efficiently and automatically. A comparison between shots gives an overview about cut frequency, cut pattern, and scene bounds. After a shot detection the shots are grouped into clusters based on their visual similarity. A time-constraint clustering procedure is used to compare only those shots that are positioned inside a time range. Shots from different areas of the video (e.g., begin/end) are not compared. With this cluster information that contains a list about shots and their clusters it is possible to calculate scene bounds. A labeling of all clusters gives a declaration about the cut pattern. It is easy now to distinguish a dialogue from an action scene. The final content analysis is done by the imageMiner* system. The imageMiner system developed at the University of Bremen of the image Processing Department of the Center for Computing Technology realizes content-based imageretrieval for still images through a novel combination of methods and techniques of computer vision and artifical intelligence. The imageMiner system consists of three analysis modules for computer vision, namely for color, texture, and contour analysis. Additionally exists a module for object recognition. The output of the object recognition module can be indexed by a text retrieval system. Thus, concepts like forestscene may;be searched for. We combine the still image analysis with the results of the video analysis in order to retrieve shots or scenes.
A high-level representation of a video clip comprising information about its physical and semantic structure is necessary for providing appropriate processing, indexing and retrieval capabilities for videodatabases. ...
详细信息
ISBN:
(纸本)0819427527
A high-level representation of a video clip comprising information about its physical and semantic structure is necessary for providing appropriate processing, indexing and retrieval capabilities for videodatabases. We describe a novel technique which reduces a sequence of MPEG encoded video frames to a trail of points in a low dimensional space. In our earlier work,(1) we presented techniques applicable in 3-D, but in this paper, we describe techniques that can be extended to higher dimensions where improved performance is expected. In the low-dimensional space, we can cluster frames, analyze transitions between clusters and compute properties of the resulting trail. Portions of the trail can be classified as either stationary or transitional, leading to high-level descriptions of the video. Tracking the interaction of clusters over time, we lay the groundwork for the complete analysis and representation of the video's physical and semantic structure.
In this paper, we present topics related to tracking of video objects in compressed videodatabases in the context of videoretrieval applications. We developed a videoretrieval and tracking system (VORTEX) to enable...
详细信息
In this paper, we present topics related to tracking of video objects in compressed videodatabases in the context of videoretrieval applications. We developed a videoretrieval and tracking system (VORTEX) to enable operation directly on compressed video data. The structure of the video compression standards is exploited in order to avoid the costly decompression operation. This is achieved by utilizing motion compensation-a critical prediction filter embedded in video compression standards-to estimate and interpolate the desired method for template matching. Occlusion analysis, filtering and motion analysis are used to implement fast tracking of objects of interest on the compressed video data. Being presented with a query in the form of template images of objects, the system operates on the compressed video in order to find the images or video sequences where those objects are present and their positions in the image. This enables the retrieval and display of the query-relevant sequences.
An overview is given on the current research and potential future contributions of intelligent, integrated systems using text, image, video, and audio (TIVA) sources. Exchanges and cross-fertilization across the field...
详细信息
An overview is given on the current research and potential future contributions of intelligent, integrated systems using text, image, video, and audio (TIVA) sources. Exchanges and cross-fertilization across the fields of vision, speech processing, natural-language processing, machine learning, and information retrieval are demonstrated.
A rotation, translation, and scaling (RTS) invariant color image indexing technique for imaging database systems is described. To demonstrate the RTS property of the imageretrieval, four databases are experimented in...
详细信息
A rotation, translation, and scaling (RTS) invariant color image indexing technique for imaging database systems is described. To demonstrate the RTS property of the imageretrieval, four databases are experimented in the computer simulation of the algorithm. The proposed technique is found to be unaffected by substantial changes in the database images due to rotation, translation, and scaling. This makes the technique very robust and attractive for a number of applications of the imagestorage and retrieval systems.
video hypermedia systems enable users to retrieve information related to an object by selecting it directly in a video sequence. In video hypermedia systems, users must locate an anchor position according to the motio...
详细信息
ISBN:
(纸本)0819427527
video hypermedia systems enable users to retrieve information related to an object by selecting it directly in a video sequence. In video hypermedia systems, users must locate an anchor position according to the motion of the object. But it is very laborious to locate an anchor to its suitable position according to the motion of the object. We have proposed a new automatic object tracking method and implemented it to the system. A feature of this method is that it includes various automatic error correction algorithms. We evaluated this system on effectiveness on reducing human operations. As a result, the number of operations reduced to 30.3% of the former method, and the time of operations reduced to 60.1% of the former method.
暂无评论