This Volume 4519 of the conference proceedings contains 30 papers. Topics discussed include video segmentation, video content analysis and retrieval, semantics and knowledge representations, image analysis and retriev...
详细信息
This Volume 4519 of the conference proceedings contains 30 papers. Topics discussed include video segmentation, video content analysis and retrieval, semantics and knowledge representations, image analysis and retrieval, audio analysis and retrieval, video compression and delivery, video access and browsing, vide servers, video rate control and adaptation and video applications.
Current personal Video recorders make it very easy for consumers to record whole TV programs. Our research however, focuses on personalizing TV at a sub-program level. We use a traditional content-Based Information Re...
详细信息
ISBN:
(纸本)9781581133943
Current personal Video recorders make it very easy for consumers to record whole TV programs. Our research however, focuses on personalizing TV at a sub-program level. We use a traditional content-Based Information Retrieval system architecture consisting of archiving and retrieval modules. The archiving module employs a three-layered, multimodal integration framework to segment, analyze, characterize, and classify segments. The retrieval module relies on users' personal preferences to deliver both full programs and video segments of interest. We tested retrieval concepts with real users and discovered that they see more value in segmenting non-narrative programs (e.g. news) than narrative programs (e.g. movies). We benchmarked individual algorithms and segment classification for celebrity and financial segments as instances of non-narrative content. For celebrity segments we obtained a total precision of 94.1% and recall of 85.7%, and for financial segments a total precision of 81.1% and a recall of 86.9%.
Development towards high-bandwidth wireless devices that are capable of processing complex, streaming multimedia is enabling a new breed of network-based media services. Coping with the diversity of network and device...
详细信息
ISBN:
(纸本)0819442429
Development towards high-bandwidth wireless devices that are capable of processing complex, streaming multimedia is enabling a new breed of network-based media services. Coping with the diversity of network and device capabilities requires services to be flexible and able to adapt to the needs and limitations of the environment at hand. Before efficient deployment, multi-platform services require additional issues to be considered, e.g. content handling, digital rights management, adaptability of content, user profiling, provisioning, and the available access methods. The key issue is how the content and the service is being modelled and stored for inauguration. We propose a new service content model based on persistent media objects able to store and manage XHTML-based multimedia services. In our approach, media, content summaries, and other meta-information are stored within media objects that can be queried from the object database. The content of the media objects can also specify queries to the database and links to other media objects. The final presentation is created dynamically according to the service request and user profiles. Our approach allows for dynamic updating of the service database together with user group management, and provides a method for notifying the registered users by different smart messaging methods, e.g. via e-mail or a SMS message. The model is demonstrated with an 'ice-hockey service' running in our platform called Princess. The service also utilizes SMIL and key frame techniques for the video representation.
Education and training are expected to change dramatically due to the combined impact of the Internet, database, and multimedia technologies. However, the distance learning is often impeded by the lack of effective me...
详细信息
ISBN:
(纸本)0769511988
Education and training are expected to change dramatically due to the combined impact of the Internet, database, and multimedia technologies. However, the distance learning is often impeded by the lack of effective methods to retrieve specific parts of a lecture by contents. This paper introduces a new approach to realize the content-based lecture retrieval on the Web. The approach involves: (1) The XML(eXtensible Markup Language)-based semistructured model not only to represent lecture contents but also to exchange them on the Web;(2) The technique to build structural summaries, i.e., schemas, of XML lecture databases. The structural summaries are useful for browsing and querying the database, building indexes, and enabling query optimization;(3) Index structures to speed up the search to find appropriate lecture contents.
The new MPEG-4 Audio standard provides two toolsets for synthetic Audio generation, Audio processing and multimediacontent description called Structured Audio (SA) and BInary Format for Scenes (BIFS). Moving from a s...
详细信息
ISBN:
(纸本)0780370414
The new MPEG-4 Audio standard provides two toolsets for synthetic Audio generation, Audio processing and multimediacontent description called Structured Audio (SA) and BInary Format for Scenes (BIFS). Moving from a systematic analysis of SA and from the implementation of an efficient SA decoder, this paper describes the design of a virtual DSP architecture able to exploit the data level parallelism contained in many typical audio processing algorithms. The proposed virtual DSP architecture shows good performance on general purpose platforms and can be easily adapted and optimized for parallel superscalar devices. The porting and results on a V-LIW DSP device confirm the effectiveness and flexibility of the approach, particularly suitable for standalone embedded solutions.
Pervasive Internet services today promise to provide users with a quick and convenient access to a variety of commercial applications. However, due to unsuitable architectures and poor performance user acceptance is s...
详细信息
ISBN:
(纸本)9729805024
Pervasive Internet services today promise to provide users with a quick and convenient access to a variety of commercial applications. However, due to unsuitable architectures and poor performance user acceptance is still low. To be a major success mobile services have to provide device-adapted content and advanced value-added Web services. Innovative enabling technologies like XML and wireless communication may for the first time provide a facility to interact with online applications anytime anywhere. We present a prototype implementing an efficient multimedia middleware approach towards ubiquitous value-added services using an auction house as a sample application. Advanced multi-feature retrieval technologies are combined with enhanced content delivery to show the impact of modern enterprise information systems on today's e-commerce applications.
This paper reports on prototype systems to provide an infrastructure for the dynamic and flexible repurposing, of multimedia resources held in a large database. The database, called ARKive, holds film, stills, audio a...
详细信息
ISBN:
(纸本)0769510132
This paper reports on prototype systems to provide an infrastructure for the dynamic and flexible repurposing, of multimedia resources held in a large database. The database, called ARKive, holds film, stills, audio and text about globally endangered and native UK animal and plant species as well as their habitats. It aims to offer a wide range of users customised access to both the core multimedia data, and full integration of core data with external educational resources. Aspects covered in the paper include;designing for repurposing with respect to specific audiences, storage and querying using RDF, XSL, SMIL and related technologies. The advantages of the approaches taken are discussed and key issues are highlighted.
This demonstration shows the end-user application of the Video-over-IP (VIP) system. This system encompasses a whole chain of processes for digital video databases, ranging from distributed content production to acqui...
详细信息
This demonstration shows the end-user application of the Video-over-IP (VIP) system. This system encompasses a whole chain of processes for digital video databases, ranging from distributed content production to acquire MPEG-7 metadata (including speech- and video analysis) to the deployment and access to the video database by end users. This system has been developed as an application for the next generation Internet. The end-user application provides distributed search engines, tools to browse and analyze videos, and playlist functionalities. The tools have been developed using a user-centered design approach, to assure usability for the students and teachers in the pilot project.
This paper analyzes the asymptotic performance of Maximum Likelihood (ML) channel estimation algorithms in wideband code division multiple access (WCDMA) scenarios. We concentrate on systems with periodic spreading se...
详细信息
ISBN:
(纸本)0780370414
This paper analyzes the asymptotic performance of Maximum Likelihood (ML) channel estimation algorithms in wideband code division multiple access (WCDMA) scenarios. We concentrate on systems with periodic spreading sequences (period larger than or equal to the symbol span) with high spreading factors, where the transmitted signal contains a code division multiplexed pilot for channel estimation purposes. Assuming randomized training and code sequences, we derive and compare the asymptotic covariances of the training-only (TO), semi-blind conditional ML (CML) and semi-blind Gaussian ML (GML) channel estimators.
An interactive multimedia presentation in a distributed multimedia system requires synchronization of media streams, preprocessing media for content-based retrieval and low-bandwidth transmission over network, and use...
详细信息
An interactive multimedia presentation in a distributed multimedia system requires synchronization of media streams, preprocessing media for content-based retrieval and low-bandwidth transmission over network, and user interface for interacting multimedia presentations. The power of synchronization models is limited to the synchronization specifications and user interactions. We propose an event-based synchronization model that can handle time-based actions while enabling user interactions like backward and skip. For effective transmission of multimedia data, the multimedia data needs to be preprocessed. The sprite generation and moving objects segmentation can reduce the required bandwidth significantly. We propose a method for multiresolution sprite which will allow reproduction of the video at different resolutions. The object segmentation will be extracted by generating a closed boundary for the object. Since the video data may also exist in a compressed format, we also propose to extract new features from the compressed video. We will consider compressed data that is generated by Discrete Cosine Transform (DCT) which has been used in MPEG-1, MPEG-2, MPEG-4 [8] and H263.1. The user will be provided a high-level user interface to access the contents of the presentation. We will test this integrated framework on distance education project over the Internet.
暂无评论