In this paper, we propose a new framework based on data mining algorithms for building a Web-page recommender system. A recommender system is an intermediary program (or an agent) with a user interface that automatica...
详细信息
This document, describes a universal multimediaaccess system and its implementation details. In the context of this document, universal multimediaaccess means accessing multimediacontent over ubiquitous computer ne...
详细信息
Efficient access to multimediacontent can be provided, if the media data is enriched with additional information about the content's semantics and functionality. For making full use of domain-specific knowledge f...
详细信息
ISBN:
(纸本)3540238948
Efficient access to multimediacontent can be provided, if the media data is enriched with additional information about the content's semantics and functionality. For making full use of domain-specific knowledge for a specific context this meta information has to be integrated with a domain ontology. In previous research. we have developed Enhanced multimedia Meta Objects (EMMOs) as a new means for semantic multimedia meta modeling, as well as a query algebra EMMA, which is adequate and complete with regard to the EMMO model. This paper focuses on the seamless integration of ontology knowledge into EMMA queries to enable sophisticated query refinement.
Depending on the specific information they are seeking, users desire flexible and intuitive methods to search and browse multimedia libraries. However, the cost of manually extracting the metadata to support such func...
Depending on the specific information they are seeking, users desire flexible and intuitive methods to search and browse multimedia libraries. However, the cost of manually extracting the metadata to support such functionalities may be unrealistically high. Therefore, over the last decade there has been a great interest in designing and building systems that automatically analyze and index multimedia data, and retrieve its relevant parts. In this work we describe algorithms that facilitate browsing, searching, and summarization of video sequences. We propose shot transition detection algorithms for the detection of cut and dissolve types of shot transitions based on a binary tree regression classifiers framework. Our system is able to detect these transitions with high accuracy. We discuss stochastic models to model video program genres, such as news programs or sitcoms, and show how these can be applied to automatically detect the genre of a given program. We investigate the use of hidden Markov Models (HMMs) and stochastic context-free grammars (SCFGs) for modeling. Since the computational complexity of SCFG training is high, we develop a hybrid HMM-SCFG model that reduces the training time of the models considerably. Deriving compact representations of video sequences that are intuitive for users and let them easily and quickly browse large collections of video data is fast becoming one of the most important topics in content-based video processing. Such representations, which we will collectively refer to as video summaries, rapidly provide the user with information about the content of the particular sequence being examined, while preserving the essential message. We propose an automated method to generate video skims for information-rich video programs, such as documentaries, educational videos, and presentations, using statistical analysis based on speech transcripts that are obtained by automatic speech recognition (ASR) from the audio. Ideally one would lik
In this paper, we describe a novel content transcoding middleware for accessing military geospatiall intelligence information in real-time. Intelligence information, including maps and location, category and propertie...
详细信息
ISBN:
(纸本)0780386035
In this paper, we describe a novel content transcoding middleware for accessing military geospatiall intelligence information in real-time. Intelligence information, including maps and location, category and properties of object targets, is adapted for various pervasive devices such as laptop, personal digital assistance (PDA), cellular phone, etc. The middleware is deployed as proxies on the Web using the IBM Websphere Transcoding Publisher (WTP) platform, which facilitates the middleware management. We developed several Java-based plug-ins and Extensible Stylesheet Language (XSL) stylesheets for content transcoding. A prototype has been established and real experiments have demonstrated the effectiveness of this novel middleware.
The increasing diversity of the characteristics of the terminals and networks that are used to accessmultimediacontent through the internet introduces new challenges for the distribution of multimedia data. Scalable...
详细信息
ISBN:
(纸本)0819452114
The increasing diversity of the characteristics of the terminals and networks that are used to accessmultimediacontent through the internet introduces new challenges for the distribution of multimedia data. Scalable video coding will be one of the elementary solutions in this domain. This type of coding allows to adapt an encoded video sequence to the limitations of the network or the receiving device by means of very basic operations. algorithms for creating fully scalable video streams, in which multiple types of scalability are offered at the same time, are becoming mature. On the other hand, research on applications that use such bitstreams is only recently emerging. In this paper, we introduce a mathematical model for describing such bitstreams. In addition, we show how we can model applications that use scalable bitstreams by means of definitions that are built on top of this model. In particular, we chose to describe a multicast protocol that is targeted at scalable bitstreams. This way, we will demonstrate that it is possible to define an abstract model for scalable bitstreams, that can be used as a tool for reasoning about such bitstreams and related applications.
More and more digital services provide capability of distributing digital content to end-users through high-band networks;such as satellite systems.' In such systems, Digital Right Management has become more and m...
详细信息
ISBN:
(纸本)0819455547
More and more digital services provide capability of distributing digital content to end-users through high-band networks;such as satellite systems.' In such systems, Digital Right Management has become more and more important and is encountering great challenges. Digital watermarking is proposed as a possible solution for the digital copyright tracking and enforcement. The nature of DRM systems puts high requirements on the watermark's robustness, uniqueness, easy detection, accurate retrieval and convenient management. We have developed a series of feature-based watermarking algorithms for digital video for satellite transmission.(1-3) In this paper, we will first describe a general secure digital content distribution system model and the requirements of watermark as one mechanism of DRM in digital content distribution applications. Then we will present a few feature-based digital watermarking methods in detail which are integrated with a dynamic watermarking schema to protect the digital content in a dynamic environment. For example, a watermark which is embedded in the DFT feature domain is invariant to rotation, scale and translation. Our proposed DFT domain watermarking schemas in which exploit the magnitude property of the DFT feature domain will allow both robust and easy watermark tracking and detection in the case of copyright infringement using cameras or camcorders. This DFT feature-based watermarking algorithm is able to tolerate large angle rotation and there is no need to search for possible rotated angles, which reduces the complexity of the watermark detection process and allows fast retrieval and easy management. We will then present a wavelet feature-based watermark algorithm for dynamic watermark key updates and key management, and we will conclude the paper with the summary, pointing our future research directions.
Today the availability of large digital content archives (video, ebook, audio) creates many problems in terms of user interaction and data manipulation (browsing, searching). Many approaches have been introduced in th...
详细信息
Today the availability of large digital content archives (video, ebook, audio) creates many problems in terms of user interaction and data manipulation (browsing, searching). Many approaches have been introduced in the past for quickly browsing a digital video library. In this paper we introduce a general framework for representing multimediacontents with a more effective, user-driven, speed-dependent browsing process by using a different user interaction metaphor: the manual/mental process of "leafing through the pages of an illustrated magazine". "Digital Leafing" is a combination of user interactions and speed-dependent data representation of digital contents. In case of video, we conclude that such a solution provides a more neutral and personal browsing and it trades off the complexity of video extraction algorithms with a higher user control.
Enhanced multimedia Meta Objects (EMMOs) are a novel approach to multimediacontent modeling, combining media, semantic relationships between those media, as well as functionality on the media, such as rendering, into...
详细信息
ISBN:
(纸本)3540236627
Enhanced multimedia Meta Objects (EMMOs) are a novel approach to multimediacontent modeling, combining media, semantic relationships between those media, as well as functionality on the media, such as rendering, into tradeable knowledge-enriched units of multimediacontent. For the processing of EMMOs and the knowledge they contain, suitable querying facilities are required. In this paper, we present EMMA, an expressive query algebra that is adequate and complete with regard to the EMMO model. EMMA offers a rich set of formally-defined, orthogonal query operators that give access to all aspects of EMMOs, enable query optimization, and allow the representation of elementary ontology knowledge within queries. Thereby, EMMA provides a sound and adequate foundation for the realization of powerful EMMO querying facilities.
Scalable coding is a technology that encodes a multimedia signal in a scalable manner where various representations can be extracted from a single codestream to fit a wide range of applications. Many new scalable code...
详细信息
ISBN:
(纸本)0819455547
Scalable coding is a technology that encodes a multimedia signal in a scalable manner where various representations can be extracted from a single codestream to fit a wide range of applications. Many new scalable coders such as JPEG 2000 and MPEG-4 FGS offer fine granularity scalability to provide near continuous optimal tradeoff between quality and rates in a large range. This fine granularity scalability poses great new challenges to the design of encryption and authentication systems for scalable media in Digital Rights Management (DRM) and other applications. It may be desirable or even mandatory to maintain a certain level of scalability in the encrypted or signed codestream so that no decryption or re-signing is needed when legitimate adaptations are applied. In other words, the encryption and authentication should be scalable, i.e., adaptation friendly. Otherwise secrets have to be shared with every intermediate stage along the content delivery system which performs adaptation manipulations. Sharing secrets with many patties would jeopardize the overall security of a system since the security depends on the weakest component of the system. hi this paper, we first describe general requirements and desirable features for an encryption or authentication system for scalable media, esp. those not encountered with the non-scalable case. Then we present an overview of the current state of the art of technologies in scalable encryption and authentication. These technologies include full and selective encryption schemes that maintain the original or coarser granularity of scalability offered by an unencrypted scalable codestream, layered access control and block level authentication that reduce the fine granularity of scalability to a block level, among others. Finally, we summarize existing challenges and propose future research directions.
暂无评论