Interactive broadcasting is now considered as a next generation broadcasting service, which covers territorial, mobile and wireless terminals. In interactive broadcasting, viewers not only watch the broadcasting progr...
详细信息
ISBN:
(纸本)9781581136203
Interactive broadcasting is now considered as a next generation broadcasting service, which covers territorial, mobile and wireless terminals. In interactive broadcasting, viewers not only watch the broadcasting programs but also pass their requirements to program providers. In order to represent this interactivity, it is considered that the MPEG-4 is a well-adopted standard because of its object-based scene description scheme, which is in the binary (BIFS) and textual (XMT) formats. This paper describes the XMT API that can generate, manipulate and translate an XML document for the interactive broadcasting content description, and also introduce an authoring system based on the provided the XMTAPI. Since the XMT is a textual format, content authors can easily exchange contents with other creators, applications and tools. This exchangeability of the XMT makes that authors can create interactive broadcasting contents more efficiently and rapidly. Therefore, our XMT API becomes core component module for developing interactive broadcasting contents.
Annotating image collections is crucial for different multimedia applications. Not only this provides an alternative access-to visual information but it is a critical step to perform the evaluation of content-based im...
详细信息
ISBN:
(纸本)081944412X
Annotating image collections is crucial for different multimedia applications. Not only this provides an alternative access-to visual information but it is a critical step to perform the evaluation of content-based image retrieval systems. Annotation is a tedious task so that there is a real need for developing tools that lighten the work of annotators. The tool should be flexible and offer customization so as to make the annotator the most comfortable. It should also automate the most tasks as possible. In this paper, we present a still image annotation tool that has been developed with the aim of being flexible and adaptive. The principle is to create a set of dynamic web pages that are an interface to a SQL database. The keyword set is fixed and every image receives from concurrent annotators a set of keywords along with time stamps and annotator IDs. Each annotator has the possibility of going back and forth within the collection and its previous annotations. He is helped by a number of search services and customization options. An administrative section allows the supervisor to control the parameter of the annotation, including the keyword set, given via an XML structure. The architecture of the tool is made flexible so as to accommodate further options through its development.
Providing natural and efficient access to the fast growing multimedia information, accommodating a variety of user skills and preferences, is a critical aspect of content-based information mining. Query by humming pro...
详细信息
Providing natural and efficient access to the fast growing multimedia information, accommodating a variety of user skills and preferences, is a critical aspect of content-based information mining. Query by humming provides a natural means for content-based retrieval from music databases. A statistical pattern recognition approach for recognizing hummed or sung melodies is reported in this paper. Being data-driven, the proposed system aims at providing a robust front-end especially for dealing with variability in user's productions. The segment of a note in the humming waveform is modeled by a hidden Markov model (HMM) while data features such as pitch measures are modeled by Gaussian mixture models (GMM). Preliminary real-time recognition experiments are carried out based on humming data obtained from eight users and an overall correct recognition rate of around 80% is demonstrated.
E-health is greatly impacting on information distribution and availability within the health services, hospitals and to the public. Previous research has addressed the development of system architectures with the aim ...
详细信息
E-health is greatly impacting on information distribution and availability within the health services, hospitals and to the public. Previous research has addressed the development of system architectures with the aim of integrating the distributed and heterogeneous medical information systems. Easing the difficulties in the sharing and management of multimedia medical data and the timely accessibility to these data are critical needs for health care providers. We have proposed a client-server agent that integrates and allows a portal to every permitted information system of the hospital that consists of picture archiving and communication systems (PACS), radiology information system (RIS) and hospital information system (HIS) via the intranet and the Internet. Our proposed agent enables remote access into the usually closed information system of the hospital and a server that manages all the multimedia medical data and allows for in-depth and complex search queries for contentaccess and automatic creation of patient reports for distribution.
hi recent years, there has been a growing interest in developing effective methods for searching large image databases based on image content. A commonly used method is search-by-query, that is often not satisfactory....
详细信息
ISBN:
(纸本)0819446416
hi recent years, there has been a growing interest in developing effective methods for searching large image databases based on image content. A commonly used method is search-by-query, that is often not satisfactory. Often it is difficult to find or produce good query images or repetitive queries tend to become trapped among a small group of undesirable images. To overcome these problems the user is to be provided with easy and intuitive access to information in image databases. In this paper we present a new browsing environment, which uses the metaphor of maps. Like street maps with different scales, from a world map to a city map, the image space is represented through "image maps" with different scales and user adapted similarity metrics. Beginning with a global view about the whole image database, containing only representative images (key images), the user can enter into any domain of the database by selecting appropriate key frames and iteratively refine the search process. Three different technologies, which in combination guarantee an intuitive browsing environment, are presented. These are: (1) A hierarchical organization of the image database as basis for an efficient iterative retrieval procedure from the global view to the finest scale. (2) A new relevance feedback technique, which computes the appropriate image similarity metric based on the users interaction, and (3) An intuitive three-dimensional visualization of the result images in such a way that the mutual dissimilarity of the images matches their distances in the virtual image space, the "image map".
The proceedings contain 31 papers. The topics discussed include: G-snake double transformation (GDT) model for semi-automatic registration of internet maps;classification of web pages with geographic scope and level o...
ISBN:
(纸本)0769518133
The proceedings contain 31 papers. The topics discussed include: G-snake double transformation (GDT) model for semi-automatic registration of internet maps;classification of web pages with geographic scope and level of details for mobile cache management;participatory decision-making for ecological planning;web locality based ranking utilizing location names and link structure;bus catcher: a context sensitive prototype system for public transportation users;a formal verification strategy for crash recovery in web-database applications;efficiency and performance of web cache reporting strategies;information concepts for content management;elegant decision tree algorithm for classification in data mining;ontology-based web annotation framework for hyperlink structures;an XML specification language to support a virtual marketplace of data mining e-services;a matrix approach for hierarchical web page clustering based on hyperlinks;and evaluating web access log mining algorithms: a cognitive approach.
The proceedings contain 32 papers. The special focus in this conference is on Data Mining, Knowledge Discovery, Mobile Databases, Spatiotemporal and Spatial Databases. The topics include: Privacy and security in asp a...
ISBN:
(纸本)3540441387
The proceedings contain 32 papers. The special focus in this conference is on Data Mining, Knowledge Discovery, Mobile Databases, Spatiotemporal and Spatial Databases. The topics include: Privacy and security in asp and web service environments;an axiomatic approach to defining approximation measures for functional dependencies;intelligent support for information retrieval in the www environment;an approach to improve text classification efficiency;semantic similarity in content-based filtering;data access paths for frequent itemset discovery;monitoring continuous location queries using mobile agents;optimistic concurrency control based on timestamp interval for broadcast environment;a flexible personalization architecture for wireless internet based on mobile agents;approximate algorithms for distance-based queries in high-dimensional data spaces using r-trees;efficient similarity search in feature spaces with the q-tree;spatio-temporal geographic information systems;an access method for integrating multi-scale geometric data;OLAP query evaluation in a database cluster;a framework to analyse and evaluate information systems specification languages;a semantic query optimization approach to optimize linear datalog programs;a meta model for structured workflows supporting workflow transformations;towards an exhaustive set of rewriting rules for Xquery optimization;architecture of a blended-query and result-visualization mechanism for web-accessible databases and associated implementation issues;accommodating changes in semistructured databases using multidimensional OEM;towards variability modelling for reuse in hypermedia engineering and complex temporal patterns detection over continuous data streams.
We propose a new multimedia data model, called the scalable data model (SDM), to describe how scalable pervasive Internet access with efficient data reuse can be achieved. This new data model not only provides a theor...
详细信息
We propose a new multimedia data model, called the scalable data model (SDM), to describe how scalable pervasive Internet access with efficient data reuse can be achieved. This new data model not only provides a theoretical foundation for maximum possible reuse of transcoded data for scalable pervasive Internet access, but its supporting architecture also ensures the feasibility and practicability because of its matching to the emerging trends in HTTP and multimedia data format.
The current systems for the support of online learning provide a certain degree of multimedia capability by allowing users to access online learning objects but are still limited when accessing the actual content of l...
详细信息
ISBN:
(纸本)0769515096
The current systems for the support of online learning provide a certain degree of multimedia capability by allowing users to access online learning objects but are still limited when accessing the actual content of learning objects and exchanging experiences about the learning objects. The work introduced in this paper describes an innovative multimedia tool for enriched communication between users and highly flexible control of the use of learning objects.
A number of content-based music retrieval systems have been presented in the last few years. Although query-by-humming has been one of the preferred strategies, the translation of voice input into note-like attributes...
详细信息
A number of content-based music retrieval systems have been presented in the last few years. Although query-by-humming has been one of the preferred strategies, the translation of voice input into note-like attributes has been tackled with naive algorithms. In this paper, we present a system developed to consider the peculiar facets of the singing voice. A pre-processing block for signal enhancement, an algorithm for pitch detection and a labeling stage for fine adjustment of intonation are addressed in the context of querying by singing. Experimental results confirm that this system is able to translate a sung performance into a representation close to a music score, thus particularly suited for melody-based search.
暂无评论