In this paper, we propose a novel video similarity measure model using video time density function (VTDF) and dynamic programming. First, we employ VTDF to describe the density of video activities in time domain by ca...
详细信息
In this paper, we propose a novel video similarity measure model using video time density function (VTDF) and dynamic programming. First, we employ VTDF to describe the density of video activities in time domain by calculating the inter-frame mutual information. Second, a temporal partition solution is applied to divide each video sequence into equi-sized temporal segments. Third, a new VTDF based similarity measure using correlation is calculated to measure the similarity between two temporal segments. Fourth, dynamic programming is then developed to find the optimal non-linear mapping between two video sequences. A new normalized similarity measure function combing both visual characteristics and temporal information together is to evaluate the semantic similarity of two video sequences. Experimental results show that the proposed measurement model is effective to explore the semantic similarity of video sequences.
This paper presents a novel rank aggregation approach based on contextual information aiming to improve the effectiveness of Content-Based Image Retrieval (CBIR) tasks. In our approach, information encoded in both dis...
详细信息
This paper presents a novel rank aggregation approach based on contextual information aiming to improve the effectiveness of Content-Based Image Retrieval (CBIR) tasks. In our approach, information encoded in both distances among images and ranked lists computed by CBIR systems are used for analyzing contextual information and then re-rank collection images. We conducted several experiments involving shape, color, and texture descriptors. We also evaluated our method in comparison to other rank aggregation approaches. Experimental results demonstrate the effectiveness of our method.
The problem of classification of continuous general data for content based retrieval and describe the scheme that able to classify the audio segments based on the MPEG-7 audio descriptors and description schemes that ...
详细信息
The problem of classification of continuous general data for content based retrieval and describe the scheme that able to classify the audio segments based on the MPEG-7 audio descriptors and description schemes that consist of tools for indexing audio media using probabilistic sound models. The descriptors provide containers for category labels as well as data structures for quantitative information about sound content. We describe the normative tools as well as informative methods for automatic description extraction.
A method for identifying JPEG 2000 images with different coding parameters, such as DWT-filters, code-block sizes, quantization step sizes, and resolution levels, is presented. The proposed method does not produce fal...
详细信息
ISBN:
(纸本)9781457705380
A method for identifying JPEG 2000 images with different coding parameters, such as DWT-filters, code-block sizes, quantization step sizes, and resolution levels, is presented. The proposed method does not produce false-negative matches regardless of the different coding parameters. Moreover, the proposed method is fast because it uses the number of zero-bit-planes. This number can be extracted from the JPEG 2000 codestream by only parsing the header information. Experimental results showed the effectiveness of the proposed method for digital cinema applications.
A 3D-DWT video watermarking technique is proposed in this article. The algorithm is implemented on the basis of human vision system (HVS). To improve the robustness of watermark against JPEG compression, the most rece...
详细信息
A 3D-DWT video watermarking technique is proposed in this article. The algorithm is implemented on the basis of human vision system (HVS). To improve the robustness of watermark against JPEG compression, the most recently proposed techniques embed watermark in to the low-frequency components of the video frames. However, these components hold significant information of the video. Directly replacing the low-frequency components with watermark may introduce undesirable degradation to video quality. Simulation results demonstrate that the embedded watermarks can be almost fully extracted from the JPEG-compressed frames with very high compression ratio. Furthermore, the algorithm is robust against uniform noise attack, frame dropping and frame swapping.
IPTV, which is based on the universal IP infrastructure, has the inherent nature of heterogeneity in terms of contents, networks, terminals, and users. An important solution to cope with such heterogeneity is content ...
详细信息
IPTV, which is based on the universal IP infrastructure, has the inherent nature of heterogeneity in terms of contents, networks, terminals, and users. An important solution to cope with such heterogeneity is content adaptation. This paper reviews the standardization issues related to content adaptation in IPTV standards. We first describe the basic architecture of content adaptation and its integration into the ITU-T IPTV architecture. Then typical methods of content adaptation in practical IPTV systems are discussed in detail. Especially, we highlight the standard metadata tools that are potential to support adaptation methods within ITU-T IPTV architecture. Some recent developments in other standard bodies are also discussed.
MPEG has been developing MPEG-V standard to synchronize real world and virtual world. The MPEG-V Part 5, specifically, provides standardized interface to send commands to the actuators and to receive information from ...
详细信息
ISBN:
(纸本)9781457707223
MPEG has been developing MPEG-V standard to synchronize real world and virtual world. The MPEG-V Part 5, specifically, provides standardized interface to send commands to the actuators and to receive information from the sensors. This paper proposes interace for Global position command, global position sensor, and altitude sensor in XML schema to control the unmanned flying vehicles. We also implemented the land station so that the position of the flying vehicle can be displayed on the screen in real time and the user can control the flying vehicle through the user interface.
This paper explains reasons for choosing a bitmap (BMP) as the basic format for inserting large amounts of additional information into the image. Structure of the BMP format is explained and shown on examples. Formula...
详细信息
This paper explains reasons for choosing a bitmap (BMP) as the basic format for inserting large amounts of additional information into the image. Structure of the BMP format is explained and shown on examples. Formula for calculating the difference of two images is proposed. Experiments were conducted on adding information to the bits of different weights in the bitmap. From the test results, conclusions were derived about the impact of adding information on the quality of original bitmap image.
MPEG has recently released a first public draft of the new Dynamic Adaptive Streaming over HTTP (DASH) specification. The new streaming technique is based on the 3GPP and partly also on the Open IPTV Forum (OIPF) spec...
详细信息
MPEG has recently released a first public draft of the new Dynamic Adaptive Streaming over HTTP (DASH) specification. The new streaming technique is based on the 3GPP and partly also on the Open IPTV Forum (OIPF) specifications. In this paper, we give an overview about the new DASH specification with a special focus on Live streaming services. In order to re-use existing web content distribution schemes, the new streaming technique provides the live stream as a sequence of files, which are continuously downloaded by the streaming client. This way of streaming introduces new delay components into the system streaming. Live Streaming technology is often used for events like sport events to allow other users to virtually participate. It is generally preferred to minimize the end-to-end delay for live services. In the paper, we identify and analyze the different delay components of the new adaptive HTTP streaming technique and how they contribute to the end-to-end delay of live services. We also discuss the dependencies and system implications when minimizing the end to end delay. The evaluation principles are also applicable for other adaptive HTTP streaming formats.
Dynamically reconfigurable hardware is a promising technology that combines in the same device both the high performance and the flexibility that many recent applications demand. However, one of its main drawbacks is ...
详细信息
Dynamically reconfigurable hardware is a promising technology that combines in the same device both the high performance and the flexibility that many recent applications demand. However, one of its main drawbacks is the reconfiguration overhead, which involves important delays in the task execution, usually in the order of hundreds of milliseconds, as well as high energy consumption. One of the most powerful ways to tackle this problem is configuration reuse, since reusing a task does not involve any reconfiguration overhead. In this paper we propose a configuration replacement policy for reconfigurable systems that maximizes task reuse in highly dynamic environments. We have integrated this policy in an external task-graph execution manager that applies task prefetch by loading and executing the tasks as soon as possible (ASAP). However, we have also modified this ASAP technique in order to make the replacements more flexible, by taking into account the mobility of the tasks and delaying some of the reconfigurations. In addition, this replacement policy is a hybrid design-time/run-time approach, which performs the bulk of the computations at design time in order to save run-time computations. Our results illustrate that the proposed strategy outperforms other state-of- the-art replacement policies in terms of reuse rates and achieves near-optimal reconfiguration overhead reductions. In addition, by performing the bulk of the computations at design time, we reduce the execution time of the replacement technique by 10 times with respect to an equivalent purely run-time one.
暂无评论