One of the major goals of the 5G technology roadrnap is to create disruptive innovation for the efficient use of the radio spectrum to enable rapid access to bandwidth-intensive multimedia services over wireless netwo...
详细信息
One of the major goals of the 5G technology roadrnap is to create disruptive innovation for the efficient use of the radio spectrum to enable rapid access to bandwidth-intensive multimedia services over wireless networks. The biggest challenge toward this goal lies in the difficulty in exploiting the multicast nature of the wireless channel in the presence of wireless users that rarely access the same content at the same time. Recently, the combined use of wireless edge caching and coded multicasting has been shown to be a promising approach to simultaneously serve multiple unicast demands via coded multicast transmissions, leading to order-of-magnitude bandwidth efficiency gains. I lowever, a crucial open question is how these theoretically proven throughput gains translate in the context of a practical implementation that accounts for all the required coding and protocol overheads. In this article, we first provide an overview of the emerging caching-aided coded multicast technique, including state-of-the-art schemes and their theoretical performance. We then focus on the most competitive scheme proposed to date and describe a fully working prototype implementation in CorteXlab, one of the few experimental facilities where wireless multiuser communication scenarios can be evaluated in a reproducible environment. We use our prototype implementation to evaluate the experimental performance of state-of-the-art caching-aided coded multicast schemes corn pared to state-of-the-art uncoded schemes, with special focus on the impact of coding computation and communication overhead on the overall bandwidth efficiency performance. Our experimental results show that coding overhead does not significantly affect the promising performance gains of coded multicasting in small-scale real world scenarios, practically validating its potential to become a key next generation CG technology.
This paper presents a novel power-aware motion estimation algorithm, called adaptive content-based subsample algorithm (ACSA), for battery-powered multimedia devices. While the battery status changes, the architecture...
详细信息
This paper presents a novel power-aware motion estimation algorithm, called adaptive content-based subsample algorithm (ACSA), for battery-powered multimedia devices. While the battery status changes, the architecture adaptively performs graceful tradeoffs between power consumption and compression quality. As the available energy decreases, the algorithm raises the subsample rate for maximizing battery lifetime. Differing from the existing subsample algorithms, the content-based algorithm first extracts edge pixels from a macro-block and then subsamples the remaining low-frequency part. In this way, we can alleviate the aliasing problem and thus keep the quality degradation low as the subsample rate increases. As shown in experimental results, the architecture can dynamically operate at different power consumption modes with little quality degradation according to the remaining capacity of battery pack while the power overhead of edge extraction is under 0.8%.
During the European Cantata project (ITEA project, 2006-2009), a Multi-content Analysis framework for the classification of compound images in various categories (text, graphical user interface, medical images, other ...
详细信息
ISBN:
(纸本)9780819479334
During the European Cantata project (ITEA project, 2006-2009), a Multi-content Analysis framework for the classification of compound images in various categories (text, graphical user interface, medical images, other complex images) was developed within Barco. The framework consists of six parts: a dataset, a feature selection method, a machine learning based Multi-content Analysis (MCA) algorithm, a Ground Truth, an evaluation module based on metrics and a presentation module. This methodology was built on a cascade of decision tree-based classifiers combined and trained with the AdaBoost meta-algorithm. In order to be able to train these classifiers on large training datasets without excessively increasing the training time, various optimizations were implemented. These optimizations were performed at two levels: the methodology itself (feature selection / elimination, dataset pre-computation) and the decision-tree training algorithm (binary threshold search, dataset presorting and alternate splitting algorithm). These optimizations have little or no negative impact on the classification performance of the resulting classifiers. As a result, the training time of the classifiers was significantly reduced, mainly because the optimized decision-tree training algorithm has a lower algorithmic complexity. The time saved through this optimized methodology was used to compare the results of a greater number of different training parameters.
To crawl large amounts of weakly-tagged images for computer vision tasks such as object detection and scene recognition, it is very important to develop new techniques for tag cleansing and word sense disambiguation (...
详细信息
ISBN:
(纸本)9780819479334
To crawl large amounts of weakly-tagged images for computer vision tasks such as object detection and scene recognition, it is very important to develop new techniques for tag cleansing and word sense disambiguation (i.e., removing irrelevant images from the crawled results). Based on this observation, a topic network is first generated to characterize both the semantic similarity contexts and the visual similarity contexts between the image topics more sufficiently. The topic network is used to represent the classes of objects and scenes of interest. Second, both the visual similarity contexts between the images and the semantic similarity contexts between their tags are integrated for tag cleansing and word sense disambiguation. By addressing the issues of polysemes and synonyms more effectively, our word sense disambiguation algorithm can determine the relevance between the images and the associated tags more precisely, and thus it can allow us to crawl large-scale weakly-tagged images for computer vision tasks.
The proceedings contains 61 papers from the SPIE International Society for Optical Engineering Journal on Electronic Imaging and multimediasystemsii. Topics discussed include: image retrievals;gray image recognition...
详细信息
The proceedings contains 61 papers from the SPIE International Society for Optical Engineering Journal on Electronic Imaging and multimediasystemsii. Topics discussed include: image retrievals;gray image recognition;content-based image retrieval;color pattern selections;video segmentations;declarative video data retrieval languages;network databases;fast spatial transformations;moving object segmentation methods;anisotropic segmentations;video segmentation algorithms;wavelet-derived gradient computations;dual-band imaging systems;genetic algorithms;thermal image compression systems;multimedia-on-demand services;personal digital assistant (PDA) systems;and real-time nonlinear video authoring systems.
The proceedings contains 61 papers from the SPIE International Society for Optical Engineering Journal on Electronic Imaging and multimediasystemsii. Topics discussed include: image retrievals;gray image recognition...
详细信息
The proceedings contains 61 papers from the SPIE International Society for Optical Engineering Journal on Electronic Imaging and multimediasystemsii. Topics discussed include: image retrievals;gray image recognition;content-based image retrieval;color pattern selections;video segmentations;declarative video data retrieval languages;network databases;fast spatial transformations;moving object segmentation methods;anisotropic segmentations;video segmentation algorithms;wavelet-derived gradient computations;dual-band imaging systems;genetic algorithms;thermal image compression systems;multimedia-on-demand services;personal digital assistant (PDA) systems;and real-time nonlinear video authoring systems.
Effective and flexible solutions for enabling reduced resource consumption and handling content and associated processing (indexing algorithms) diversity (in terms of location, technical characteristics, execution con...
详细信息
In this paper, we propose an effective framework for semantic analysis of human motion from a monocular video. As it is difficult to find a good motion description for humans, we focus on a reliable recognition of the...
详细信息
ISBN:
(纸本)9780819466198
In this paper, we propose an effective framework for semantic analysis of human motion from a monocular video. As it is difficult to find a good motion description for humans, we focus on a reliable recognition of the motion type and estimate the body orientation involved in the video sequence. Our framework analyzes the body motion in three modules: a pre-processing module, matching module and semantic module. The proposed framework includes novel object-level processing algorithms, such as a local descriptor and a global descriptor to detect body parts and analyze the shape of the whole body as well. Both descriptors jointly contribute to the matching process by incorporating them into a new weighted linear combination for matching. We also introduce a simple cost function based on time-index differences to distinguish motion types and cycles in human motions. Our system can provide three different types of analysis results: (1) foreground person detection;(2) motion recognition in the sequence;(3) 3-D modeling of human motion based on generic human models. The proposed framework was evaluated and proved its effectiveness as it achieves the motion recognition and body-orientation classification at the accuracy of 95 % and 98 %, respectively.
Recent research results in the field of multimediacontent Analysis (MCA) have been marked by an abundance of theoretical and algorithmic solutions covering narrow application domains only. In this paper we analyze th...
详细信息
ISBN:
(纸本)0863415954
Recent research results in the field of multimediacontent Analysis (MCA) have been marked by an abundance of theoretical and algorithmic solutions covering narrow application domains only. In this paper we analyze this tendency and its origin in more detail and explain why, in our view, this should not be considered "the way to go" in providing easy access to content in multimediasystems and applications of the future. In particular, we concentrate on the case study of digital video, which we see as a straightforward example of multimedia. Through this case study we will discuss the needs and challenges of improving the generic potential of MCA algorithms.
Future Internet usage will be dominated by the consumption of a rich variety of online multimedia services accessed from an exponentially growing number of multimedia capable mobile devices. As such, future Internet d...
详细信息
Future Internet usage will be dominated by the consumption of a rich variety of online multimedia services accessed from an exponentially growing number of multimedia capable mobile devices. As such, future Internet designs will be challenged to provide solutions that can deliver bandwidth-intensive delay-sensitive on-demand video-based services over increasingly crowded and bandwidth-limited wireless access networks. One of the main reasons for the bandwidth stress facing wireless network operators is the difficulty to exploit the multicast nature of the wireless medium when wireless users or access points rarely experience the same channel conditions or access the same content at the same time. In this paper, we present and analyze a novel wireless video delivery paradigm based on the combined use of channel-aware caching and coded multicasting that allows simultaneously serving multiple cache-enabled receivers that may be requesting different content and experiencing different channel conditions. To this end, we reformulate the caching-aided coded multicast problem as a joint source-channel coding problem and design an achievable scheme that preserves the cache-enabled multiplicative throughput gains of the error-free scenario, by guaranteeing per-receiver rates unaffected by the presence of receivers with worse channel conditions.
暂无评论