We answer the question, what should we say about V when we want to gamble on X, and what is it worth? If V=X, we show that every bit of description at rate R is worth a bit of increase /spl Delta/(R) in the doubling r...
详细信息
We answer the question, what should we say about V when we want to gamble on X, and what is it worth? If V=X, we show that every bit of description at rate R is worth a bit of increase /spl Delta/(R) in the doubling rate. Thus the efficiency /spl Delta/(R)/R is equal to 1. For general V, we provide a single letter characterization for /spl Delta/(R). When applied specifically to jointly normal (V,X) with correlation /spl rho/, we find the initial efficiency /spl Delta/'(0) is /spl rho//sup 2/. If V and X are Bernoulli random variables connected by a binary symmetric channel with parameter /spl rho/, the initial efficiency is (1-2p)/sup 2/. We finally show how much increase in doubling rate is possible when the sender can provide R bits of information about V and side information S is available only to the investor.
Dramatic progress in multimedia technology within the last decade has brought numerous problems and questions related to the image quality assessment and evaluation. All blocks of imaging chain have an impact on the f...
详细信息
ISBN:
(纸本)1424408210
Dramatic progress in multimedia technology within the last decade has brought numerous problems and questions related to the image quality assessment and evaluation. All blocks of imaging chain have an impact on the final image quality perceived by an observer - optical imaging system, sensing, source coding and compression, channel coding and transmission, retrieving, displaying inch printing etc. Therefore the study of image qualitative parameters is crucial at the recent development period of multimedia applied systems. The paper describes and summarizes the impacts of all relevant blocks: Optical Imaging System - description, parameters, Image Sensors and Displays - 2D sampling and 2D aperture, colorimetric aspects, source Image Compression - lossless and lossy techniques, distortions and artifacts, Methodology of Subjective Quality Tests, Objective Quality Criteria, Modeling of HVS, Subjective Image Quality Enhancement. Selected experimental image quality evaluation results based upon our activities under two research projects of the Czech Grant Agency and two research projects of the Ministry of Education are finally presented.
We analyze the performance of low-density generator matrix (LDGM) codes for lossy source coding. We first develop a generic technique for deriving lower bounds on the effective rate-distortion functions of binary line...
详细信息
We analyze the performance of low-density generator matrix (LDGM) codes for lossy source coding. We first develop a generic technique for deriving lower bounds on the effective rate-distortion functions of binary linear codes. This result provides a source coding analog of a classical result due to Gallager for channel coding over the binary symmetric channel. We illustrate this method for the ensemble of check-regular low- density generator matrix (LDGM) codes by deriving an explicit lower bound on its rate-distortion performance as a function of the check degree.
This paper addresses the problem of efficient and robust transmission of video over a hierarchical wireless infrastructure. First, we compress the data generated by the video sources using a multi-resolution wavelet c...
详细信息
This paper addresses the problem of efficient and robust transmission of video over a hierarchical wireless infrastructure. First, we compress the data generated by the video sources using a multi-resolution wavelet coder. Subsequently, the generated bitstreams are transmitted over a wireless infrastructure to multiple regional processing centers (RPC), which then send the distorted bitstreams to a common higher-level processing center/receiver. We propose a symmetric framework for the joint compression and error protection of the correlated video bitstreams received by the RPC. We show that significantly improved video performance can be obtained by our framework over a variety of channel conditions.
The work aims to enable the use of common software engineering techniques and tools for quantum programming languages (e.g., OpenQASM). With the increased interest in quantum computing, researchers are adopting the us...
详细信息
ISBN:
(数字)9798331541378
ISBN:
(纸本)9798331541385
The work aims to enable the use of common software engineering techniques and tools for quantum programming languages (e.g., OpenQASM). With the increased interest in quantum computing, researchers are adopting the use of higher-level quantum programming languages versus low-level circuit diagrams. While general purpose programming languages (e.g., C++, Python) are highly supported by a variety of software engineering tools, these novel programming languages for quantum computing have almost no support. Useable tools for debugging, static analysis, error detection, and transformation are currently non-existent. This work extends an existing software infrastructure (i.e., srcML) for the analysis, exploration, and manipulation of source code to OpenQASM. The srcML infrastructure, via parsing, generates abstract syntax information of programs to support high-level querying and analysis of the source code. With this, quantum developers can extract information and identify possible errors or inefficiencies in their programs. The paper presents the basic syntactic markup for OpenQASM. Also, a number of relevant quantum-based problems (e.g., iteration patterns, control recursion) are described and examples of how they are addressed using srcML are given.
This paper considers the problem of minimizing the communication cost for a general multi-hop network with correlated sources and multiple sinks. For the single sink scenario, it has been shown that this problem can b...
详细信息
ISBN:
(纸本)9781424478903
This paper considers the problem of minimizing the communication cost for a general multi-hop network with correlated sources and multiple sinks. For the single sink scenario, it has been shown that this problem can be decoupled, without loss of optimality, into two separate subproblems of distributed source coding and finding the optimal routing (transmission structure). It has further been established that, under certain assumptions, such decoupling also applies in the general case of multiple sinks and arbitrary network demands. We show that these assumptions are significantly restrictive, and further provide examples to substantiate the loss, including settings where removing the assumptions yields unbounded performance gains. Finally, an approach to solving the unconstrained problem, where routing and coding cannot be decoupled, is derived based on Han and Kobayashi's achievability region for multi-terminal coding.
A video transmission system based on combined source coding and multilevel modulation is proposed. For the proposed system, graceful degradation for noisy channels is obtained by finding good index maps between the qu...
详细信息
A video transmission system based on combined source coding and multilevel modulation is proposed. For the proposed system, graceful degradation for noisy channels is obtained by finding good index maps between the quantized source coder parameters and the amplitude levels of a multilevel QAM signal constellation. The index maps are designed both for good neighbor properties and for power efficient transmission by using simulated annealing. The performance of the proposed system is comparable to a reference system based on the H.263 coder for high CSNR values, and degrades far more gracefully for low CSNR values.
The Segment Anything Model (SAM) has exhibited outstanding performance in various image segmentation tasks. Despite being trained with over a billion masks, SAM faces challenges in mask prediction quality in numerous ...
详细信息
ISBN:
(数字)9798350390155
ISBN:
(纸本)9798350390162
The Segment Anything Model (SAM) has exhibited outstanding performance in various image segmentation tasks. Despite being trained with over a billion masks, SAM faces challenges in mask prediction quality in numerous scenarios, especially in real-world contexts. In this paper, we introduce a novel prompt-driven adapter into SAM, namely Prompt Adapter Segment Anything Model (PA-SAM), aiming to enhance the segmentation mask quality of the original SAM. By exclusively training the prompt adapter, PA-SAM extracts detailed information from images and optimizes the mask decoder feature at both sparse and dense prompt levels, improving the segmentation performance of SAM to produce high-quality masks. Experimental results demonstrate that our PA-SAM outperforms other SAM-based methods in high-quality, zero-shot, and open-set segmentation. We’re making the source code and models available at https://***/xzz2/pa-sam.
Face detection is a mandatory step in many computer vision applications, such as face recognition, emotion recognition, age detection, virtual makeup, and vital sign monitoring. Thanks to advancements in deep learning...
详细信息
Face detection is a mandatory step in many computer vision applications, such as face recognition, emotion recognition, age detection, virtual makeup, and vital sign monitoring. Thanks to advancements in deep learning and the introduction of annotated large-scale datasets, numerous applications have been developed for human faces. Recently, other domains, such as animals and cartoon characters, have started gaining attention but still lag far behind human faces. The biggest challenge is the limited number of annotated face datasets in these domains. The manual labeling of large-scale datasets is tedious and requires substantial human labor. In this regard, we present an inputagnostic face detector to ease the annotation of various face datasets. We propose a simple but effective data-centric approach instead of building a specific neural network architecture. Specifically, we trained a face detection model, YOLO5Face, on human, animal, and cartoon face datasets. The experiments show that the model can achieve accurate results in all domains. In addition, the model achieved decent results for animals and cartoon characters different from the ones in the training set. This implies that the model can extract agnostic facial features. We have made the source code and pre-trained models publicly available at https://***/IS2AI/AnyFace to stimulate research in these fields.
暂无评论