The images captured in the low-light conditions always suffer from low visibility. Enhancing the visibility of the low-light image is of broad application to various computer vision tasks. Based on the classical Retin...
详细信息
ISBN:
(纸本)9781728180687
The images captured in the low-light conditions always suffer from low visibility. Enhancing the visibility of the low-light image is of broad application to various computer vision tasks. Based on the classical Retinex model, previous methods assume the reflectance components as a well-exposed image. In this paper, we introduce the blurring distortion into the Retinex model to cover more general and challenging scenarios. We further propose a two-stage framework to extract the reflectance images and remove the blurring distortion separately. Specifically, we optimize the whole network by embedding a mechanism robust to the pixel misalignment in the training dataset. The experimental results show that our proposed method achieves promising results.
Learned image compression (LIC) has illustrated good ability for reconstruction quality driven tasks (e.g. PSNR, MS-SSIM) and machine vision tasks such as image understanding. However, most LIC frameworks are based on...
详细信息
ISBN:
(纸本)9781728185514
Learned image compression (LIC) has illustrated good ability for reconstruction quality driven tasks (e.g. PSNR, MS-SSIM) and machine vision tasks such as image understanding. However, most LIC frameworks are based on pixel domain, which requires the decoding process. In this paper, we develop a learned compressed domain framework for machine vision tasks. 1) By sending the compressed latent representation directly to the task network, the decoding computation can be eliminated to reduce the complexity. 2) By sorting the latent channels by entropy, only selective channels will be transmitted to the task network, which can reduce the bitrate. As a result, compared with the traditional pixel domain methods, we can reduce about 1/3 multiply-add operations (MACS) and 1/5 inference time while keeping the same accuracy. Moreover, proposed channel selection can contribute to at most 6.8% bitrate saving.
Most existing digital image watermarking schemes tend to have the inherent conflicts between imperceptibility and robustness because watermarks are embedded by parameter modification. image watermark techniques resolv...
详细信息
ISBN:
(纸本)9781479903085
Most existing digital image watermarking schemes tend to have the inherent conflicts between imperceptibility and robustness because watermarks are embedded by parameter modification. image watermark techniques resolve this dilemma by extracting invariant features from the image as "embedded" watermark. In this paper we propose an image zero watermark scheme based on visual attention regions of images. In the proposed scheme visual attention model carefully selects top-N salient areas, where a set of selected Scale Invariant Feature transform (SIFT) descriptors are extracted as a watermark. The distance of each pair of SIFT descriptors from the reference and test images are calculated by Kullback-Leibler (KL) divergence after mapping into the high dimensional space respectively. The final distance of two sets of SIFT descriptor are determined by ensemble similarity. The experimental results indicate that the proposed scheme outperforms image zero watermark scheme based on color and edge histograms (CEH) and is robust to attacks of geometric distortion, contrast/luminance distortion and JPEG compression.
Hidden Markov models (HMMs) have been widely used in various fields, including image categorization and retrieval. Most of the existing methods train HMMs by low-level features of image blocks;however, the block-based...
详细信息
ISBN:
(纸本)9780819466211
Hidden Markov models (HMMs) have been widely used in various fields, including image categorization and retrieval. Most of the existing methods train HMMs by low-level features of image blocks;however, the block-based features can not reflect high-level semantic concepts well. This paper proposes a new method to train HMMs by region-based features, which can be obtained after image segmentation. Our work can be characterized by two key properties: (1) Region-based HMM is adopted to achieve better categorization performance, for the region-based features accord with the human perception better. (2) Multi-layer semantic representation (MSR) is introduced to couple with region-based HMM in a long-term relevance feedback framework for image retrieval. The experimental results demonstrate the effectiveness of our proposal in both aspects of categorization and retrieval.
Recent advances in sensor technology and wide deployment of visual sensors lead to a new application whereas compression of images are not mainly for pixel recovery for human consumption, instead it is for communicati...
详细信息
ISBN:
(纸本)9781728185514
Recent advances in sensor technology and wide deployment of visual sensors lead to a new application whereas compression of images are not mainly for pixel recovery for human consumption, instead it is for communication to cloud side machine vision tasks like classification, identification, detection and tracking. This opens up new research dimensions for a learning based compression that directly optimizes loss function in vision tasks, and therefore achieves better compression performance vis-a-vis the pixel recovery and then performing vision tasks computing. In this work, we developed a learning based compression scheme that learns a compact feature representation and appropriate bitstreams for the task of visual object detection. Variational Auto-Encoder (VAE) framework is adopted for learning a compact representation, while a bridge network is trained to drive the detection loss function. Simulation results demonstrate that this approach is achieving a new state-of-the-art in task driven compression efficiency, compared with pixel recovery approaches, including both learning based and handcrafted solutions.
Polymerase chain reaction (PCR) and gel electrophoresis are two widely used techniques for genetic studies that require the bench scientist to perform many tedious manual steps. Advances in automation are making these...
详细信息
ISBN:
(纸本)9780819466211
Polymerase chain reaction (PCR) and gel electrophoresis are two widely used techniques for genetic studies that require the bench scientist to perform many tedious manual steps. Advances in automation are making these techniques more accessible, but detection and image analysis still remain labor-intensive. Although several commercial software packages are now available, DNA image analysis still requires some intervention by the user, and thus a certain level of imageprocessing expertise. To allow researchers to speed up their analyses and obtain more repeatable results, we present a fully automated image analysis system for DNA or protein studies with high accuracy. The proposed system is based mainly on four steps: automatic thresholding, shifting, filtering, and processing. The automatic thresholding that is used to equalize the gray values of the gel electrophoreses image background is one of the key and novel operations in this algorithm. An enhancement is also used to improve poor quality images that have faint DNA bands. Experimental results show that the proposed method eliminates defects due to noise for good and average quality gel electrophoresis images, while it also improves the appearance of poor quality images.
The focus of this paper is on automatic annotation for semantic image retrieval. This work is aimed at identifying visual descriptors that are most relevant, effective and suitable for semantic annotation tasks. We pr...
详细信息
ISBN:
(纸本)9781424412358
The focus of this paper is on automatic annotation for semantic image retrieval. This work is aimed at identifying visual descriptors that are most relevant, effective and suitable for semantic annotation tasks. We propose an image annotation system based on support vector machines and a combination of descriptors that includes a gradient direction histogram and several MPEG-7 visual descriptors. The system is tested on a large database of 7200 cityscape and landscape images. The results indicate that when descriptors are used individually, the proposed gradient direction histogram performs best. However, when descriptors are combined, the accuracy is improved. The presented results confirm that combining the gradient direction histogram and colour structure produces the best results.
Synthesizing images from text is an important problem and has various applications. Most of the existing studies of text-to-image generation utilize supervised methods and rely on a fully-labeled dataset, but detailed...
详细信息
ISBN:
(纸本)9781728180687
Synthesizing images from text is an important problem and has various applications. Most of the existing studies of text-to-image generation utilize supervised methods and rely on a fully-labeled dataset, but detailed and accurate descriptions of images are onerous to obtain. In this paper, we introduce a simple but effective semi-supervised approach that considers the feature of unlabeled images as " Pseudo Text Feature". Therefore, the unlabeled data can participate in the following training process. To achieve this, we design a Modality-invariant Semanticconsistent Module which aims to make the image feature and the text feature indistinguishable and maintain their semantic information. Extensive qualitative and quantitative experiments on MNIST and Oxford-102 flower datasets demonstrate the effectiveness of our semi-supervised method in comparison to supervised ones. We also show that the proposed method can be easily plugged into other visual generation models such as image translation and performs well.
An 81.6 GOPS object recognition processor is developed by using NoC and visualimageprocessing (VIP) memory. SIFT (Scale Invariant Feature Transform) object recognition requires huge computing power and data transact...
详细信息
ISBN:
(纸本)9781424407866
An 81.6 GOPS object recognition processor is developed by using NoC and visualimageprocessing (VIP) memory. SIFT (Scale Invariant Feature Transform) object recognition requires huge computing power and data transactions among tasks. The chip integrates 310 SIMD PEs for data/task level parallelism while the NoC facilitates inter-PE communications. The VIP memory searches local maximum pixel inside a 3x3 window in a single cycle providing 65.6 GOPS. The proposed processor achieves 15.9fps SIFT feature extraction at 200 MHz.
Advances in cameras and web technology have made it easy to capture and share large amounts of face videos over to an unknown audience with uncontrollable purposes. These raise increasing concerns about unwanted ident...
详细信息
ISBN:
(纸本)9781728185514
Advances in cameras and web technology have made it easy to capture and share large amounts of face videos over to an unknown audience with uncontrollable purposes. These raise increasing concerns about unwanted identity-relevant computer vision devices invading the characters's privacy. Previous de-identification methods rely on designing novel neural networks and processing face videos frame by frame, which ignore the data feature in redundancy and continuity. Besides, these techniques are incapable of well-balancing privacy and utility, and per-frame evaluation is easy to cause flicker. In this paper, we present deep motion flow, which can create remarkable de-identified face videos with a good privacy-utility tradeoff. It calculates the relative dense motion flow between every two adjacent original frames and runs the high quality image anonymization only on the first frame. The de-identified video will be obtained based on the anonymous first frame via the relative dense motion flow. Extensive experiments demonstrate the effectiveness of our proposed de-identification method.
暂无评论