检索结果-内蒙古大学图书馆

IEEE International conference on visual communications and image processing (VCIP)

作者： Chen, Liuhong Sun, Heming Zeng, Xiaoyang Fan, Yibo Fudan Univ Shanghai Peoples R China Waseda Univ Tokyo Japan JST PRESTO Saitama Japan

ISBN: (纸本)9781665475921

To speedup the image classification process which conventionally takes the reconstructed images as input, compressed domain methods choose to use the compressed images without decompression as input. Correspondingly, there will be a certain decline about the accuracy. Our goal in this paper is to raise the accuracy of compressed domain classification method using compressed images output by the NN-based image compression networks. Firstly, we design a hybrid objective loss function which contains the reconstruction loss of deep feature map. Secondly, one image reconstruction layer is integrated into the image classification network for up-sampling the compressed representation. These methods greatly help increase the compressed domain image classification accuracy and need no extra computational complexity. Experimental results on the benchmark imageNet prove that our design outperforms the latest work ResNet-41 with a large accuracy gain, about 4.49% on the top-1 classification accuracy. Besides, the accuracy lagging behinds the method using reconstructed images is also reduced to 0.47%. Moreover, our designed classification network has the lowest computational complexity and model complexity.

关键词： Compressed domain image analysis image classification LIC (Learned image Compression) feature reconstruction

来源：评论

学校读者我要写书评

暂无评论

Lookup Register-Tables with Interpolation for Effective image Transformation on x86/64 CPUs

Lookup Register-Tables with Interpolation for Effective Imag...

引用

2024 conference on visual communications and image processing

作者： Kamei, Hirokazu Honda, Soichiro Hayashi, Kohei Maeda, Yoshihiro Fukushima, Norishige Nagoya Inst Technol Grad Sch Engn Nagoya Aichi Japan Shibaura Inst Technol Coll Engn Tokyo Japan

ISBN: (纸本)9798331529543;9798331529550

Lookup tables (LUTs) are commonly used to speed up image processing by handling complex mathematical functions like sine and exponential calculations. They are used in various applications such as camera image processing, high-dynamic range imaging, and edge-preserving filtering. However, due to the increasing gap between computing and input/output performance, LUTs are becoming less effective. Even though specific circuits like SIMD can improve LUT efficiency, they still need to bridge the performance gap fully. The gap makes it difficult to choose between direct numerical and LUT calculations. For this problem, a register-LUTs method with the nearest neighbor was proposed;however, it is limited for functions with narrow-range values approaching zero. In this paper, we propose a method for using register LUTs to process images efficiently over a wide range of values. Our contributions include proposing register-LUT with linear interpolation for efficient computation, using a smaller data type for further efficiency, and suggesting an efficient data retrieving method.

关键词： SIMD register-LUT interpolation x86/64 CPU image processing

来源：评论

学校读者我要写书评

暂无评论

Focused video estimation from defocused video sequences

Focused video estimation from defocused video sequences

引用

conference on visual communications and image processing 2008

作者： Yang, Junlan Schonfeld, Dan Mohamed, Magdi Univ Illinois ECE Dept Multimedia Commun Lab Chicago IL 60607 USA Motorola Labs Phys Realizat Res Ctr Excellence Schaumburg IL USA

ISBN: (纸本)9780819469946

This paper proposes a novel technique for estimating focused video frames captured by an out-of-focus moving camera. It relies on the idea of Depth from Defocus (DFD), however overcomes the shortage of DFD by reforming the problem in a computer vision framework. It introduces a moving-camera scenario and explores the relationship between the camera motion and the resulting blur characteristics in captured images. This knowledge leads to a successful blur estimation and focused image estimation. The performance of this algorithm is demonstrated through error analysis and computer simulated experiments.

关键词： Depth from Defocus (DFD) focusing camera motion image restoration

来源：评论

学校读者我要写书评

暂无评论

Improving Latent Quantization of Learned image Compression with Gradient Scaling

Improving Latent Quantization of Learned Image Compression w...

引用

IEEE International conference on visual communications and image processing (VCIP)

作者： Sun, Heming Yu, Lu Katto, Jiro Waseda Univ Waseda Res Inst Sci & Engn Tokyo Japan Zhejiang Univ Inst Informat & Commun Engn Hangzhou Peoples R China JST PRESTO 4-1-8 Honcho Kawaguchi Saitama Japan Waseda Univ Dept Comp Sci & Commun Engn Tokyo Japan

ISBN: (纸本)9781665475921

Learned image compression (LIC) has shown its superior compression ability. Quantization is an inevitable stage to generate quantized latent for the entropy coding. To solve the non-differentiable problem of quantization in the training phase, many differentiable approximated quantization methods have been proposed. However, the derivative of quantized latent to non-quantized latent are set as one in most of the previous methods. As a result, the quantization error between non-quantized and quantized latent is not taken into consideration in the gradient descent. To address this issue, we exploit the gradient scaling method to scale the gradient of non-quantized latent in the back-propagation. The experimental results show that we can outperform the recent LIC quantization methods.

关键词： Learned image compression Quantization Gradient scaling

来源：评论

学校读者我要写书评

暂无评论

DCTResNet: Transform Domain image Deblocking for Motion Blur images

DCTResNet: Transform Domain Image Deblocking for Motion Blur...

引用

IEEE International conference on visual communications and image processing (VCIP) - visual communications in the Era of AI and Limited Resources

作者： Maharjan, Paras Xu, Ning Xu, Xuan Song, Yuyan Li, Zhu Univ Missouri Kansas City MO 64110 USA Kwai Inc Palo Alto CA USA

ISBN: (纸本)9781728185514

Pixel recovery with deep learning has shown to be very effective for a variety of low-level vision tasks like image super-resolution, denoising, and deblurring. Most existing works operate in the spatial domain, and there are few works that exploit the transform domain for image restoration tasks. In this paper, we present a transform domain approach for image deblocking using a deep neural network called DCTResNet. Our application is compressed video motion deblur, where the input video frame has blocking artifacts that make the deblurring task very challenging. Specifically, we use a block-wise Discrete Cosine Transform (DCT) to decompose the image into its low and high-frequency sub-band images and exploit the strong subband specific features for more effective deblocking solutions. Since JPEG also uses DCT for image compression, using DCT sub-band images for image deblocking helps to learn the JPEG compression prior to effectively correct the blocking artifacts. Our experimental results show that both PSNR and SSIM for DCTResNet perform more favorably than other state-of-the-art (SOTA) methods, while significantly faster in inference time.

关键词： Discrete cosine transforms

来源：评论

学校读者我要写书评

暂无评论

Perceptual image Compression With Conditional Diffusion Transformers

Perceptual Image Compression With Conditional Diffusion Tran...

引用

2024 conference on visual communications and image processing

作者： Mao, Rui Feng, Xinmin Gao, Changsheng Li, Li Liu, Dong Sun, Xiaoyan Univ Sci & Technol China MOE Key Lab Brain Inspired Intelligent Percept & Hefei Peoples R China

ISBN: (纸本)9798331529543;9798331529550

Generative models have significantly advanced generative AI, particularly in image and video generation. Recognizing their potential, researchers have begun exploring their application in image compression. However, existing methods face two primary challenges: limited performance improvement and high model complexity. In this paper, to address these two challenges, we propose a perceptual image compression solution by introducing a conditional diffusion model. Given that compression performance heavily depends on the decoder's generative capability, we base our decoder on the diffusion transformer architecture. To address the model complexity problem, we implement the diffusion transformer architecture with Swin transformer. Equipped with enhanced generative capability, we further augment the decoder with informative features using a multi-scale feature fusion module. Experimental results demonstrate that our approach surpasses existing perceptual image compression methods while achieving lower model complexity.

关键词： Diffusion transformer (DiT) image generation perceptual image compression Swin transformer

来源：评论

学校读者我要写书评

暂无评论

Motion-compensated noise estimation for effective video processing

Motion-compensated noise estimation for effective video proc...

引用

conference on visual communications and image processing 2008

作者： Song, Byung Cheol Kim, Nak Hoon Samsung Elect Co Ltd Digital Media R&D Ctr Suwon South Korea

ISBN: (纸本)9780819469946

For effective noise removal prior to video processing, noise power or noise variance of an input video sequence needs to be found exactly, but it is actually a very difficult process. This paper presents an accurate noise variance estimation algorithm based on motion compensation between two adjacent noisy pictures. Firstly, motion estimation is performed for each block in a picture, and the residue. variance of the best motion-compensated block is calculated. Then, a noise variance estimate of the picture is obtained by adaptively averaging and properly scaling the variances close to the best variance. The simulation results show that the proposed noise estimation algorithm is very accurate and stable irrespective of noise level.

关键词： noise estimation motion video

来源：评论

学校读者我要写书评

暂无评论

An efficient JPE62000 tier-1 coder hardware implementation for real-time video processing

引用

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS 2003年第4期49卷 780-786页

作者： Schumacher, PR Xilinx Inc Xilinx Res Labs Longmont CO 80503 USA

The recently approved digital still image standard known as JPEG2000 promises to be an excellent image and video format for use with a large range of applications. For adoption of the standard to take place in the consumer marketplace, implementations supporting real-time encoding and decoding of popular image and video formats must be created. It is a well-known fact that the major bottleneck of a JPEG2000 system is the bit/context modeling and arithmetic coding tasks (also known as tier-1 coding). This paper discusses a hardware implementation of a tier-1 coder that exploits available parallelisms. The proposed technique described in this paper is approximately 50% faster than the best technique described in the literature(1).

关键词： field programmable gate arrays image processing JPEG2000 video signal processing

来源：评论

学校读者我要写书评

暂无评论

Frequency-aware Learned image Compression for Quality Scalability

Frequency-aware Learned Image Compression for Quality Scalab...

引用

IEEE International conference on visual communications and image processing (VCIP)

作者： Choi, Hyomin Racape, Fabien Hamidi-Rad, Shahab Ulhaq, Mateen Feltman, Simon Interdigital Emerging Technol Lab Los Altos CA 94022 USA Simon Fraser Univ Sch Engn Sci Burnaby BC Canada

ISBN: (纸本)9781665475921

Spatial frequency analysis and transforms serve a central role in most engineered image and video lossy codecs, but are rarely employed in neural network (NN)-based approaches. We propose a novel NN-based image coding framework that utilizes forward wavelet transforms to decompose the input signal by spatial frequency. Our encoder generates separate bitstreams for each latent representation of low and high frequencies. This enables our decoder to selectively decode bitstreams in a quality-scalable manner. Hence, the decoder can produce an enhanced image by using an enhancement bitstream in addition to the base bitstream. Furthermore, our method is able to enhance only a specific region of interest (ROI) by using a corresponding part of the enhancement latent representation. Our experiments demonstrate that the proposed method shows competitive rate-distortion performance compared to several non-scalable image codecs. We also showcase the effectiveness of our two-level quality scalability, as well as its practicality in ROI quality enhancement.

关键词： End-to-end compression learned image compression quality scalability wavelet decomposition

来源：评论

学校读者我要写书评

暂无评论

Nanopore Sequencing Simulator for DNA Data Storage

Nanopore Sequencing Simulator for DNA Data Storage

引用

IEEE International conference on visual communications and image processing (VCIP) - visual communications in the Era of AI and Limited Resources

作者： San Antonio, Eva Gil Heinis, Thomas Carteron, Louis Dimopoulou, Melpomeni Antonini, Marc Univ Cote Azur CNRS Lab I3S UMR 7271 Sophia Antipolis France Imperial Coll London Dept Comp London England

ISBN: (纸本)9781728185514

The exponential increase of digital data and the limited capacity of current storage devices have made clear the need for exploring new storage solutions. Thanks to its biological properties, DNA has proven to be a potential candidate for this task, allowing the storage of information at a high density for hundreds or even thousands of years. With the release of nanopore sequencing technologies, DNA data storage is one step closer to become a reality. Many works have proposed solutions for the simulation of this sequencing step, aiming to ease the development of algorithms addressing nanopore-sequenced reads. However, these simulators target the sequencing of complete genomes, whose characteristics differ from the ones of synthetic DNA. This work presents a nanopore sequencing simulator targeting synthetic DNA on the context of DNA data storage.

关键词： DNA data storage nanopore sequencing sequencing simulator image coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：