In order to solve the problems of digital expression, content retrieval and cultural calculation of traditional Manchu paper-cut patterns in Changbai Mountain area, an XML-based information storage model was proposed....
详细信息
With the advent of learned image compression, numerous models have been developed. These models make use of non-linear transforms that are learnt during the training process, where an image is transformed into a laten...
详细信息
ISBN:
(纸本)9781510679344;9781510679351
With the advent of learned image compression, numerous models have been developed. These models make use of non-linear transforms that are learnt during the training process, where an image is transformed into a latent space, quantized and entropy coded. At the decoder, the quantized latent is recovered and transformed back to image space through a synthesis transform. In this work, we attempt to present an analysis of the energy distribution across channels. In our prior works, we demonstrated the features captured by the analysis transform, that can provide insights into the bitrate distribution across channels. Building on that, we extend our findings with quantitative measurements. We consider various learned image codecs that are based on the variational autoencoder framework and compare them with Karhunen Loeve Transform (KLT) in terms of energy compaction. We also compare the closeness of the learned transforms to KLT to study the relationship between the design of classical codecs and learned codecs.
To gather reliable evidence and submit it to the court, forensic applications are used. Due to recent advancements in technology, many crimes now involve the modification of photos. Finding the original evidence and p...
详细信息
image watermarking is an important tool for copyright management, and various watermarking methods have been proposed. Tensor decomposition, which has attracted much attention in imageprocessing, has been applied as ...
详细信息
ISBN:
(纸本)9798350376975;9798350376968
image watermarking is an important tool for copyright management, and various watermarking methods have been proposed. Tensor decomposition, which has attracted much attention in imageprocessing, has been applied as a watermarking method, however, no method has been proposed for color cover images with grayscale watermarks. Therefore, we propose a color image watermarking method that involves using a grayscale watermark. We evaluated the attack resistance of the proposed method through experiments.
digitalimageprocessing is the class of methods that is used in all research domain. With the help of a computer algorithm, the manipulation of an image is done that provides the required information about the image ...
详细信息
We review the history of the development of one of the most iconic tools in image and video coding - the zigzag scan. Despite its simplicity, we will show that its development was a non-trivial process that took sever...
详细信息
ISBN:
(纸本)9781510679344;9781510679351
We review the history of the development of one of the most iconic tools in image and video coding - the zigzag scan. Despite its simplicity, we will show that its development was a non-trivial process that took several years, multiple iterations, and multiple ideas that eventually led to the formation of its final "zigzag" shape. Remarkably, we also discover that early variants of the zigzag scan appeared before the invention of the DCT, intra-predictors, and many other techniques in image and video coding algorithms. It is one of the oldest and most fundamental techniques in this context. This paper also traces the evolution of image and video codec architectures over the last six decades and brings examples of uses of the zigzag scan in modern-era image and video coding standards.
Rapid development and deployment of GPU based computation has led to an improvement in diffusion generation of video and images. Further, a rapid reduction in the effective cost of compression using NNC techniques pro...
详细信息
ISBN:
(纸本)9781510679344;9781510679351
Rapid development and deployment of GPU based computation has led to an improvement in diffusion generation of video and images. Further, a rapid reduction in the effective cost of compression using NNC techniques provides opportunities to compress images and videos in new ways. The overall structure of diffusion based generative video and images is leveraged to take advantage of the compressed latent to lower overall compression costs and latency. This paper presents an architecture to compress a latent for transmission and reduce overall latency and cost as compared to alternatives using traditional Codecs or NNC on the raw image. It explores a proof of concept based on image compression of a latent. It further presents computational cost, quantitative and perceptual quality, and latency for this architecture as compared to the alternatives.
The International Workshop on Artificial Intelligence for Signal, imageprocessing, and Multimedia (AI-SIPM) aims to provide a platform for researchers, practitioners, and industry professionals to exchange ideas, dis...
详细信息
ISBN:
(纸本)9798400706028
The International Workshop on Artificial Intelligence for Signal, imageprocessing, and Multimedia (AI-SIPM) aims to provide a platform for researchers, practitioners, and industry professionals to exchange ideas, discuss recent advancements, and explore future directions in the field of artificial intelligence (AI) applied to signal processing, imageprocessing, and multimedia technologies. This workshop will feature presentations of novel research findings, practical applications, and innovative solutions addressing various challenges and opportunities in AI-driven signal and imageprocessing, as well as multimedia analysis and understanding. Researchers and practitioners from academia, industry, and government agencies are invited to submit their original research contributions and participate in discussions that foster collaboration and knowledge sharing across different domains. Through this workshop, we aim to accelerate advancements in AI-driven technologies for signal processing, image analysis, and multimedia applications, contributing to the advancement of research and innovation in this rapidly evolving field.
The image compression field is witnessing a shift in paradigm thanks to the rise of neural network-based models. In this context, the JPEG committee is in the process of standardizing the first learning-based image co...
详细信息
ISBN:
(纸本)9781510679344;9781510679351
The image compression field is witnessing a shift in paradigm thanks to the rise of neural network-based models. In this context, the JPEG committee is in the process of standardizing the first learning-based image compression standard, known as JPEG AI. While most of the research to date has focused on image coding for humans, JPEG AI plans to address both human and machine vision by presenting several non-normative decoders addressing multiple imageprocessing and computer vision tasks in addition to standard reconstruction. While the impact of conventional image compression on computer vision tasks has already been addressed, no study has been conducted to assess the impact of learning-based image compression on such tasks. In this paper, the impact of learning-based image compression, including JPEG AI, on computer vision tasks is reviewed and discussed, mainly focusing on the image classification task along with object detection and segmentation. This study reviews the impact of image compression with JPEG AI on various computer vision models. It shows the superiority of JPEG AI over other conventional and learning-based compression models.
image smoothening is a highly utilized operation in digitalimageprocessing focused on reducing the impact of noise. In field programmable gate arrays (FPGA) based implementation, the working of the image smoothening...
详细信息
暂无评论