This paper proposes a novel methodology that utilizes a newly developed visual Internet of Things (IoT) system for resilient natural disaster mitigation. This system enables the detection of disasters through remote c...
ISBN (digital): 9798331529543
ISBN (print): 9798331529550
Image style transfer is a computer vision technique that applies the artistic style of one image to the content of another while preserving structural features. It finds applications in artwork creation, design and branding, entertainment and media, and many other fields. Current image style transfer methods fail to retain global characteristics and local details simultaneously. This paper proposes a hybrid transformer architecture that incorporates mixed convolutional network modules. By integrating the transformer and the convolutional modules, global and local features are captured and fused. Experimental results demonstrate that the proposed method achieves better visual fidelity, reducing the combined content and style loss by at least 10% compared with state-of-the-art approaches.
ISBN (digital): 9798331529543
ISBN (print): 9798331529550
Quanta image sensors are a novel paradigm in image sensor technology. Their direct application to quanta image sensors-based imaging systems is challenging because a bit-plane image is a set of binary images. In this paper, we introduce spatio-temporal priors based on the intensity invariance and smoothness characteristics of the motion vector. Specifically, we model the observation that when the image sequence is aligned with the correct motion vector, the spatio-temporal structure becomes more consistent. Moreover, the spatial smoothness prior is incorporated by applying a smoothing filter to the evaluation metrics of the motion vector candidates. The experimental results show that the proposed method is more effective than conventional methods.
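The candidate-evaluation idea in this abstract can be illustrated with a toy 1-D sketch. This is not the authors' method: the disagreement-count cost, the circular box filter, the candidate set, and all function names below are assumptions chosen for illustration, and the bit planes are noise-free, whereas real quanta image sensor data is not.

```python
def shift_cost(frames, x, v):
    """Count disagreements with frame 0 along the trajectory x + v*t."""
    n = len(frames[0])
    ref = frames[0][x]
    return sum(frames[t][(x + v * t) % n] != ref for t in range(1, len(frames)))


def box_smooth(values, radius=1):
    """Circular moving average, standing in for the smoothing filter."""
    n = len(values)
    width = 2 * radius + 1
    return [sum(values[(i + k) % n] for k in range(-radius, radius + 1)) / width
            for i in range(n)]


def estimate_motion(frames, candidates=(-1, 0, 1), radius=1):
    """Pick, per pixel, the candidate motion with the lowest smoothed cost."""
    n = len(frames[0])
    # One spatially smoothed cost map per candidate motion vector.
    smoothed = {v: box_smooth([shift_cost(frames, x, v) for x in range(n)], radius)
                for v in candidates}
    return [min(candidates, key=lambda v: smoothed[v][x]) for x in range(n)]


base = [0, 1, 1, 0, 1, 0, 0, 1, 1, 1, 0, 0, 1, 0, 1, 0]
# Toy bit planes: the same binary pattern moving right by 1 pixel per frame.
frames = [[base[(i - t) % len(base)] for i in range(len(base))] for t in range(4)]
motion = estimate_motion(frames)
```

In this sketch the cost is lowest when the frames are aligned along the true motion, which mirrors the intensity-invariance prior, and smoothing the cost maps before taking the minimum is one simple way to favor spatially coherent motion vectors.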
In the 1960s, Hubel et al. proposed the concept of receptive field through the study of cat visual cortex cells[1]. In the 1980s, Fukushima [2] proposed the concept of neurocognitive machine based on the concept of re...
Effective image coding techniques are crucial for digital image storage and transmission. Traditional methods struggle to maintain high visual quality at low bitrates. In this paper, we present MobileViT-GAN, a novel generative adversarial network (GAN) architecture for low bitrate image compression. We propose a lightweight transformer-based discriminator that improves coding performance over convolutional neural network-based discriminators. Additionally, we introduce a smoothness loss function to mitigate artifacts in decoded images, further improving visual quality in low bitrate image coding. We evaluate the proposed method against traditional and state-of-the-art GAN-based image compression techniques and show that it is superior in terms of compression ratio and decoded image quality.
ISBN (digital): 9783030948931
ISBN (print): 9783030948931; 9783030948924
Acquisition and consumption of visual media such as digital images and videos is becoming one of the most important forms of modern communication. However, since the creation and sharing of images is increasing exponentially, images as a media form are being devalued: the quality of a single image matters less and less, and the frequency of shared content becomes the focus. In this work, we present an interactive system that allows users to interact with volatile and diverting artwork using only their eye movement. The system uses real-time image-abstraction techniques to create an artwork unique to each situation. It supports multiple distinct interaction modes that share common design principles, enabling users to experience game-like interactions focused on eye movement and the diverting image content itself. This approach hints at possible future research in the field of relaxation exercises and casual art consumption and creation.
In the era of digitization and big data, the world is inundated with an ever-growing volume of visual content, be it images or videos. As organizations strive to harness the potential of these multimedia data sources,...
ISBN (digital): 9789811903908
ISBN (print): 9789811903908; 9789811903892
Programming through machine learning methods has been gradually replacing many repetitive and tedious tasks. Identifying mechanically exfoliated 2D materials under a microscope is laborious work that is ill-suited to manual execution, and machine learning methods show attractive potential for the rapid and accurate targeting of available products. Based on algorithms for image segmentation and target positioning, this paper discusses the feasibility of program-controlled searching for 2D nanosheets among miscellaneous cracks. By introducing a visual GUI display, program-enabled automatic recognition may free humans from this hard work.
Recently, providing explainable deep learning models has sparked a lot of attention. In this paper, we take a further step in this direction. We introduce a time-efficient method, called Ablation-CAM++, which can gene...
ISBN (print): 9781728185514
With the rapid development of whole brain imaging technology, a large number of brain images have been produced, creating a great demand for efficient brain image compression methods. At present, the most commonly used compression methods, such as JP3D, are all based on the 3-D wavelet transform. However, traditional 3-D wavelet transforms are designed manually under certain assumptions about the signal, and brain images are not as ideal as assumed. Moreover, these transforms are not directly optimized for the compression task. To solve these problems, we propose a trainable 3-D wavelet transform based on the lifting scheme, in which the predict and update steps are replaced by 3-D convolutional neural networks. The proposed transform is then embedded into an end-to-end compression scheme called iWave3D, which is trained with a large number of brain images to directly minimize the rate-distortion loss. Experimental results demonstrate that our method outperforms JP3D significantly, by 2.012 dB in terms of average BD-PSNR.
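The lifting scheme mentioned in this abstract can be sketched in one dimension. The code below is a minimal illustration, not iWave3D itself: the function names are hypothetical, and fixed Haar-like predict and update functions stand in for the 3-D convolutional networks described in the abstract.

```python
def lifting_forward(signal, predict, update):
    """One level of a 1-D lifting transform on an even-length sequence."""
    even = list(signal[0::2])
    odd = list(signal[1::2])
    # Predict step: estimate odd samples from even ones and keep the residual.
    detail = [o - p for o, p in zip(odd, predict(even))]
    # Update step: adjust even samples using the residual.
    approx = [e + u for e, u in zip(even, update(detail))]
    return approx, detail


def lifting_inverse(approx, detail, predict, update):
    """Undo lifting_forward by reversing the two steps in opposite order."""
    even = [a - u for a, u in zip(approx, update(detail))]
    odd = [d + p for d, p in zip(detail, predict(even))]
    signal = [0] * (len(even) + len(odd))
    signal[0::2] = even
    signal[1::2] = odd
    return signal


# Assumption: fixed Haar-like steps used here in place of learned networks.
def haar_predict(even):
    return list(even)


def haar_update(detail):
    return [d / 2 for d in detail]


x = [4, 6, 10, 12, 8, 6, 5, 5]
approx, detail = lifting_forward(x, haar_predict, haar_update)
restored = lifting_inverse(approx, detail, haar_predict, haar_update)
```

The inverse only reapplies the same predict and update functions with the signs flipped, so the transform stays invertible whatever those functions compute. That structural property is what makes it possible to swap in trainable predict and update steps.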