In recent years, numerous internet of Things (IoT) traffic classification techniques based on deep learning have been proposed, demonstrating impressive classification performance. With ubiquitous access to massive he...
详细信息
We propose a method for generating spurious features by leveraging large-scale text-to-image diffusion models. Although the previous work detects spurious features in a large-scale dataset like imageNet and introduces...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
We propose a method for generating spurious features by leveraging large-scale text-to-image diffusion models. Although the previous work detects spurious features in a large-scale dataset like imageNet and introduces Spurious imageNet, we found that not all spurious images are spurious across different classifiers because images that contain spurious features were filtered by a limited number of classifiers. Although spurious images help measure the reliance of a classifier, filtering many images from the internet to find more spurious features is time-consuming. To this end, we utilize an existing approach of personalizing large-scale text-to-image diffusion models with available discovered spurious images and propose a new spurious feature similarity loss based on neural features of an adversarially robust model. Precisely, we fine-tune Stable Diffusion with several reference images from Spurious imageNet with a modified objective incorporating the proposed spurious-feature similarity loss. Experiment results show that our method can generate spurious images that are consistently spurious across different classifiers. Moreover, the generated spurious images are visually similar to reference images from Spurious imageNet.
User-generated content (UGC) is ubiquitous across the internet as a result of billions of videos and images being uploaded each day. All kinds of UGC media are affected by natural distortions, occurring both during an...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
User-generated content (UGC) is ubiquitous across the internet as a result of billions of videos and images being uploaded each day. All kinds of UGC media are affected by natural distortions, occurring both during and after capture, which are inherently diverse and commingled. These distortions have different perceptual effects based on the media content. Given recent dramatic increases in the consumption of short-form content, the analysis and control of their perceptual quality has become an important problem. Regardless of the content, many UGC videos have overlaid and embedded texts in them, which are visually salient. Hence text quality has a significant impact on the global perception of video or image quality and needs to be studied. One of the most important factors in perceptual text quality in user-generated media is legibility, which has been studied very little in the context of computer vision. Predicting text legibility can also help in text recognition applications such as image search or document identification. This work aims at modeling text legibility using computer vision techniques and thus studying the relationship between text quality and legibility. We propose a modified dataset variant of COCO-Text [1] and a model for predicting text legibility for both handwritten and machine-generated texts. We also demonstrate how models trained to predict text legibility can help in the prediction of text (perceptual) quality. The dataset and models can be accessed here https://***/research/Quality/***.
Semantic communication aims to transmit the underlying semantic information of a signal from the sender to the receiver, where the key requirement is to ensure that the receiver reconstructs a signal semantically (alm...
详细信息
With the advancement of technology, the collection of data used in Intelligent Transportation systems has become increasingly easy, notably with the emergence of the Mobile CrowdSensing paradigm. This paradigm could p...
详细信息
ISBN:
(纸本)9798350333398
With the advancement of technology, the collection of data used in Intelligent Transportation systems has become increasingly easy, notably with the emergence of the Mobile CrowdSensing paradigm. This paradigm could provide insights into traffic situation, road condition, pedestrians' behaviours, public transportation situation, and so on. Through the use of the powerful sensors in the mobile, different types of data can be generated, such as Gyroscope data, light sensors data, Magnetometer data, GPS data, Accelerometer data, videos, and images. However, the use of MCS raises issues of data reliability, as it involves the participation of several mobile owners with different 'levels' of trustworthiness. An important related issue is image forgery, i.e. the manipulation of images in order to deliberately provide misleading information, which constitutes a threat to the decision-makers of the MSC-based applications in Intelligent Transportation systems. In this paper, an overview on image forgery is presented. We examine the workflows, approaches and techniques employed for image forgery detection and localization. Then, we provide a brief review of some image splicing localization techniques. Finally, we provide a comparative analysis of handcrafted feature extraction-based techniques and deep learning-based techniques. The aim of this paper is to draw attention to the image forgery problem that could threaten any Mobile CrowdSensing application dedicated to image collection, and to provide an overview of different existing techniques that could be used to overcome this problem.
In recent years, wildfires occur frequently as global warming. People set a large number of sensors to monitor the wild land. However, the poor network in remote area can hardly afford the big data transmission while ...
详细信息
Drip irrigation systems, while efficient in water usage, often suffer from clogged emitters, leading to crop loss. To address this challenge, a drone-based solution is proposed. Equipped with image processing capabili...
详细信息
In this paper, a home care system with location information is designed. From this we can learn from the physiological information and location information of the elders. The physiological information part includes th...
详细信息
The rapid growth of the internet of Things (loT) has created significant opportunities for the future of telecommunications. Research on physical layer authentication with channel features holds promise to improve wir...
详细信息
Hyperspectral super-resolution involves combining low-resolution hyperspectral images with high-resolution multispectral images to produce a high-resolution hyperspectral image. Recently, although many methods for hyp...
详细信息
暂无评论