Traditional image steganography methods embed secret data into cover images but often face detection by steganalysis tools due to detectable pixel modifications, which increase the risk of data leakage. These techniqu...
详细信息
High-bandwidth memory (HBM) placed side-by-side with ASIC on silicon interposer is capable of delivering the TB/s bandwidth. To maintain such high bandwidth, it is crucial to perform extensive signal integrity analysi...
详细信息
The first step in the development and application of marine resources is to process the captured ocean optical images. Compared with the general image, the underwater image processing is more complex, which increases ...
详细信息
The Retinex algorithms find wide applications as image enhancers, for their capability of preserving edges, while at the same time attenuating smooth gradients and chromatic dominants. They are characterized by the fa...
详细信息
ISBN:
(纸本)9781665464956
The Retinex algorithms find wide applications as image enhancers, for their capability of preserving edges, while at the same time attenuating smooth gradients and chromatic dominants. They are characterized by the fact that the output chromatic intensity of a pixel is not determined in isolation (or looking only at the contiguous pixels) but through an operation of comparison to different local and remote areas of the image. This local/global comparison implies also a high computational cost for the algorithms: their complexity is not linear with the number of pixels;furthermore, the more systematic the comparison, the higher the complexity. Thus, most Retinex algorithms are unfit for real-time processing. The recent development of efficient Machine Learning architectures for image Processing has raised the question of whether one of the Retinex "transforms" could be efficiently learned by training a feed-forward Artificial Neural Network, thus creating a model characterized by short processing time. Selecting a variant of the Random Spray Retinex model - FuzzyRSR - as representative of the Retinex family, and choosing suitably structured autoencoder neural networks, we found that we could accurately reproduce the Retinex effects. The computational cost of the training phase was moderate, while that of the inference phases was linear in the number of pixels, and three orders of magnitude lower than the one of FuzzyRSR, thus making the ANN implementation of Retinex suitable for real-time processing.
Brain tumor anomaly detection plays a critical role in the field of computer-aided diagnosis, which has attracted ever-increasing focus from the medical community However, brain tumor data are scarce and tough to clas...
详细信息
ISBN:
(纸本)9781665468916
Brain tumor anomaly detection plays a critical role in the field of computer-aided diagnosis, which has attracted ever-increasing focus from the medical community However, brain tumor data are scarce and tough to classify. Unsupervised methods enable the reduction of huge labeling costs to be applied to brain tumor anomaly detection during the training only given normal brain images. However, the existing unsupervised methods distinguish whether the input image is abnormal in the image space, which cannot effectively learn the discriminative features. In this paper, we propose a novel brain tumor anomaly detection method via Latent Feature Regularization based Adversarial Network (LFRA-Net), which leverages a latent feature regularizer into adversarial learning to obtain the discriminative features. Comprehensive experiments on BraTS, HCP, MNIST, and CIFAR-10 datasets evaluate the effectiveness of our LFRANet, which outperforms state-of-the-art unsupervised learning methods.
Currently, aspect-based multimodal sentiment analysis (ABMSA) remains a highly hot research field, aiming to leverage various modalities such as images and text to determine the sentiment orientation of viewpoint enti...
详细信息
Wavelet transform techniques are crucial to image processing, particularly in medical imaging, where the need for precise feature extraction is critical. This article describes the design and FPGA implementation of an...
详细信息
The end-to-end image dehazing network relies on paired training data. However, there is limited training data available for real non-homogeneous dehazing, which limits the performance of dehazing networks on real non-...
详细信息
internet of Things (IoT) is a growing computing trend that encompasses every connected thing. Over the recent years, IoT has recorded an exponential growth, leading to billions of smart devices, and still increasing. ...
详细信息
With the proliferation of social media data, Multimodal Named Entity Recognition (MNER) has received much attention;using different data modalities is crucial for the development of natural language processing and neu...
详细信息
ISBN:
(纸本)9798350359329;9798350359312
With the proliferation of social media data, Multimodal Named Entity Recognition (MNER) has received much attention;using different data modalities is crucial for the development of natural language processing and neural networks. However, existing methods suffer from two drawbacks: 1) textimage pairs in the data only sometimes correspond to each other, and it is impossible to rely on contextual information due to the short text nature of social media. 2) Despite the introduction of visual information, heterogeneity gaps may occur in previous complex fusion methods, leading to misidentification. This paper proposes a new synthetic image with a selected graphic alignment network(SAMNER) to address these challenges and construct a matching relationship between external images and text. To solve the graphic mismatch problem, we use a stable diffusion model to generate the images and perform entity labeling. Specifically, we generate images and perform entity labeling through the stable diffusion model to generate the image with the highest match to the text, filter the generated images by the internal image set to generate the best image, and then perform multimodal fusion to predict the entity labeling, we design a simple and effective multimodal attentional alignment mechanism to obtain a better visual representation, and we conduct a large number of experiments. The experiments prove that our model produces competitive results on the two publicly available datasets.
暂无评论