This study introduces CLIP-Flow,a novel network for generating images from a given image or *** effectively utilize the rich semantics contained in both modalities,we designed a semantics-guided methodology for image-...
详细信息
This study introduces CLIP-Flow,a novel network for generating images from a given image or *** effectively utilize the rich semantics contained in both modalities,we designed a semantics-guided methodology for image-and text-to-image *** particular,we adopted Contrastive Language-Image Pretraining(CLIP)as an encoder to extract semantics and StyleGAN as a decoder to generate images from such ***,to bridge the embedding space of CLIP and latent space of StyleGAN,real NVP is employed and modified with activation normalization and invertible *** the images and text in CLIP share the same representation space,text prompts can be fed directly into CLIP-Flow to achieve text-to-image *** conducted extensive experiments on several datasets to validate the effectiveness of the proposed image-to-image synthesis *** addition,we tested on the public dataset Multi-Modal CelebA-HQ,for text-to-image *** validated that our approach can generate high-quality text-matching images,and is comparable with state-of-the-art methods,both qualitatively and quantitatively.
Improving website security to prevent malicious online activities is crucial,and CAPTCHA(Completely Automated Public Turing test to tell computers and Humans Apart)has emerged as a key strategy for distinguishing huma...
详细信息
Improving website security to prevent malicious online activities is crucial,and CAPTCHA(Completely Automated Public Turing test to tell computers and Humans Apart)has emerged as a key strategy for distinguishing human users from automated ***-based CAPTCHAs,designed to be easily decipherable by humans yet challenging for machines,are a common form of this ***,advancements in deep learning have facilitated the creation of models adept at recognizing these text-based CAPTCHAs with surprising *** our comprehensive investigation into CAPTCHA recognition,we have tailored the renowned UpDown image captioning model specifically for this *** approach innovatively combines an encoder to extract both global and local features,significantly boosting the model’s capability to identify complex details within CAPTCHA *** the decoding phase,we have adopted a refined attention mechanism,integrating enhanced visual attention with dual layers of Long Short-Term Memory(LSTM)networks to elevate CAPTCHA recognition *** rigorous testing across four varied datasets,including those from Weibo,BoC,Gregwar,and Captcha 0.3,demonstrates the versatility and effectiveness of our *** results not only highlight the efficiency of our approach but also offer profound insights into its applicability across different CAPTCHA types,contributing to a deeper understanding of CAPTCHA recognition technology.
Due to the unclear distribution characteristics and causes of fluoride in groundwater of Mihe-Weihe River Basin(China),there is a higher risk for the future development and utilization of ***,based on the systematic s...
详细信息
Due to the unclear distribution characteristics and causes of fluoride in groundwater of Mihe-Weihe River Basin(China),there is a higher risk for the future development and utilization of ***,based on the systematic sampling and analysis,the distribution features and enrichment mechanism for fluoride in groundwater were studied by the graphic method,hydrogeochemical modeling,the proportionality factor between conventional ions and factor *** results show that the fluorine content in groundwater is generally on the high side,with a large area of medium-fluorine water(0.5–1.0 mg/L),and high-fluorine water is chiefly in the interfluvial lowlands and alluvial-marine plain,which mainly contains HCO_(3)·Cl-Na-and HCO_(3)^(-)Na-type *** vertical zonation characteristics of the fluorine content decrease with increasing depth to the water *** high flouride groundwater during the wet season is chiefly controlled by the weathering and dissolution of fluorine-containing minerals,as well as the influence of rock weathering,evaporation and *** weak alkaline environment that is rich in sodium and poor in calcium during the dry season is the main reason for the enrichment of ***,an integrated assessment model is established using rough set theory and an improved matter element extension model,and the level of groundwater pollution caused by fluoride in the Mihe-Weihe River Basin during the wet and dry seasons in the Shandong Peninsula is defined to show the necessity for local management measures to reduce the potential risks caused by groundwater quality.
In recent years, academics have placed a high value on multi-modal emotion identification, as well as extensive research has been conducted in the areas of video, text, voice, and physical signal emotion detection. Th...
详细信息
In the realm of deep learning, Generative Adversarial Networks (GANs) have emerged as a topic of significant interest for their potential to enhance model performance and enable effective data augmentation. This paper...
详细信息
Because of the problems of low detection accuracy and long detection time in traditional online learning state detection methods, a new method based on posture recognition is proposed. First of all, a pinhole camera p...
详细信息
Interrupted Sampling Repeater Jamming (ISRJ) can produce several false targets through intermittent sampling and forwarding of the intercepted signals. The paper proposes an interference identification and suppression...
详细信息
Optoelectronic synapses that integrate visual perception and pre-processing hold significant potential for neuromorphic vision systems(NVSs). However, due to a lack of wavelength sensitivity, existing NVS mainly foc...
详细信息
Optoelectronic synapses that integrate visual perception and pre-processing hold significant potential for neuromorphic vision systems(NVSs). However, due to a lack of wavelength sensitivity, existing NVS mainly focuses on gray-scale image processing, making it challenging to recognize color images. Additionally, the high power consumption of optoelectronic synapses, compared to the 10 fJ energy consumption of biological synapses, limits their broader application. To address these challenges, an energy-efficient NVS capable of color target recognition in a noisy environment was developed,utilizing a MoS2optoelectronic synapse with wavelength sensitivity. Benefiting from the distinct photon capture capabilities of 450, 535, and 650 nm light, the optoelectronic synapse exhibits wavelength-dependent synaptic plasticity, including excitatory postsynaptic current(EPSC), paired-pulse facilitation(PPF), and long-term plasticity(LTP). These properties can effectively mimic the visual memory and color discrimination functions of the human vision system. Results demonstrate that the NVS, based on MoS2optoelectronic synapses, can eliminate the color noise at the sensor level, increasing color image recognition accuracy from 50% to 90%. Importantly, the optoelectronic synapse operates at a low voltage spike of0.0005 V, consuming only 0.075 fJ per spike, surpassing the energy efficiency of both existing optoelectronic and biological synapses. This ultra-low power, color-sensitive device eliminates the need for color filters and offers great promise for future deployment in filter-free NVS.
We introduce a novel method using a new generative model that automatically learns effective representations of the target and background appearance to detect,segment and track each instance in a video *** from curren...
详细信息
We introduce a novel method using a new generative model that automatically learns effective representations of the target and background appearance to detect,segment and track each instance in a video *** from current discriminative tracking-by-detection solutions,our proposed hierarchical structural embedding learning can predict more highquality masks with accurate boundary details over spatio-temporal space via the normalizing *** formulate the instance inference procedure as a hierarchical spatio-temporal embedded learning across time and *** the video clip,our method first coarsely locates pixels belonging to a particular instance with Gaussian distribution and then builds a novel mixing distribution to promote the instance boundary by fusing hierarchical appearance embedding information in a coarse-to-fine *** the mixing distribution,we utilize a factorization condition normalized flow fashion to estimate the distribution parameters to improve the segmentation *** qualitative,quantitative,and ablation experiments are performed on three representative video instance segmentation benchmarks(i.e.,YouTube-VIS19,YouTube-VIS21,and OVIS)and the effectiveness of the proposed method is *** impressively,the superior performance of our model on an unsupervised video object segmentation dataset(i.e.,DAVIS19)proves its *** algorithm implementations are publicly available at https://***/zyqin19/HEVis.
Federated Graph Neural Networks (FedGNNs) have achieved significant success in representation learning for graph data, enabling collaborative training among multiple parties without sharing their raw graph data and so...
详细信息
Federated Graph Neural Networks (FedGNNs) have achieved significant success in representation learning for graph data, enabling collaborative training among multiple parties without sharing their raw graph data and solving the data isolation problem faced by centralized GNNs in data-sensitive scenarios. Despite the plethora of prior work on inference attacks against centralized GNNs, the vulnerability of FedGNNs to inference attacks has not yet been widely explored. It is still unclear whether the privacy leakage risks of centralized GNNs will also be introduced in FedGNNs. To bridge this gap, we present PIAFGNN, the first property inference attack (PIA) against FedGNNs. Compared with prior works on centralized GNNs, in PIAFGNN, the attacker can only obtain the global embedding gradient distributed by the central server. The attacker converts the task of stealing the target user’s local embeddings into a regression problem, using a regression model to generate the target graph node embeddings. By training shadow models and property classifiers, the attacker can infer the basic property information within the target graph that is of interest. Experiments on three benchmark graph datasets demonstrate that PIAFGNN achieves attack accuracy of over 70% in most cases, even approaching the attack accuracy of inference attacks against centralized GNNs in some instances, which is much higher than the attack accuracy of the random guessing method. Furthermore, we observe that common defense mechanisms cannot mitigate our attack without affecting the model’s performance on mainly classification tasks.
暂无评论