Generative image steganography is a technique that directly generates stego images from secret *** traditional methods,it theoretically resists steganalysis because there is no cover ***,the existing generative image ...
详细信息
Generative image steganography is a technique that directly generates stego images from secret *** traditional methods,it theoretically resists steganalysis because there is no cover ***,the existing generative image steganography methods generally have good steganography performance,but there is still potential room for enhancing both the quality of stego images and the accuracy of secret information ***,this paper proposes a generative image steganography algorithm based on attribute feature transformation and invertible mapping ***,the reference image is disentangled by a content and an attribute encoder to obtain content features and attribute features,***,a mean mapping rule is introduced to map the binary secret information into a noise vector,conforming to the distribution of attribute *** noise vector is input into the generator to produce the attribute transformed stego image with the content feature of the reference ***,we design an adversarial loss,a reconstruction loss,and an image diversity loss to train the proposed *** results demonstrate that the stego images generated by the proposed method are of high quality,with an average extraction accuracy of 99.4%for the hidden ***,since the stego image has a uniform distribution similar to the attribute-transformed image without secret information,it effectively resists both subjective and objective steganalysis.
The accurate identification of students in need is crucial for governments and colleges to allocate resources more effectively and enhance social equity and educational fairness. Existing approaches to identifying stu...
详细信息
People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications,including intelligent transportation systems, healthcare services, and brain-computer interfaces....
详细信息
People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications,including intelligent transportation systems, healthcare services, and brain-computer interfaces. Large-scale data collection and annotation make the application of machine learning algorithms prohibitively expensive when adapting to new tasks. One way of circumventing this limitation is to train the model in a semi-supervised learning manner that utilizes a percentage of unlabeled data to reduce the labeling burden in prediction tasks. Despite their appeal, these models often assume that labeled and unlabeled data come from similar distributions, which leads to the domain shift problem caused by the presence of distribution gaps. To address these limitations, we propose herein a novel method for people-centric activity recognition,called domain generalization with semi-supervised learning(DGSSL), that effectively enhances the representation learning and domain alignment capabilities of a model. We first design a new autoregressive discriminator for adversarial training between unlabeled and labeled source domains, extracting domain-specific features to reduce the distribution gaps. Second, we introduce two reconstruction tasks to capture the task-specific features to avoid losing information related to representation learning while maintaining task-specific consistency. Finally, benefiting from the collaborative optimization of these two tasks, the model can accurately predict both the domain and category labels of the source domains for the classification task. We conduct extensive experiments on three real-world sensing datasets. The experimental results show that DGSSL surpasses the three state-of-the-art methods with better performance and generalization.
Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In thi...
详细信息
Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In this paper, we conducted a comprehensive evaluation of large multimodal models, such as GPT4V and Gemini, in various text-related visual tasks including text recognition, scene text-centric visual question answering(VQA), document-oriented VQA, key information extraction(KIE), and handwritten mathematical expression recognition(HMER). To facilitate the assessment of optical character recognition(OCR) capabilities in large multimodal models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available. Furthermore, our study reveals both the strengths and weaknesses of these models, particularly in handling multilingual text, handwritten text, non-semantic text, and mathematical expression *** importantly, the baseline results presented in this study could provide a foundational framework for the conception and assessment of innovative strategies targeted at enhancing zero-shot multimodal *** evaluation pipeline and benchmark are available at https://***/Yuliang-Liu/Multimodal OCR.
Software trustworthiness is an essential criterion for evaluating software quality. In componentbased software, different components play different roles and different users give different grades of trustworthiness af...
详细信息
Software trustworthiness is an essential criterion for evaluating software quality. In componentbased software, different components play different roles and different users give different grades of trustworthiness after using the software. The two elements will both affect the trustworthiness of software. When the software quality is evaluated comprehensively, it is necessary to consider the weight of component and user feedback. According to different construction of components, the different trustworthiness measurement models are established based on the weight of components and user feedback. Algorithms of these trustworthiness measurement models are designed in order to obtain the corresponding trustworthiness measurement value automatically. The feasibility of these trustworthiness measurement models is demonstrated by a train ticket purchase system.
A compact filtering antenna system with wide-angle scanning is proposed for vehicle to infrastructure(V2I) communication which would handle complex communication scenarios. In this work, a wide beam filtering antenna ...
详细信息
A compact filtering antenna system with wide-angle scanning is proposed for vehicle to infrastructure(V2I) communication which would handle complex communication scenarios. In this work, a wide beam filtering antenna is realized by using some inductive resistance structures such as metal pins and pillars, and capacitive structures such as slots, parasitical patches to produce the radiation nulls at two sides of the operating frequency band and improve the impedance matching in the passband. Meanwhile, the wide beam capability is also realized by the above structure. Furthermore, two H-and E-plane linear arrays are designed for the beam scanning capability with filtering characteristics based on the proposed antenna. To verify the proposed design concept, a prototype is fabricated and measured. The measurement and simulation agree well, demonstrating an excellent filtering characteristic with the operating frequency band from 3.18 to 3.45 GHz(about 8.1%), the high total efficiency of about 88%, and 3-d B-beamwidth of more than 100° and 120° in the above two arrays, respectively. Additionally, the proposed arrays can realize the beam scanning up to the coverage of 112° and 120° with a lower gain reduction and a good filtering characteristic, respectively.
Emotion recognition using biological brain signals needs to be reliable to attain effective signal processing and feature extraction techniques. The impact of emotions in interpretations, conversations, and decision-m...
详细信息
Emotion recognition using biological brain signals needs to be reliable to attain effective signal processing and feature extraction techniques. The impact of emotions in interpretations, conversations, and decision-making, has made automatic emotion recognition and examination of a significant feature in the field of psychiatric disease treatment and cure. The problem arises from the limited spatial resolution of EEG recorders. Predetermined quantities of electroencephalography (EEG) channels are used by existing algorithms, which combine several methods to extract significant data. The major intention of this study was to focus on enhancing the efficiency of recognizing emotions using signals from the brain through an experimental, adaptive selective channel selection approach that recognizes that brain function shows distinctive behaviors that vary from one individual to another individual and from one state of emotions to another. We apply a Bernoulli–Laplace-based Bayesian model to map each emotion from the scalp senses to brain sources to resolve this issue of emotion mapping. The standard low-resolution electromagnetic tomography (sLORETA) technique is employed to instantiate the source signals. We employed a progressive graph convolutional neural network (PG-CNN) to identify the sources of the suggested localization model and the emotional EEG as the main graph nodes. In this study, the proposed framework uses a PG-CNN adjacency matrix to express the connectivity between the EEG source signals and the matrix. Research on an EEG dataset of parents of an ASD (autism spectrum disorder) child has been utilized to investigate the ways of parenting of the child's mother and father. We engage with identifying the personality of parental behaviors when regulating the child and supervising his or her daily activities. These recorded datasets incorporated by the proposed method identify five emotions from brain source modeling, which significantly improves the accurac
Fabric defect detection is a critical task in the textile industry, requiring high precision and recall to ensure effective quality control. This study presents an enhanced YOLOv8-based framework that integrates novel...
详细信息
Unsupervised domain adaptation (UDA) is a popular technique to reduce the manual annotation cost in semantic segmentation. However, due to the absence of strong supervision in the target domain, UDA is prone to biasin...
详细信息
Unsupervised domain adaptation (UDA) is a popular technique to reduce the manual annotation cost in semantic segmentation. However, due to the absence of strong supervision in the target domain, UDA is prone to biasing the decision boundary towards the source domain. To alleviate this issue, this paper proposes a more effective semi-supervised domain adaptation (SSDA) method for semantic segmentation via active learning with feature- and semantic-level alignments. Specifically, active learning is utilized to select those samples with high diversity and uncertainty from the target domain for labeling. These selected data could provide reliable clues for domain transfer since they reveal the intrinsic distribution of the target domain as well as including hard samples at boundaries. Moreover, to better adapt the segmentation model from the source data to the labeled target data selected above, we propose a scheme based on both feature- and semantic-level domain alignments. The feature-level domain alignment imposes the distribution consistency between the Transformer features of the two domains by adversarial learning, which is a global alignment. In contrast, the semantic-level domain alignment optimizes the affinity and divergence of the semantic representations across domains via contrastive learning, which is a local alignment. These two alignments jointly bridge the domain gap from both the global and the local views, respectively. In addition, the pseudo labels of the unlabeled data are generated to expand the labeled data and further strengthen the cross-domain segmentation in a self-training manner. Extensive experiments on segmentation benchmarks demonstrate the effectiveness of our proposed method. IEEE
Older adults are often underserved and marginalized in technology engagement due to their reluctance and the barriers they face in adopting and engaging with mainstream technology. However, Pinxiaoquan, a social featu...
详细信息
暂无评论