Search Results - Inner Mongolia University Library

38th Conference on Neural Information Processing Systems, NeurIPS 2024

Authors: Meng, Benyuan; Xu, Qianqian; Wang, Zitai; Yang, Zhiyong; Cao, Xiaochun; Huang, Qingming. Affiliations: Institute of Information Engineering, CAS, China; School of Cyber Security, University of Chinese Academy of Sciences, China; Key Lab. of Intelligent Information Processing, Institute of Computing Technology, CAS, China; Peng Cheng Laboratory, China; School of Computer Science and Tech., University of Chinese Academy of Sciences, China; School of Cyber Science and Tech., Shenzhen Campus of Sun Yat-sen University, China; Key Laboratory of Big Data Mining and Knowledge Management, CAS, China

Diffusion models are powerful generative models, and this capability can also be applied to discrimination. The inner activations of a pre-trained diffusion model can serve as features for discriminative tasks, namely, diffusion feature. We discover that diffusion feature has been hindered by a hidden yet universal phenomenon that we call content shift. To be specific, there are content differences between features and the input image, such as the exact shape of a certain object. We locate the cause of content shift as one inherent characteristic of diffusion models, which suggests the broad existence of this phenomenon in diffusion feature. Further empirical study also indicates that its negative impact is not negligible even when content shift is not visually perceivable. Hence, we propose to suppress content shift to enhance the overall quality of diffusion features. Specifically, content shift is related to the information drift during the process of recovering an image from the noisy input, pointing out the possibility of turning off-the-shelf generation techniques into tools for content shift suppression. We further propose a practical guideline named GATE to efficiently evaluate the potential benefit of a technique and provide an implementation of our methodology. Despite the simplicity, the proposed approach has achieved superior results on various tasks and datasets, validating its potential as a generic booster for diffusion features. Our code is available at this url. © 2024 Neural Information Processing Systems Foundation. All rights reserved.


Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features


38th Conference on Neural Information Processing Systems, NeurIPS 2024

Authors: Meng, Benyuan; Xu, Qianqian; Wang, Zitai; Cao, Xiaochun; Huang, Qingming. Affiliations: Institute of Information Engineering, CAS, China; School of Cyber Security, University of Chinese Academy of Sciences, China; Key Lab. of Intelligent Information Processing, Institute of Computing Technology, CAS, China; Peng Cheng Laboratory, China; School of Cyber Science and Tech., Shenzhen Campus of Sun Yat-sen University, China; School of Computer Science and Tech., University of Chinese Academy of Sciences, China; Key Laboratory of Big Data Mining and Knowledge Management, CAS, China

Diffusion models are initially designed for image generation. Recent research shows that the internal signals within their backbones, named activations, can also serve as dense features for various discriminative tasks such as semantic segmentation. Given numerous activations, selecting a small yet effective subset poses a fundamental problem. To this end, the early study of this field performs a large-scale quantitative comparison of the discriminative ability of the activations. However, we find that many potential activations have not been evaluated, such as the queries and keys used to compute attention scores. Moreover, recent advancements in diffusion architectures bring many new activations, such as those within embedded ViT modules. Both combined, activation selection remains unresolved but overlooked. To tackle this issue, this paper takes a further step with a much broader range of activations evaluated. Considering the significant increase in activations, a full-scale quantitative comparison is no longer operational. Instead, we seek to understand the properties of these activations, such that the activations that are clearly inferior can be filtered out in advance via simple qualitative evaluation. After careful analysis, we discover three properties universal among diffusion models, enabling this study to go beyond specific models. On top of this, we present effective feature selection solutions for several popular diffusion models. Finally, the experiments across multiple discriminative tasks validate the superiority of our method over the SOTA competitors. Our code is available at this url. © 2024 Neural Information Processing Systems Foundation. All rights reserved.


A Novel Meta-Hierarchical Active Inference Reinforcement Learning Approach for QoS-Driven Resource Allocation in Dynamic Clouds


2024 IEEE Global Communications Conference, GLOBECOM 2024

Authors: Xian, Peijie; He, Ying; Yu, F. Richard; Du, Jianbo. Affiliations: Shenzhen Univ., College of Computer Science and Software Engineering, Shenzhen, China; Xi'an Univ. of Posts & Telecomm., Shaanxi Key Lab. of Information Comm. Network and Security, Shaanxi, Xi'an, China; Carleton Univ., School of Information Technology, Ottawa, ON, Canada

ISBN (print): 9798350351255

The cloud computing environment is highly dynamic due to a variety of external factors such as seasonal changes, market trends, and social events. In this case, tenant behavior patterns exhibit significant variability. Therefore, the cloud computing resource allocation model must continuously adapt to these changes. To this end, we propose an adaptive algorithm for dynamic cloud resource allocation based on meta-hierarchical active inference reinforcement learning (MHAIRL). The algorithm combines active inference with meta-hierarchical reinforcement learning. This combination improves overall performance, adapts quickly to environmental changes, and improves generalization. In addition, we design a novel polling scheduling framework combined with a long short-term memory (LSTM) network. The framework ensures scheduling fairness and flexibility while greatly reducing the state and action space dimensions of the agent. Extensive simulation results show that our method outperforms baseline algorithms in quality of service (QoS) metrics and significantly improves system performance in highly dynamic cloud resource allocation. © 2024 IEEE.

Keywords: Active learning
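The abstract of this record credits a polling scheduling framework with keeping allocation fair while shrinking the agent's state and action space. The sketch below illustrates only that general idea and is not the authors' implementation: tenants are visited in a fixed cyclic order, and a decision is made for one tenant per step instead of one joint decision for all tenants. The function name `polling_schedule`, its parameters, and the `policy` callback (a stand-in for the learned agent) are all hypothetical.

```python
from itertools import cycle


def polling_schedule(demands, capacity, policy):
    """Grant resources by visiting tenants one at a time in cyclic order.

    demands:  dict mapping tenant name -> requested resource units
    capacity: total resource units available
    policy:   callable(want, free) -> units to offer on this visit
              (hypothetical stand-in for a learned agent)
    """
    remaining = dict(demands)
    grants = {tenant: 0 for tenant in demands}
    free = capacity
    idle_visits = 0  # consecutive visits that granted nothing

    for tenant in cycle(sorted(demands)):
        # Stop when resources run out or a full round granted nothing.
        if free == 0 or idle_visits == len(demands):
            break
        offer = policy(remaining[tenant], free)
        give = max(0, min(offer, remaining[tenant], free))
        grants[tenant] += give
        remaining[tenant] -= give
        free -= give
        idle_visits = 0 if give > 0 else idle_visits + 1
    return grants


if __name__ == "__main__":
    # One unit per visit: every tenant gets a turn before anyone gets a second.
    print(polling_schedule({"a": 3, "b": 1, "c": 5}, 6, lambda want, free: 1))
```

Because each step concerns a single tenant, the policy only needs that tenant's outstanding demand and the free capacity as input, which is the dimensionality reduction the abstract alludes to.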


IMPROVED XCEPTION WITH DUAL ATTENTION MECHANISM AND FEATURE FUSION FOR FACE FORGERY DETECTION


4th International Conference on Data Intelligence and Security, ICDIS 2022

Authors: Lin, Hao; Luo, Weiqi; Wei, Kangkang; Liu, Minglin. Affiliations: Sun Yat-sen University, Guangdong Key Lab of Information Security Technology, Guangzhou, China; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China

ISBN (digital): 9781665459686

ISBN (print): 9781665459686

Face forgery detection has become a research hotspot in recent years, and many related methods have been proposed to date. For images with low quality and/or diverse sources, the detection performance of existing methods is still far from satisfactory. In this paper, we propose an improved Xception with dual attention mechanism and feature fusion for face forgery detection. Unlike the middle flow in the original Xception model, we capture different high-semantic features of face images using different levels of convolution, and introduce the convolutional block attention module and feature fusion to refine and reorganize those high-semantic features. In the exit flow, we employ the self-attention mechanism and depthwise separable convolution to learn the global information and local information of the fused features separately to improve the classification ability of the proposed model. Experimental results evaluated on three Deepfake datasets demonstrate that the proposed method outperforms Xception as well as other related methods both in effectiveness and generalization ability. © 2022 IEEE.

Keywords: Convolution


A comprehensive survey on shadow removal from document images: datasets, methods, and opportunities


Vicinagearth, 2025, vol. 2, no. 1, pp. 1-18

Authors: Wang, Bingshu; Li, Changping; Zou, Wenbin; Zhang, Yongjun; Chen, Xuhang; Chen, C.L. Philip. Affiliations: School of Software, Northwestern Polytechnical University, Xi’an, China; Guangdong Provincial Key Laboratory of Intelligent Information Processing & Shenzhen Key Laboratory of Media Security, Shenzhen University, Shenzhen, China; Guangdong Key Laboratory of Intelligent Information Processing, College of Electronics and Information Engineering, Shenzhen University, Shenzhen, China; State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, China (Yongjun Zhang); School of Computer Science and Engineering, Huizhou University, Huizhou, China; School of Computer Science and Engineering, South China University of Technology and Pazhou Lab, Guangzhou, China

With the rapid development of document digitization, people have become accustomed to capturing and processing documents using electronic devices such as smartphones. However, the captured document images often suffer from issues like shadows and noise due to environmental factors, which can affect their readability. To improve the quality of captured document images, researchers have proposed a series of models or frameworks and applied them in distinct scenarios such as image enhancement and document information extraction. In this paper, we primarily focus on shadow removal methods and open-source datasets. We concentrate on recent advancements in this area, first organizing and analyzing nine available datasets. Then, the methods are categorized into conventional methods and neural network-based methods. Conventional methods use manually designed features and include shadow map-based approaches and illumination-based approaches. Neural network-based methods automatically generate features from data and are divided into single-stage approaches and multi-stage approaches. We detail representative algorithms and briefly describe some typical techniques. Finally, we analyze and discuss experimental results, identifying the limitations of datasets and methods. Future research directions are discussed, and nine suggestions for shadow removal from document images are proposed. To our knowledge, this is the first survey of shadow removal methods and related datasets from document images.


The Power of Bamboo: On the Post-Compromise Security for Searchable Symmetric Encryption


30th Annual Network and Distributed System Security Symposium, NDSS 2023

Authors: Chen, Tianyang; Xu, Peng; Picek, Stjepan; Luo, Bo; Susilo, Willy; Jin, Hai; Liang, Kaitai. Affiliations: National Engineering Research Center for Big Data Technology and System, Services Computing Technology and System Lab, China; Hubei Key Laboratory of Distributed System Security, Hubei Engineering Research Center on Big Data Security, School of Cyber Science and Engineering, China; Cluster and Grid Computing Lab, School of Computer Science and Technology, China; Huazhong University of Science and Technology, Wuhan 430074, China; Digital Security Group, Radboud University, Nijmegen, Netherlands; Department of EECS, Institute of Information Sciences, The University of Kansas, Lawrence, KS, United States; Institute of Cybersecurity and Cryptology, School of Computing and Information Technology, University of Wollongong, Wollongong, NSW 2522, Australia; Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, Delft 2628 CD, Netherlands

ISBN (print): 1891562835

Dynamic searchable symmetric encryption (DSSE) enables users to delegate the keyword search over dynamically updated encrypted databases to an honest-but-curious server without losing keyword privacy. This paper studies a new and practical security risk to DSSE, namely, secret key compromise (e.g., a user’s secret key is leaked or stolen), which threatens all the security guarantees offered by existing DSSE schemes. To address this open problem, we introduce the notion of searchable encryption with key-update (SEKU) that provides users with the option of non-interactive key updates. We further define the notion of post-compromise secure with respect to leakage functions to study whether DSSE schemes can still provide data security after the client’s secret key is compromised. We demonstrate that post-compromise security is achievable with a proposed protocol called "Bamboo". Interestingly, the leakage functions of Bamboo satisfy the requirements for both forward and backward security. We conduct a performance evaluation of Bamboo using a real-world dataset and compare its runtime efficiency with the existing forward-and-backward secure DSSE schemes. The result shows that Bamboo provides strong security with better or comparable performance. © 2023 30th Annual Network and Distributed System Security Symposium, NDSS 2023. All Rights Reserved.


PGD-Imp: Rethinking and Unleashing Potential of Classic PGD with Dual Strategies for Imperceptible Adversarial Attacks


International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Jin Li; Zitong Yu; Ziqiang He; Z. Jane Wang; Xiangui Kang. Affiliations: Guangdong Key Lab of Information Security, School of Computer Science and Engineering, Sun Yat-Sen University; School of Computing and Information Technology, Great Bay University; Electrical and Computer Engineering Dept, University of British Columbia

ISBN (digital): 9798350368741

ISBN (print): 9798350368758

Imperceptible adversarial attacks have recently attracted increasing research interest. Existing methods typically incorporate external modules or loss terms other than a simple $\ell_p$-norm into the attack process to achieve imperceptibility, while we argue that such additional designs may not be necessary. In this paper, we rethink the essence of imperceptible attacks and propose two simple yet effective strategies to unleash the potential of PGD, the common and classical attack, for imperceptibility from an optimization perspective. Specifically, the Dynamic Step Size is introduced to find the optimal solution with minimal attack cost towards the decision boundary of the attacked model, and the Adaptive Early Stop strategy is adopted to reduce the redundant strength of adversarial perturbations to the minimum level. The proposed PGD-Imperceptible (PGD-Imp) attack achieves state-of-the-art results in imperceptible adversarial attacks for both untargeted and targeted scenarios. When performing untargeted attacks against ResNet-50, PGD-Imp attains 100% (+0.3%) ASR, 0.89 (-1.76) $\ell_2$ distance, and 52.93 (+9.2) PSNR with 57s (-371s) running time, significantly outperforming existing methods.

Keywords: Adaptation models; Costs; Perturbation methods; Machine learning; Signal processing; Acoustics; Speech processing; Optimization
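The abstract of this record names two ingredients, a dynamic step size and an early stop, without giving their exact rules. The sketch below is therefore not PGD-Imp itself. It is a plain sign-gradient PGD loop on a toy linear classifier, with two loudly assumed substitutes: the step shrinks by a constant `decay` factor each iteration, and the loop stops as soon as the input is misclassified. The names `pgd_linear`, `margin`, `decay`, and `max_iters` are hypothetical.

```python
def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def margin(w, b, x, y):
    """Signed margin y * (w.x + b); a negative value means misclassified."""
    return y * (sum(wi * xi for wi, xi in zip(w, x)) + b)


def pgd_linear(w, b, x, y, eps, step, decay=0.9, max_iters=50):
    """Untargeted L-infinity PGD on a linear classifier (toy sketch).

    Assumed, not taken from the paper: geometric step decay and
    stop-at-first-misclassification.
    """
    adv = list(x)
    steps = 0
    for _ in range(max_iters):
        # Early stop: further steps would only add perturbation strength.
        if margin(w, b, adv, y) < 0:
            break
        # The gradient of the margin w.r.t. x is y * w; step against its sign.
        adv = [a - step * y * sign(wi) for a, wi in zip(adv, w)]
        # Project back onto the L-infinity ball of radius eps around x.
        adv = [min(max(a, xi - eps), xi + eps) for a, xi in zip(adv, x)]
        step *= decay  # assumed "dynamic" rule: smaller steps near the boundary
        steps += 1
    return adv, steps


if __name__ == "__main__":
    adv, steps = pgd_linear([1.0, -2.0], 0.0, [1.0, 0.0], 1, eps=0.5, step=0.2)
    print(adv, steps, margin([1.0, -2.0], 0.0, adv, 1))
```

Stopping at the first misclassified iterate keeps the perturbation well inside the `eps` budget in this toy case, which is the intuition behind reducing "redundant strength" in the abstract.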


Autoencoder-Based Latent Block-Diagonal Representation for Subspace Clustering


IEEE Transactions on Cybernetics, 2022, vol. 52, no. 6, pp. 5408-5418

Authors: Xu, Yesong; Chen, Shuo; Li, Jun; Han, Zongyan; Yang, Jian. Affiliations: Nanjing University of Science and Technology, PCA Laboratory, Key Lab. of Intelligent Percept. and Syst. for High-Dimensional Information of Ministry of Education, Nanjing 210094, China; Nanjing University of Science and Technology, Jiangsu Key Laboratory of Image and Video Understanding for Social Security, School of Computer Science and Engineering, Nanjing 210094, China; RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; Nanjing University of Science and Technology, School of Computer Science and Engineering, Nanjing 210094, China

Block-diagonal representation (BDR) is an effective subspace clustering method. The existing BDR methods usually obtain a self-expression coefficient matrix from the original features by a shallow linear model. However, the underlying structure of real-world data is often nonlinear, thus those methods cannot faithfully reflect the intrinsic relationship among samples. To address this problem, we propose a novel latent BDR (LBDR) model to perform the subspace clustering on a nonlinear structure, which jointly learns an autoencoder and a BDR matrix. The autoencoder, which consists of a nonlinear encoder and a linear decoder, plays an important role to learn features from the nonlinear samples. Meanwhile, the learned features are used as a new dictionary for a linear model with block-diagonal regularization, which can ensure good performances for spectral clustering. Moreover, we theoretically prove that the learned features are located in the linear space, thus ensuring the effectiveness of the linear model using self-expression. Extensive experiments on various real-world datasets verify the superiority of our LBDR over the state-of-the-art subspace clustering approaches. © 2013 IEEE.

Keywords: Decoding


An Ensemble of CapsNet and KSVM Using Feature Fusion: Application to COVID-19 Detection


3rd International Conference on Pattern Recognition and Machine Learning, PRML 2022

Authors: Aggrey, Esther Stacy E. B.; Zhen, Qin; Kodjiku, Seth Larweh; Aidoo, Evans; Fiasam, Linda Delali; Mensah, Acheampong Edward. Affiliations: University of Electronic Science & Technology of China, School of Information & Software Engineering, Chengdu, China; University of Electronic Science & Technology of China, School of Information & Software Engineering, Network and Data Security Key Lab of Sichuan Province, Chengdu, China; Zhejiang Gongshang University, School of Computer & Information Engineering, Hangzhou, China

ISBN (digital): 9781665499507

ISBN (print): 9781665499507

COVID-19 virus is a major worldwide pandemic that is growing at a fast pace throughout the world. The usual approach for diagnosing COVID-19 is the use of a real-time polymerase chain reaction (RT-PCR) based nucleic acid test. However, RT-PCR has lower sensitivity in the early phases of COVID-19 detection. Recent studies have indicated that X-ray images may be useful throughout the early detection of the virus. Human screening has been shown to be cost-effective, susceptible to mistakes, and time-demanding, which has sparked an interest in using Convolutional Neural Networks (CNNs) to automate the process. CNNs, on the other hand, fail to view the exact placement of features as advantageous in medical imaging. Furthermore, for successful training and prediction, CNNs need a huge quantity of datasets. CNNs are rapidly reducing picture resolution, resulting in worsening accuracy in classification. We used newly created capsule networks (CapsNets) in our study to circumvent these disadvantages. The primary contribution is to improve the identification of SARS-CoV-2 with images obtained from X-ray by coupling capsule network with a kernel support vector machine (KSVM). The technique was evaluated using a publicly available dataset, and the proposed model shows that the accuracy of the CapsNet-KSVM based model is improved by 94.6% accuracy, 95% sensitivity, and 98% specificity, which outperforms the traditional CNN and other existing ensemble models. The proposed CapsNet-KSVM based system can be employed to identify the presence of COVID-19 in the human body using X-ray images. © 2022 IEEE.

Keywords: COVID-19
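The abstract of this record reports accuracy, sensitivity, and specificity. As a reminder of how these three figures relate to a binary confusion matrix, here is a minimal sketch using the standard definitions; the function name `binary_metrics` and the toy labels are illustrative and are not data from the paper.

```python
def binary_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity, and specificity for 0/1 labels.

    sensitivity = TP / (TP + FN)   (recall on the positive class)
    specificity = TN / (TN + FP)   (recall on the negative class)
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    def ratio(num, den):
        # Undefined ratios (no samples of that class) are reported as None.
        return num / den if den else None

    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
    }


if __name__ == "__main__":
    # Toy labels: 1 = positive finding, 0 = negative finding.
    truth = [1, 1, 1, 0, 0, 0, 0, 1]
    preds = [1, 1, 0, 0, 0, 1, 0, 1]
    print(binary_metrics(truth, preds))
```

Sensitivity and specificity are computed per class, so they can differ sharply from accuracy when positives and negatives are imbalanced.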


Resampling Factor Estimation via Dual-Stream Convolutional Neural Network


Computers, Materials & Continua, 2021, vol. 66, no. 1, pp. 647-657

Authors: Shangjun Luo; Junwei Luo; Wei Lu; Yanmei Fang; Jinhua Zeng; Shaopei Shi; Yue Zhang. Affiliations: School of Data and Computer Science, Guangdong Province Key Laboratory of Information Security Technology, Ministry of Education Key Laboratory of Machine Intelligence and Advanced Computing, Sun Yat-sen University, Guangzhou 510006, China; Academy of Forensic Science, Shanghai 200063, China; College of Information Science and Technology, Jinan University, Guangzhou 510632, China; Department of Computer Science, University of Massachusetts Lowell, Lowell, MA 01854, USA

The estimation of image resampling factors is an important problem in image *** all the resampling factor estimation methods, spectrum-based methods are one of the most widely used methods and have attracted a lot of research ***, because of inherent ambiguity, spectrum-based methods fail to discriminate upscale and downscale operations without any prior *** general, the application of resampling leaves detectable traces in both spatial domain and frequency domain of a resampled ***, the resampling process will introduce correlations between neighboring *** this case, a set of periodic pixels that are correlated to their neighbors can be found in a resampled ***, the resampled image has distinct and strong peaks on spectrum while the spectrum of original image has no clear ***, in this paper, we propose a dual-stream convolutional neural network for image resampling factors *** of the two streams is gray stream whose purpose is to extract resampling traces features directly from the rescaled *** other is frequency stream that discovers the differences of spectrum between rescaled and original *** features from two streams are then fused to construct a feature representation including the resampling traces left in spatial and frequency domain, which is later fed into softmax layer for resampling factor *** results show that the proposed method is effective on resampling factor estimation and outperforms some CNN-based methods.

Keywords: Image forensics; image resampling detection; parameter estimation; convolutional neural network
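The abstract of this record states that resampling introduces correlations between neighboring samples, so a periodic subset of samples becomes predictable from its neighbors. The sketch below demonstrates that effect on a 1-D signal and is not the paper's dual-stream network. It assumes 2x upsampling with linear interpolation, and the helper names `upsample2_linear` and `interpolation_residual` are hypothetical.

```python
def upsample2_linear(x):
    """Upsample a 1-D signal by a factor of 2 using linear interpolation."""
    y = []
    for k in range(len(x) - 1):
        y.append(x[k])                     # original sample, even index
        y.append((x[k] + x[k + 1]) / 2)    # interpolated sample, odd index
    y.append(x[-1])
    return y


def interpolation_residual(y):
    """Absolute error of predicting each interior sample from its two neighbors.

    Entry i of the result belongs to sample index n = i + 1.
    """
    return [abs(y[n] - (y[n - 1] + y[n + 1]) / 2) for n in range(1, len(y) - 1)]


if __name__ == "__main__":
    original = [3, 7, 1, 8, 2, 9, 4]
    resampled = upsample2_linear(original)
    # Residuals alternate between zero (interpolated samples) and nonzero
    # (original samples): a period-2 trace left by the resampling step.
    print(interpolation_residual(resampled))
    # The original signal shows no such periodic pattern.
    print(interpolation_residual(original))
```

A periodic residual of this kind shows up as a strong peak in the frequency domain, which is the trace that spectrum-based estimators look for.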
