检索结果-内蒙古大学图书馆

Proceedings of the 38th International Conference on Neural Information Processing Systems

作者： Benyuan Meng Qianqian Xu Zitai Wang Zhiyong Yang Xiaochun Cao Qingming Huang Institute of Information Engineering CAS and School of Cyber Security University of Chinese Academy of Sciences Key Lab. of Intelligent Information Processing Institute of Computing Technology CAS and Peng Cheng Laboratory Key Lab. of Intelligent Information Processing Institute of Computing Technology CAS School of Computer Science and Tech. University of Chinese Academy of Sciences School of Cyber Science and Tech. Shenzhen Campus of Sun Yat-sen University School of Computer Science and Tech. University of Chinese Academy of Sciences and Key Lab. of Intelligent Information Processing Institute of Computing Technology CAS and Key Laboratory of Big Data Mining and Knowledge Management CAS

ISBN: (纸本)9798331314385

Diffusion models are powerful generative models, and this capability can also be applied to discrimination. The inner activations of a pre-trained diffusion model can serve as features for discriminative tasks, namely, diffusion feature. We discover that diffusion feature has been hindered by a hidden yet universal phenomenon that we call content shift. To be specific, there are content differences between features and the input image, such as the exact shape of a certain object. We locate the cause of content shift as one inherent characteristic of diffusion models, which suggests the broad existence of this phenomenon in diffusion feature. Further empirical study also indicates that its negative impact is not negligible even when content shift is not visually perceivable. Hence, we propose to suppress content shift to enhance the overall quality of diffusion features. Specifically, content shift is related to the information drift during the process of recovering an image from the noisy input, pointing out the possibility of turning off-the-shelf generation techniques into tools for content shift suppression. We further propose a practical guideline named GATE to efficiently evaluate the potential benefit of a technique and provide an implementation of our methodology. Despite the simplicity, the proposed approach has achieved superior results on various tasks and datasets, validating its potential as a generic booster for diffusion features. Our code is available at https://***/Darkbblue/diffusion-content-shift.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Enhancing EEG-Based Cross-Subject Emotion Recognition via Adaptive Source Joint Domain Adaptation

引用

IEEE Transactions on Affective computing 2024年

作者： Liu, Ke Luo, Xin Zhu, Wenrui Yu, Zhuliang Yu, Hong Xiao, Bin Wu, Wei Chongqing University of Posts and Telecommunications School of Computer Science and Technology Chongqing400065 China Chongqing University of Posts and Telecommunications Key Laboratory of Big Data Intelligent Computing China South China University of Technology College of Automation Science and Engineering Guangzhou510641 China Shanghai Jiao Tong University School of Medicine Department of Neurology Songjjiang Hospital and Songjiang Research Institute Shanghai Key Laboratory of Emotions and Affective Disorders Shanghai201600 China

EEG emotion recognition is crucial in both human-machine interaction and healthcare. However, recognizing emotions across different subjects remains challenging due to individual variability. While existing multi-source domain adaptation methods have been utilized for cross-subject EEG emotion decoding, they often struggle with irrelevant or weakly relevant source domains, leading to negative transfer. Additionally, variations within subdomains are often neglected in these studies. We propose a joint domain adaptation method, Adaptive Source Joint Domain Adaptation (ASJDA) to address these issues. ASJDA utilizes an unsupervised adaptive source selection strategy to select a subset of source domains by evaluating the Jensen-Shannon divergence between the source and target domains, choosing those most relevant to the target. Subsequently, it implements joint domain adaptation with these chosen sources at both the domain and category subdomain levels. Our proposed method outperforms existing state-of-the-art methods, achieving cross-subject accuracies of 96.81% in SEED, 89.69% in SEED-IV, and 69.31% in DEAP. This work significantly advances the state of the art in EEG emotion recognition by effectively addressing the challenges of cross-subject variability. The source code for ASJDA is available at https://***/Pam098/ASJDA. © 2010-2012 IEEE.

关键词： adaptive source selection cross-subject domain adaptation EEG emotion recognition

来源：评论

学校读者我要写书评

暂无评论

Hyperspectral Image Restoration via a New Symmetric ADMM Approach

引用

Mathematical Problems in engineering 2023年第1期2023卷

作者： Wang, Shuo Zhu, Zhibin Zhao, Ruwen Zhang, Benxin School of Electronic Engineering and Automation Key Laboratory of Automatic Detecting Technology and Instruments Guilin University of Electronic Technology Guilin541004 China School of Mathematics and Computing Science Guangxi Colleges and Universities Key Laboratory of Data Analysis and Computation Guilin University of Electronic Technology Guilin541004 China Guilin541004 China

For the hyperspectral image (HSI) denoising problem, a symmetric proximal alternating direction method multiplier (spADMM) is proposed to solve the sparse optimization problem which cannot be solved accurately by traditional ADMM. The proposed method finds a high-quality recovery method using the traditional low-rank Tucker decomposition method, which can fully take into account the overall spatial and spectral correlation between HSI bands by using the Tucker decomposition. By choosing appropriate steps to update the Lagrange multipliers twice, it makes the selection and use of variables more flexible and better for solving sparsity problems. To maintain stability, we also add appropriate proximity terms to solve the problem during the computation. Experiments have shown that the spADMM has better results than the traditional ADMM. The final experimental results on the dataset demonstrate the effectiveness of the method. © 2023 Shuo Wang et al.

关键词： Lagrange multipliers

来源：评论

学校读者我要写书评

暂无评论

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation

引用

IEEE Transactions on Pattern Analysis and Machine Intelligence 2025年第7期47卷 5520-5537页

作者： Jian Liu Wei Sun Hui Yang Pengchao Deng Chongpei Liu Nicu Sebe Hossein Rahmani Ajmal Mian National Engineering Research Center for Robot Visual Perception and Control Technology College of Electrical and Information Engineering School of Robotics and the State Key Laboratory of Advanced Design and Manufacturing for Vehicle Body Hunan University Changsha China Institute of Artificial Intelligence and Robotics Xi’an Jiaotong University Xi’an China Department of Information Engineering and Computer Science University of Trento Trento Italy School of Computing and Communications Lancaster University Lancaster U.K. Department of Computer Science The University of Western Australia Crawley WA Australia

Nine-degrees-of-freedom (9-DoF) object pose and size estimation is crucial for enabling augmented reality and robotic manipulation. Category-level methods have received extensive research attention due to their potential for generalization to intra-class unknown objects. However, these methods require manual collection and labeling of large-scale real-world training data. To address this problem, we introduce a diffusion-based paradigm for domain-generalized category-level 9-DoF object pose estimation. Our motivation is to leverage the latent generalization ability of the diffusion model to address the domain generalization challenge in object pose estimation. This entails training the model exclusively on rendered synthetic data to achieve generalization to real-world scenes. We propose an effective diffusion model to redefine 9-DoF object pose estimation from a generative perspective. Our model does not require any 3D shape priors during training or inference. By employing the Denoising Diffusion Implicit Model, we demonstrate that the reverse diffusion process can be executed in as few as 3 steps, achieving near real-time performance. Finally, we design a robotic grasping system comprising both hardware and software components. Through comprehensive experiments on two benchmark datasets and the real-world robotic system, we show that our method achieves state-of-the-art domain generalization performance.

关键词： Shape Solid modeling Three-dimensional displays Pose estimation Training Robots Grasping Diffusion models Real-time systems Point cloud compression

来源：评论

学校读者我要写书评

暂无评论

How to Perform Energy-Balanced Underwater data Collection in AUV-Aided UASNs: A Social Welfare-Based Node Clustering Approach

引用

IEEE Transactions on Computational Social Systems 2024年

作者： Lin, Chuan Han, Guangjie Lu, Chang Hussain Shah, Syed Bilal Zhang, Yu Wang, Feng Northeastern University Software College Shenyang110819 China Hohai University Key Laboratory of Data Analytics and Optimization for Smart Industry Ministry of Education Nanjing210013 China Hohai University Key Laboratory of Maritime Intelligent Network Information Technology Ministry of Education Nanjing210013 China Dalian University of Technology School of Software Dalian116024 China Dar Al-Hekma University School of Engineering Computing and Informatics Jeddah21589 Saudi Arabia Suzhou Vocational University School of Engineering Suzhou215004 China

The rapid evolution of the Internet of Underwater Things (IoUT) has led to the widespread adoption of autonomous underwater vehicle (AUV)-assisted underwater acoustic sensor networks (UASNs) for various applications such as marine environment monitoring and resource exploration. This article introduces an energy-balanced data collection scheme tailored for AUV-supported UASNs. The proposed scheme combines a node clustering method based on a social welfare function and an intelligent path planning strategy for the AUV. The node clustering approach integrates canopy and K-means algorithms for initial node clustering, followed by reclustering using an enhanced hierarchical clustering algorithm. To balance energy distribution, Atkinson's social welfare function is employed to select and rotate cluster heads (CHs) within each cluster. To address limited CH memory constraints, a lossless compression technique is introduced to reduce data storage requirements at the CHs. Moreover, the article introduces the use of the deep Q-network (DQN) technique for AUV path planning, considering multiple pertinent factors simultaneously. Simulation results demonstrate that the proposed data collection scheme effectively reduces energy consumption, prolongs network lifespan, and enhances data collection efficiency when compared to recent research endeavors. © 2014 IEEE.

关键词： Marine applications

来源：评论

学校读者我要写书评

暂无评论

engineering single-molecule fluorescence withasymmetric nano-antennas

引用

Light(Science & Applications) 2021年第5期10卷 858-866页

作者： Wenqi Zhao Xiaochaoran Tian Zhening Fang Shiyi Xiao Meng Qiu Qiong He Wei Feng Fuyou Li Yuanbo Zhang Lei Zhou Yan-Wen Tan State Key Laboratory of Surface Physics and Department of Physics Fudan UniversityShanghai 200433China Shanghai Institute for Advanced Communication and Data Science Shanghai UniversityShanghai 200444China Key Laboratory of Specialty Fiber Optics and Optical Access Networks Joint International Research Laboratory of Specialty Fiber Optics and Advanced CommunicationShanghai UniversityShanghai 200444China Department of Chemistry and State Key Laboratory of Molecular Engineering of Polymers Fudan UniversityShanghai 200433China Institute for Nanoelectronic Devices and Quantum Computing Fudan UniversityShanghai 200433China Collaborative Innovation Center of Advanced Microstructures Nanjing 210093China Multiscale Research Institute of Complex Systems Fudan UniversityShanghai 200433China

As a powerful tool for studying molecular dynamics in bioscience,single-molecule fluorescence detection providesdynamical information buried in ensemble *** in the near-infrared(NIR)is particularly usefulbecause it offers higher signal-to-noise ratio and increased penetration depth in tissue compared with *** low quantum yield of most NIR fluorophores,however,makes the detection of single-moleculefluorescence ***,we use asymmetric plasmonic nano-antenna to enhance the fluorescence intensity ofAlEE1000,a typical NIR dye,by a factor up to *** asymmetric nano-antenna achieve such an enhancement mainlyby increasing the quantum yield(to~80%)rather than the local field,which degrades the molecules'*** coupled-mode-theory analysis reveals that the enhancements stem from resonance-matching between antennaand molecule and,more importantly,from optimizing the coupling between the near-and far-ield modes withdesigner asymmetric *** work provides a universal scheme for engineering single-molecule fluorescence inthe near-infrared regime.

关键词： stability. asymmetric quantum

来源：评论

学校读者我要写书评

暂无评论

Detector Collapse: Physical-World Backdooring Object Detection to Catastrophic Overload or Blindness in Autonomous Driving

arXiv

引用

arXiv 2024年

作者： Zhang, Hangtao Hu, Shengshan Wang, Yichen Zhang, Leo Yu Zhou, Ziqi Wang, Xianlong Zhang, Yanjun Chen, Chao School of Cyber Science and Engineering Huazhong University of Science and Technology China School of Computer Science and Technology Huazhong University of Science and Technology China National Engineering Research Center for Big Data Technology and System Services Computing Technology and System Lab Hubei Engineering Research Center on Big Data Security China Hubei Key Laboratory of Distributed System Security China Cluster and Grid Computing Lab China School of Information and Communication Technology Griffith University Australia University of Technology Sydney Australia RMIT University Australia

Object detection tasks, crucial in safety-critical systems like autonomous driving, focus on pinpointing object locations. These detectors are known to be susceptible to backdoor attacks. However, existing backdoor techniques have primarily been adapted from classification tasks, overlooking deeper vulnerabilities specific to object detection. This paper is dedicated to bridging this gap by introducing Detector Collapse (DC), a brand-new backdoor attack paradigm tailored for object detection. DC is designed to instantly incapacitate detectors (i.e., severely impairing detector's performance and culminating in a denial-of-service). To this end, we develop two innovative attack schemes: SPONGE for triggering widespread misidentifications and BLINDING for rendering objects invisible. Remarkably, we introduce a novel poisoning strategy exploiting natural objects, enabling DC to act as a practical backdoor in real-world environments. Our experiments on different detectors across several benchmarks show a significant improvement (∼10%-60% absolute and ∼2-7× relative) in attack efficacy over state-of-the-art attacks. Copyright © 2024, The Authors. All rights reserved.

关键词： Denial-of-service attack

来源：评论

学校读者我要写书评

暂无评论

EmT: A Novel Transformer for Generalized Cross-subject EEG Emotion Recognition

arXiv

引用

arXiv 2024年

作者： Ding, Yi Tong, Chengxuan Zhang, Shuailei Jiang, Muyun Li, Yong Liang, Kevin Lim Jun Guan, Cuntai College of Computing and Data Science Nanyang Technological University 50 Nanyang Avenue Singapore639798 Singapore Wilmar International Singapore School of Computer Science and Engineering Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications Southeast University Nanjing210096 China

Integrating prior knowledge of neurophysiology into neural network architecture enhances the performance of emotion decoding. While numerous techniques emphasize learning spatial and short-term temporal patterns, there has been limited emphasis on capturing the vital long-term contextual information associated with emotional cognitive processes. In order to address this discrepancy, we introduce a novel transformer model called emotion transformer (EmT). EmT is designed to excel in both generalized cross-subject EEG emotion classification and regression tasks. In EmT, EEG signals are transformed into a temporal graph format, creating a sequence of EEG feature graphs using a temporal graph construction module (TGC). A novel residual multi-view pyramid GCN module (RMPG) is then proposed to learn dynamic graph representations for each EEG feature graph within the series, and the learned representations of each graph are fused into one token. Furthermore, we design a temporal contextual transformer module (TCT) with two types of token mixers to learn the temporal contextual information. Finally, the task-specific output module (TSO) generates the desired outputs. Experiments on four publicly available datasets show that EmT achieves higher results than the baseline methods for both EEG emotion classification and regression tasks. The code is available at https://***/yi-ding-cs/EmT. Copyright © 2024, The Authors. All rights reserved.

关键词： Emotion Recognition

来源：评论

学校读者我要写书评

暂无评论

Causal-IQA: towards the generalization of image quality assessment based on causal inference 24

Causal-IQA: towards the generalization of image quality asse...

引用

Proceedings of the 41st International Conference on Machine Learning

作者： Yan Zhong Xingyu Wu Li Zhang Chenxi Yang Tingting Jiang School of Mathematical Sciences and National Engineering Research Center of Visual Technology National Key Laboratory for Multimedia Information Processing School of Computer Science Peking University Beijing China Department of Computing The Hong Kong Polytechnic University Hong Kong SAR China Hefei Institute of Physical Science Chinese Academy of Sciences University of Science and Technology of China Hefei China National Engineering Research Center of Visual Technology National Key Laboratory for Multimedia Information Processing School of Computer Science and 5National Biomedical Imaging Center Peking University Beijing China

Due to the high cost of Image Quality Assessment (IQA) datasets, achieving robust generalization remains challenging for prevalent deep learning-based IQA methods. To address this, this paper proposes a novel end-to-end blind IQA method: Causal-IQA. Specifically, we first analyze the causal mechanisms in IQA tasks and construct a causal graph to understand the interplay and confounding effects between distortion types, image contents, and subjective human ratings. Then, through shifting the focus from correlations to causality, Causal-IQA aims to improve the estimation accuracy of image quality scores by mitigating the confounding effects using a causality-based optimization strategy. This optimization strategy is implemented on the sample subsets constructed by a Counterfactual Division process based on the Backdoor Criterion. Extensive experiments illustrate the superiority of Causal-IQA.

关键词：

来源：评论

学校读者我要写书评

暂无评论

ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification

arXiv

引用

arXiv 2024年

作者： Wang, Xianlong Hu, Shengshan Zhang, Yechao Zhou, Ziqi Zhang, Leo Yu Xu, Peng Wan, Wei Jin, Hai National Engineering Research Center for Big Data Technology and System China Services Computing Technology and System Lab China Cluster and Grid Computing Lab China Hubei Engineering Research Center on Big Data Security China Hubei Key Laboratory of Distributed System Security China School of Cyber Science and Engineering Huazhong University of Science and Technology Wuhan430074 China School of Computer Science and Technology Huazhong University of Science and Technology Wuhan430074 China School of Information and Communication Technology Griffith University SouthportQLD4215 Australia

Clean-label indiscriminate poisoning attacks add invisible perturbations to correctly labeled training images, thus dramatically reducing the generalization capability of the victim models. Recently, defense mechanisms such as adversarial training, image transformation techniques, and image purification have been proposed. However, these schemes are either susceptible to adaptive attacks, built on unrealistic assumptions, or only effective against specific poison types, limiting their universal applicability. In this research, we propose a more universally effective, practical, and robust defense scheme called ECLIPSE. We first investigate the impact of Gaussian noise on the poisons and theoretically prove that any kind of poison will be largely assimilated when imposing sufficient random noise. In light of this, we assume the victim has access to an extremely limited number of clean images (a more practical scene) and subsequently enlarge this sparse set for training a denoising probabilistic model (a universal denoising tool). We then introduce Gaussian noise to absorb the poisons and apply the model for denoising, resulting in a roughly purified dataset. Finally, to address the trade-off of the inconsistency in the assimilation sensitivity of different poisons by Gaussian noise, we propose a lightweight corruption compensation module to effectively eliminate residual poisons, providing a more universal defense approach. Extensive experiments demonstrate that our defense approach outperforms 10 state-of-the-art defenses. We also propose an adaptive attack against ECLIPSE and verify the robustness of our defense scheme. Our code is available at https://***/CGCL-codes/ECLIPSE. Copyright © 2024, The Authors. All rights reserved.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：