检索结果-内蒙古大学图书馆

Learning to Learn Transferable Generative Attack for Person Re-Identification

IEEE Transactions on Image Processing 2025年 PP卷 PP页

作者： Bian, Yuan Liu, Min Wang, Xueping Ma, Yunfeng Wang, Yaonan Hunan University National Engineering Research Center of Robot Visual Perception and Control Technology College of Electrical and Information Engineering Hunan Changsha China Hunan Normal University Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing College of Information Science and Engineering Hunan Changsha China

Deep learning-based person re-identification (reid) models are widely employed in surveillance systems and inevitably inherit the vulnerability of deep networks to adversarial attacks. Existing attacks merely consider cross-dataset and cross-model transferability, ignoring the cross-test capability to perturb models trained in different domains. To powerfully examine the robustness of real-world re-id models, the Meta Transferable Generative Attack (MTGA) method is proposed, which adopts meta-learning optimization to promote the generative attacker producing highly transferable adversarial examples by learning comprehensively simulated transfer-based crossmodel&dataset&test black-box meta attack tasks. Specifically, cross-model&dataset black-box attack tasks are first mimicked by selecting different re-id models and datasets for meta-train and meta-test attack processes. As different models may focus on different feature regions, the Perturbation Random Erasing module is further devised to prevent the attacker from learning to only corrupt model-specific features. To boost the attacker learning to possess cross-test transferability, the Normalization Mix strategy is introduced to imitate diverse feature embedding spaces by mixing multi-domain statistics of target models. Extensive experiments show the superiority of MTGA, especially in cross-model&dataset and cross-model&dataset&test attacks, our MTGA outperforms the SOTA methods by 20.0% and 11.3% on mean mAP drop rate, respectively. © 1992-2012 IEEE.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

HNSRRT*:A Path Planning Algorithm Based On Heuristic Non-Uniform Sampling Method In Complex Obstacle Environment

HNSRRT*:A Path Planning Algorithm Based On Heuristic Non-Uni...

引用

Chinese control Conference (CCC)

作者： Zhiwen Xu Hui Zhang Bo Chen Xidong Zhou Songtao Yin Lian Yang College of Electrical and Information Engineering Changsha University of Science and Technology Hunan China College of Robotics and Robot Visual Perception and Control Technology National Engineering Research Center Hunan University Hunan China

The traditional sampling-based algorithm such as Rapidly Random-exploring Tree (RRT) and various varieties have achieved tremendous success in the area of path planning. However, their excessive exploration in the state space leads to long time to find the optimal solution, large memory usage and cannot guarantee the quality of the planned path(generally evaluated by the cost of search time and the length of path) in sophisticated space. In this article, we propose an optimal path planning algorithm based on heuristic non-uniform sampling, namely the HNSRRT*, which successfully plans path in complex obstacle environments with optimal length and minimum time cost. The HNSRRT* utilizes heuristic function to generate non-uniform sampling distribution by Gaussian distribution,and constraints on sampling points can reduce the time wasted and path length increase caused by excessive exploration. We test the proposed HNSRRT* in 2D and 3D complex obstacle environment,comparing it with the three traditional sampling-base algorithms. The simulation results indicated that the effectiveness and efficiency of HNSRRT* and have an obvious improvement in term of time cost, path length compared with the existing algorithms.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Skeleton-Based Human Action Recognition with Noisy Labels

Skeleton-Based Human Action Recognition with Noisy Labels

引用

IEEE/RSJ International Conference on Intelligent robots and Systems (IROS)

作者： Yi Xu Kunyu Peng Di Wen Ruiping Liu Junwei Zheng Yufan Chen Jiaming Zhang Alina Roitberg Kailun Yang Rainer Stiefelhagen Anthropomatics and Robotics Karlsruhe Institute of Technology Germany Institute for Artificial Intelligence University of Stuttgart Germany School of Robotics Hunan University China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China

ISBN: (数字)9798350377705

ISBN: (纸本)9798350377712

Understanding human actions from body poses is critical for assistive robots sharing space with humans in order to make informed and safe decisions about the next interaction. However, precise temporal localization and annotation of activity sequences is time-consuming and the resulting labels are often noisy. If not effectively addressed, label noise negatively affects the model’s training, resulting in lower recognition quality. Despite its importance, addressing label noise for skeleton-based action recognition has been overlooked so far. In this study, we bridge this gap by implementing a framework that augments well-established skeleton-based human action recognition methods with label-denoising strategies from various research areas to serve as the initial benchmark. Observations reveal that these baselines yield only marginal performance when dealing with sparse skeleton data. Consequently, we introduce a novel methodology, NoiseEraSAR, which integrates global sample selection, co-teaching, and Cross-Modal Mixture-of-Experts (CM-MOE) strategies, aimed at mitigating the adverse impacts of label noise. Our proposed approach demonstrates better performance on the established benchmark, setting new state-of-the-art standards. The source code for this study will be made accessible at https://***/xuyizdby/NoiseEraSAR.

关键词： Training Source coding Noise Training data Benchmark testing Skeleton Noise measurement Human activity recognition Streams Standards

来源：评论

学校读者我要写书评

暂无评论

L₁ Adaptive control-Based Formation Tracking of Multiple Quadrotors Without Linear Velocity Feedback Under Unknown Disturbances

引用

IEEE Transactions on Automation Science and engineering 2024年 22卷 5804-5815页

作者： Yang Hu Zhiqiang Miao Yaonan Wang Haoming Tang Xiangke Wang Wei He College of Electrical and Information Engineering Hunan University Changsha China National Engineering Research Center for Robot Visual Perception and Control Changsha China College of Intelligence Science and Technology National University of Defense Technology Changsha China School of Intelligence Science and Technology and the Key Laboratory of Intelligent Bionic Unmanned Systems Ministry of Education University of Science and Technology Beijing Beijing China

This paper addresses the problem of formation control for a quadrotor swarm (QS) system with directed graph topology under external environmental disturbances and unreliable internal state acquisition. The proposed distributed robust control framework, based on a gemetric controller, incorporates ${\mathcal {L}}_control$ adaptive controllers and differentiator systems. First, the geometric formation controller is designed to implement the formation control of the nominal system. Then, ${\mathcal {L}}_control$ adaptive controllers are designed separately for each quadrotor’s position loop and attitude loop subsystems to address the effects of uncertainties such as external time-varying disturbances (matched and unmatched disturbances) and different mass variations of quadrotors. Furthermore, the differentiator system is devised to accurately estimate the higher-order derivatives of the non-directly-measurable velocity information and the virtual translation control signal, which enhances system accuracy while reducing computational complexity. The Lyapunov stability theory is employed to analyze the stability of the closed-loop system. Finally, the effectiveness and exceptional performance of this approach in QS formation control were validated through numerical simulation and experimental results. Note to Practitioners—The inspiration for this article comes from the issue of formation control in a cluster of quadrotor drones, which is also applicable to formation control in other types of drones. In this paper, a formation control algorithm based on ${\mathcal {L}}_control$ adaptive control strategy and arbitrary-order differentiation is designed. This algorithm can address not only the issue of time-varying wind disturbances frequently encountered during quadrotor drone flights but also the effects of unpredictable velocities and inconsistent masses of quadrotor drones. The disturbance rejection capability of this scheme enables quadrotor drones to be applied more safely and r

关键词： Quadrotors Formation control Adaptive control Topology Drones Autonomous aerial vehicles Uncertainty

来源：评论

学校读者我要写书评

暂无评论

A Flexible Framework for Universal Computational Aberration Correction via Automatic Lens Library Generation and Domain Adaptation

arXiv

引用

arXiv 2024年

作者： Jiang, Qi Gao, Yao Gao, Shaohua Yi, Zhonghua Sun, Lei Shi, Hao Yang, Kailun Wang, Kaiwei Bai, Jian State Key Laboratory of Extreme Photonics and Instrumentation College of Optical Science and Engineering Zhejiang University Hangzhou310027 China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University Changsha410082 China

Emerging universal Computational Aberration Correction (CAC) paradigms provide an inspiring solution to light-weight and high-quality imaging without repeated data preparation and model training to accommodate new lens designs. However, the training databases in these approaches, i.e., the lens libraries (LensLibs), suffer from their limited coverage of real-world aberration behaviors. Moreover, it is challenging to train a universal model for reliable results in a zero-shot manner, whose inflexible tuning pipeline is also confined to the lens-descriptions-known case. In this work, we set up an OmniLens framework for universal CAC, considering both the generalization ability and flexibility. OmniLens extends the idea of universal CAC to a broader concept, where a base model is trained as the pre-trained model for three cases, including zero-shot CAC with the pre-trained model, few-shot CAC with a little lens-specific data for fine-tuning, and domain adaptive CAC using domain adaptation for lens-descriptions-unknown lens. In terms of OmniLens’s data foundation, we first propose an Evolution-based Automatic Optical Design (EAOD) pipeline to construct the LensLib automatically, coined AODLib, whose diversity is enriched by an evolution framework, with comprehensive constraints and a hybrid optimization strategy for achieving realistic aberration behaviors. For network design, we introduce the guidance of high-quality codebook priors to facilitate both zero-shot CAC and few-shot CAC, which enhances the model’s generalization ability, while also boosting its convergence in a few-shot case. Furthermore, based on the statistical observation of dark channel priors in optical degradation, we design an unsupervised regularization term to adapt the base model to the target descriptions-unknown lens using its aberration images without ground truth. We validate the proposed OmniLens framework on 4 manually designed low-end lenses with various structures and aberration behaviors.

关键词： Aberrations

来源：评论

学校读者我要写书评

暂无评论

Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

arXiv

引用

arXiv 2024年

作者： Liu, Ruiping Zhang, Jiaming Peng, Kunyu Chen, Yufan Cao, Ke Zheng, Junwei Sarfraz, M. Saquib Yang, Kailun Stiefelhagen, Rainer Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Germany Mercedes-Benz Tech Innovation Germany School of Robotics Hunan University China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China

Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompleteness in multi-modal segmentation remains under-explored. In this work, we establish a task called Modality-Incomplete Scene Segmentation (MISS), which encompasses both system-level modality absence and sensor-level modality errors. To avoid the predominant modality reliance in multi-modal fusion, we introduce a Missing-aware Modal Switch (MMS) strategy to proactively manage missing modalities during training. Utilizing bit-level batch-wise sampling enhances the model’s performance in both complete and incomplete testing scenarios. Furthermore, we introduce the Fourier Prompt Tuning (FPT) method to incorporate representative spectral information into a limited number of learnable prompts that maintain robustness against all MISS scenarios. Akin to fine-tuning effects but with fewer tunable parameters (1.1%). Extensive experiments prove the efficacy of our proposed approach, showcasing an improvement of 5.84% mIoU over the prior state-of-the-art parameter-efficient methods in modality missing. The source code is publicly available at https://***/RuipingL/MISS. Copyright © 2024, The Authors. All rights reserved.

关键词： Sensory perception

来源：评论

学校读者我要写书评

暂无评论

Tightly-Coupled LiDAR-visual SLAM Based on Geometric Features for Mobile Agents

Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Feature...

引用

IEEE International Conference on robotics and Biomimetics

作者： Ke Cao Ruiping Liu Ze Wang Kunyu Peng Jiaming Zhang Junwei Zheng Zhifeng Teng Kailun Yang Rainer Stiefelhagen Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Germany School of Robotics Hunan University China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China

The mobile robot relies on SLAM (Simultaneous Localization and Mapping) to provide autonomous navigation and task execution in complex and unknown environments. However, it is hard to develop a dedicated algorithm for mobile robots due to dynamic and challenging situations, such as poor lighting conditions and motion blur. To tackle this issue, we propose a tightly-coupled LiDAR-visual SLAM based on geometric features, which includes two sub-systems (LiDAR and monocular visual SLAM) and a fusion framework. The fusion framework associates the depth and semantics of the multi-modal geometric features to complement the visual line landmarks and to add direction optimization in Bundle Adjustment (BA). This further constrains visual odometry. On the other hand, the entire line segment detected by the visual subsystem overcomes the limitation of the LiDAR subsystem, which can only perform the local calculation for geometric features. It adjusts the direction of linear feature points and filters out outliers, leading to a higher accurate odometry system. Finally, we employ a module to detect the subsystem’s operation, providing the LiDAR subsystem’s output as a complementary trajectory to our system while visual subsystem tracking fails. The evaluation results on the public dataset M2DGR, gathered from ground robots across various indoor and outdoor scenarios, show that our system achieves more accurate and robust pose estimation compared to current state-of-the-art multi-modal methods.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

Fourier Prompt Tuning for Modality-Incomplete Scene Segmenta...

引用

IEEE Symposium on Intelligent Vehicle

作者： Ruiping Liu Jiaming Zhang Kunyu Peng Yufan Chen Ke Cao Junwei Zheng M. Saquib Sarfraz Kailun Yang Rainer Stiefelhagen Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Germany Mercedes-Benz Tech Innovation Germany School of Robotics Hunan University China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China

ISBN: (数字)9798350348811

ISBN: (纸本)9798350348828

关键词： Training Rain Source coding Semantic segmentation Semantics Switches Benchmark testing

来源：评论

学校读者我要写书评

暂无评论

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts

arXiv

引用

arXiv 2025年

作者： Huang, Yizhou Yang, Fan Zhu, Guoliang Li, Gen Shi, Hao Zuo, Yukun Chen, Wenrui Li, Zhiyong Yang, Kailun School of Robotics and the National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China School of Informatics University of Edinburgh United Kingdom State Key Laboratory of Extreme Photonics and Instrumentation Zhejiang University China

Affordance refers to the functional properties that an agent perceives and utilizes from its environment, and is key perceptual information required for robots to perform actions. This information is rich and multimodal in nature. Existing multimodal affordance methods face limitations in extracting useful information, mainly due to simple structural designs, basic fusion methods, and large model parameters, making it difficult to meet the performance requirements for practical deployment. To address these issues, this paper proposes the BiT-Align image-depth-text affordance mapping framework. The framework includes a Bypass Prompt Module (BPM) and a Text Feature Guidance (TFG) attention selection mechanism. BPM integrates the auxiliary modality depth image directly as a prompt to the primary modality RGB image, embedding it into the primary modality encoder without introducing additional encoders. This reduces the model’s parameter count and effectively improves functional region localization accuracy. The TFG mechanism guides the selection and enhancement of attention heads in the image encoder using textual features, improving the understanding of affordance characteristics. Experimental results demonstrate that the proposed method achieves significant performance improvements on public AGD20K and HICO-IIF datasets. On the AGD20K dataset, compared with the current state-of-the-art method, we achieve a 6.0% improvement in the KLD metric, while reducing model parameters by 88.8%, demonstrating practical application values. The source code will be made publicly available at https://***/DAWDSE/BiT-Align. Copyright © 2025, The Authors. All rights reserved.

关键词： Image coding

来源：评论

学校读者我要写书评

暂无评论

Skeleton-Based Human Action Recognition with Noisy Labels

arXiv

引用

arXiv 2024年

作者： Xu, Yi Peng, Kunyu Wen, Di Liu, Ruiping Zheng, Junwei Chen, Yufan Zhang, Jiaming Roitberg, Alina Yang, Kailun Stiefelhagen, Rainer Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Germany Institute for Artificial Intelligence University of Stuttgart Germany School of Robotics Hunan University China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China

Understanding human actions from body poses is critical for assistive robots sharing space with humans in order to make informed and safe decisions about the next interaction. However, precise temporal localization and annotation of activity sequences is time-consuming and the resulting labels are often noisy. If not effectively addressed, label noise negatively affects the model’s training, resulting in lower recognition quality. Despite its importance, addressing label noise for skeleton-based action recognition has been overlooked so far. In this study, we bridge this gap by implementing a framework that augments well-established skeleton-based human action recognition methods with label-denoising strategies from various research areas to serve as the initial benchmark. Observations reveal that these baselines yield only marginal performance when dealing with sparse skeleton data. Consequently, we introduce a novel methodology, NoiseEraSAR, which integrates global sample selection, co-teaching, and Cross-Modal Mixture-of-Experts (CM-MOE) strategies, aimed at mitigating the adverse impacts of label noise. Our proposed approach demonstrates better performance on the established benchmark, setting new state-of-the-art standards. The source code for this study is accessible at https://***/xuyizdby/NoiseEraSAR. Copyright © 2024, The Authors. All rights reserved.

关键词： Musculoskeletal system

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：