作者:
Bian, YuanLiu, MinWang, XuepingMa, YunfengWang, YaonanHunan University
National Engineering Research Center of Robot Visual Perception and Control Technology College of Electrical and Information Engineering Hunan Changsha China Hunan Normal University
Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing College of Information Science and Engineering Hunan Changsha China
Deep learning-based person re-identification (reid) models are widely employed in surveillance systems and inevitably inherit the vulnerability of deep networks to adversarial attacks. Existing attacks merely consider...
详细信息
The traditional sampling-based algorithm such as Rapidly Random-exploring Tree (RRT) and various varieties have achieved tremendous success in the area of path planning. However, their excessive exploration in the sta...
The traditional sampling-based algorithm such as Rapidly Random-exploring Tree (RRT) and various varieties have achieved tremendous success in the area of path planning. However, their excessive exploration in the state space leads to long time to find the optimal solution, large memory usage and cannot guarantee the quality of the planned path(generally evaluated by the cost of search time and the length of path) in sophisticated space. In this article, we propose an optimal path planning algorithm based on heuristic non-uniform sampling, namely the HNSRRT*, which successfully plans path in complex obstacle environments with optimal length and minimum time cost. The HNSRRT* utilizes heuristic function to generate non-uniform sampling distribution by Gaussian distribution,and constraints on sampling points can reduce the time wasted and path length increase caused by excessive exploration. We test the proposed HNSRRT* in 2D and 3D complex obstacle environment,comparing it with the three traditional sampling-base algorithms. The simulation results indicated that the effectiveness and efficiency of HNSRRT* and have an obvious improvement in term of time cost, path length compared with the existing algorithms.
Understanding human actions from body poses is critical for assistive robots sharing space with humans in order to make informed and safe decisions about the next interaction. However, precise temporal localization an...
详细信息
ISBN:
(数字)9798350377705
ISBN:
(纸本)9798350377712
Understanding human actions from body poses is critical for assistive robots sharing space with humans in order to make informed and safe decisions about the next interaction. However, precise temporal localization and annotation of activity sequences is time-consuming and the resulting labels are often noisy. If not effectively addressed, label noise negatively affects the model’s training, resulting in lower recognition quality. Despite its importance, addressing label noise for skeleton-based action recognition has been overlooked so far. In this study, we bridge this gap by implementing a framework that augments well-established skeleton-based human action recognition methods with label-denoising strategies from various research areas to serve as the initial benchmark. Observations reveal that these baselines yield only marginal performance when dealing with sparse skeleton data. Consequently, we introduce a novel methodology, NoiseEraSAR, which integrates global sample selection, co-teaching, and Cross-Modal Mixture-of-Experts (CM-MOE) strategies, aimed at mitigating the adverse impacts of label noise. Our proposed approach demonstrates better performance on the established benchmark, setting new state-of-the-art standards. The source code for this study will be made accessible at https://***/xuyizdby/NoiseEraSAR.
This paper addresses the problem of formation control for a quadrotor swarm (QS) system with directed graph topology under external environmental disturbances and unreliable internal state acquisition. The proposed di...
详细信息
This paper addresses the problem of formation control for a quadrotor swarm (QS) system with directed graph topology under external environmental disturbances and unreliable internal state acquisition. The proposed distributed robust control framework, based on a gemetric controller, incorporates ${\mathcal {L}}_control$ adaptive controllers and differentiator systems. First, the geometric formation controller is designed to implement the formation control of the nominal system. Then, ${\mathcal {L}}_control$ adaptive controllers are designed separately for each quadrotor’s position loop and attitude loop subsystems to address the effects of uncertainties such as external time-varying disturbances (matched and unmatched disturbances) and different mass variations of quadrotors. Furthermore, the differentiator system is devised to accurately estimate the higher-order derivatives of the non-directly-measurable velocity information and the virtual translation control signal, which enhances system accuracy while reducing computational complexity. The Lyapunov stability theory is employed to analyze the stability of the closed-loop system. Finally, the effectiveness and exceptional performance of this approach in QS formation control were validated through numerical simulation and experimental results. Note to Practitioners—The inspiration for this article comes from the issue of formation control in a cluster of quadrotor drones, which is also applicable to formation control in other types of drones. In this paper, a formation control algorithm based on ${\mathcal {L}}_control$ adaptive control strategy and arbitrary-order differentiation is designed. This algorithm can address not only the issue of time-varying wind disturbances frequently encountered during quadrotor drone flights but also the effects of unpredictable velocities and inconsistent masses of quadrotor drones. The disturbance rejection capability of this scheme enables quadrotor drones to be applied more safely and r
Emerging universal Computational Aberration Correction (CAC) paradigms provide an inspiring solution to light-weight and high-quality imaging without repeated data preparation and model training to accommodate new len...
详细信息
Emerging universal Computational Aberration Correction (CAC) paradigms provide an inspiring solution to light-weight and high-quality imaging without repeated data preparation and model training to accommodate new lens designs. However, the training databases in these approaches, i.e., the lens libraries (LensLibs), suffer from their limited coverage of real-world aberration behaviors. Moreover, it is challenging to train a universal model for reliable results in a zero-shot manner, whose inflexible tuning pipeline is also confined to the lens-descriptions-known case. In this work, we set up an OmniLens framework for universal CAC, considering both the generalization ability and flexibility. OmniLens extends the idea of universal CAC to a broader concept, where a base model is trained as the pre-trained model for three cases, including zero-shot CAC with the pre-trained model, few-shot CAC with a little lens-specific data for fine-tuning, and domain adaptive CAC using domain adaptation for lens-descriptions-unknown lens. In terms of OmniLens’s data foundation, we first propose an Evolution-based Automatic Optical Design (EAOD) pipeline to construct the LensLib automatically, coined AODLib, whose diversity is enriched by an evolution framework, with comprehensive constraints and a hybrid optimization strategy for achieving realistic aberration behaviors. For network design, we introduce the guidance of high-quality codebook priors to facilitate both zero-shot CAC and few-shot CAC, which enhances the model’s generalization ability, while also boosting its convergence in a few-shot case. Furthermore, based on the statistical observation of dark channel priors in optical degradation, we design an unsupervised regularization term to adapt the base model to the target descriptions-unknown lens using its aberration images without ground truth. We validate the proposed OmniLens framework on 4 manually designed low-end lenses with various structures and aberration behaviors.
Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompl...
详细信息
The mobile robot relies on SLAM (Simultaneous Localization and Mapping) to provide autonomous navigation and task execution in complex and unknown environments. However, it is hard to develop a dedicated algorithm for...
The mobile robot relies on SLAM (Simultaneous Localization and Mapping) to provide autonomous navigation and task execution in complex and unknown environments. However, it is hard to develop a dedicated algorithm for mobile robots due to dynamic and challenging situations, such as poor lighting conditions and motion blur. To tackle this issue, we propose a tightly-coupled LiDAR-visual SLAM based on geometric features, which includes two sub-systems (LiDAR and monocular visual SLAM) and a fusion framework. The fusion framework associates the depth and semantics of the multi-modal geometric features to complement the visual line landmarks and to add direction optimization in Bundle Adjustment (BA). This further constrains visual odometry. On the other hand, the entire line segment detected by the visual subsystem overcomes the limitation of the LiDAR subsystem, which can only perform the local calculation for geometric features. It adjusts the direction of linear feature points and filters out outliers, leading to a higher accurate odometry system. Finally, we employ a module to detect the subsystem’s operation, providing the LiDAR subsystem’s output as a complementary trajectory to our system while visual subsystem tracking fails. The evaluation results on the public dataset M2DGR, gathered from ground robots across various indoor and outdoor scenarios, show that our system achieves more accurate and robust pose estimation compared to current state-of-the-art multi-modal methods.
Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompl...
详细信息
ISBN:
(数字)9798350348811
ISBN:
(纸本)9798350348828
Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompleteness in multi-modal segmentation remains under-explored. In this work, we establish a task called Modality-Incomplete Scene Segmentation (MISS), which encompasses both system-level modality absence and sensor-level modality errors. To avoid the predominant modality reliance in multi-modal fusion, we introduce a Missing-aware Modal Switch (MMS) strategy to proactively manage missing modalities during training. Utilizing bit-level batch-wise sampling enhances the model’s performance in both complete and incomplete testing scenarios. Furthermore, we introduce the Fourier Prompt Tuning (FPT) method to incorporate representative spectral information into a limited number of learnable prompts that maintain robustness against all MISS scenarios. Akin to fine-tuning effects but with fewer tunable parameters (1.1%). Extensive experiments prove the efficacy of our proposed approach, showcasing an improvement of 5.84% mIoU over the prior state-of-the-art parameter-efficient methods in modality missing. The source code is publicly available at https://***/RuipingL/MISS.
Affordance refers to the functional properties that an agent perceives and utilizes from its environment, and is key perceptual information required for robots to perform actions. This information is rich and multimod...
详细信息
Understanding human actions from body poses is critical for assistive robots sharing space with humans in order to make informed and safe decisions about the next interaction. However, precise temporal localization an...
详细信息
暂无评论