检索结果-内蒙古大学图书馆

Complexity based investigation in collaborative assembly scenarios via non intrusive techniques 4

学校读者我要写书评

暂无评论

Complexity based investigation in collaborative assembly sce...

4th International Conference on Industry 4.0 and Smart Manufacturing, ISM 2022

作者： Panagou, Sotirios Sileo, Monica Papoutsakis, Konstantinos Fruggiero, Fabio Qammaz, Ammar Argyros, Antonis School of Engineering University of Basilicata Potenza85100 Italy Computational Vision and Robotics Laboratory Institute of Computer Science Foundation for Research and Technology - Hellas Heraklion70013 Greece

Human and robot collaboration in assembly tasks is an integral part in modern manufactories. Robots provide advantages in both process and productivity with their repeatability and usability in different tasks, while human operators provide flexibility and can act as safeguards. However, process complexity increases which can lower the overall quality. Increased complexity can negatively influence decision making due to cognitive load on human operators, which can lead to lower quality, be it product, process or human work. Moreover, it can lead to safety risks, human-system error and accidents. In this work, we present the preliminary results on an experiment performed with student-participants, based on an assembly task. The experiment was set up to emulate an industrial assembly, and data collection was performed through qualitative and non-intrusive quantitative methods. Questionnaires were used to assess perceptual task complexity and cognitive load, while a stereo camera provided recordings for after-task analysis on process errors and human work quality based on a 3D skeleton-based human pose estimation and tracking method. The aim of the study is to investigate causes of errors and implications on quality. Future direction of the work is discussed. © 2022 The Authors. Published by ELSEVIER B.V.

关键词： Decision making

Complexity based investigation in collaborative assembly scenarios via non intrusive techniques

学校读者我要写书评

暂无评论

Procedia computer Science 2023年 217卷 478-485页

作者： Sotirios Panagou Monica Sileo Konstantinos Papoutsakis Fabio Fruggiero Ammar Qammaz Antonis Argyros School of Engineering University of Basilicata 85100 Potenza Italy Computational Vision and Robotics Laboratory Institute of Computer Science Foundation for Research and Technology – Hellas 70013 Heraklion Greece

关键词： collaboration assembly task complexity cognitive load human pose estimation tracking

A single 2D pose with context is worth hundreds for 3D human pose estimation 23

学校读者我要写书评

暂无评论

A single 2D pose with context is worth hundreds for 3D human...

Proceedings of the 37th International Conference on Neural Information Processing Systems

作者： Qitao Zhao Ce Zheng Mengyuan Liu Chen Chen Robotics Institute Carnegie Mellon University Center for Research in Computer Vision University of Central Florida Key Laboratory of Machine Perception Peking University Shenzhen Graduate School

关键词：

HPPS: A Hierarchical Progressive Perception System for Luggage Trolley Detection and Localization at Airports

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Sun, Zhirui Zhang, Zhe Zhao, Jieting Ye, Hanjing Wang, Jiankun Shenzhen Key Laboratory of Robotics Perception and Intelligence Department of Electronic and Electrical Engineering Southern University of Science and Technology Shenzhen China Jiaxing Research Institute Southern University of Science and Technology Jiaxing China Shenzhen Key Laboratory of Robotics and Computer Vision Department of Electronic and Electrical Engineering Southern University of Science and Technology Shenzhen China

The robotic autonomous luggage trolley collection system employs robots to gather and transport scattered luggage trolleys at airports. However, existing methods for detecting and locating these luggage trolleys often fail when they are not fully visible. To address this, we introduce the Hierarchical Progressive Perception System (HPPS), which enhances the detection and localization of luggage trolleys under partial occlusion. The HPPS processes the luggage trolley’s position (x, y) and orientation (θ) separately, which requires only RGB images for labeling and training, eliminating the need for 3D coordinates and alignment. The HPPS can accurately determine the position of the luggage trolley with just one well-detected keypoint and estimate the luggage trolley’s orientation when it is partially occluded. Once the luggage trolley’s initial pose is detected, HPPS updates this information continuously to refine its accuracy until the robot begins grasping. The experiments on detection and localization demonstrate that HPPS is more reliable under partial occlusion compared to existing methods. Its effectiveness and robustness have also been confirmed through practical tests in actual luggage trolley collection tasks. A website about this work is available at HPPS. Copyright © 2024, The Authors. All rights reserved.

关键词： robotics

An Experimental Study of Keypoint Descriptor Fusion

学校读者我要写书评

暂无评论

An Experimental Study of Keypoint Descriptor Fusion

IEEE International Conference on robotics and Biomimetics

作者： Yaling Pan Li He Yisheng Guan Hong Zhang Biomimetic Intelligent Robotics Research Laboratory (BIRL) Guangdong University of Technology Guangzhou China Shenzhen Key Laboratory of Robotics and Computer Vision Southern University of Science and Technology Shenzhen China

ISBN: (纸本)9781665481106

Local feature descriptors play a crucial role in computer vision problems, especially robot motion. Existing descriptors are highly accurate, but their performance de-pends on the influence of distracting factors, such as illumi-nation and viewpoint. There is room for further improvement of these descriptors. In this paper, we provide an in-depth analysis of several exciting features of the descriptor fusion model (DFM) we have proposed in our recent work, which uses an autoencoder to combine descriptors and exploit their respective advantages. With this DFM framework, we fur-ther validate that fused descriptors can retain advantageous properties and that our DFM is a generally applicable method with respect to various component descriptors. Specifically, we evaluate multiple combinations of hand-crafted and CNN descriptors concerning their performance on a benchmark dataset with illumination and viewpoint changes to obtain comprehensive experimental results. The results show that the fused descriptors have better matching accuracy than their component descriptors.

关键词： Robot motion computer vision Analytical models Biomimetics Lighting Benchmark testing

Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Lin, Qinliang Luo, Cheng Niu, Zenghao He, Xilin Xie, Weicheng Hou, Yuanbo Shen, Linlin Song, Siyang Computer Vision Institute School of Computer Science & Software Engineering Shenzhen University China Shenzhen Institute of Artificial Intelligence and Robotics for Society China Guangdong Key Laboratory of Intelligent Information Processing China WAVES Research Group Ghent University Belgium University of Leicester United Kingdom

Adversarial examples generated by a surrogate model typically exhibit limited transferability to unknown target systems. To address this problem, many transferability enhancement approaches (e.g., input transformation and model augmentation) have been proposed. However, they show poor performances in attacking systems having different model genera from the surrogate model. In this paper, we propose a novel and generic attacking strategy, called Deformation-Constrained Warping Attack (DeCoWA), that can be effectively applied to cross model genus attack. Specifically, DeCoWA firstly augments input examples via an elastic deformation, namely Deformation-Constrained Warping (DeCoW), to obtain rich local details of the augmented input. To avoid severe distortion of global semantics led by random deformation, DeCoW further constrains the strength and direction of the warping transformation by a novel adaptive control strategy. Extensive experiments demonstrate that the transferable examples crafted by our DeCoWA on CNN surrogates can significantly hinder the performance of Transformers (and vice versa) on various tasks, including image classification, video action recognition, and audio recognition. Code is made available at https://***/LinQinLiang/DeCoWA. © 2024, CC BY-NC-SA.

关键词： Semantics

A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation

学校读者我要写书评

暂无评论

arXiv 2023年

The dominant paradigm in 3D human pose estimation that lifts a 2D pose sequence to 3D heavily relies on long-term temporal clues (i.e., using a daunting number of video frames) for improved accuracy, which incurs performance saturation, intractable computation and the non-causal problem. This can be attributed to their inherent inability to perceive spatial context as plain 2D joint coordinates carry no visual cues. To address this issue, we propose a straightforward yet powerful solution: leveraging the readily available intermediate visual representations produced by off-the-shelf (pre-trained) 2D pose detectors - no finetuning on the 3D task is even needed. The key observation is that, while the pose detector learns to localize 2D joints, such representations (e.g., feature maps) implicitly encode the joint-centric spatial context thanks to the regional operations in backbone networks. We design a simple baseline named Context-Aware PoseFormer to showcase its effectiveness. Without access to any temporal information, the proposed method significantly outperforms its context-agnostic counterpart, PoseFormer [77], and other state-of-the-art methods using up to hundreds of video frames regarding both speed and precision. Project page: ***/ContextAware-PoseFormer. Copyright © 2023, The Authors. All rights reserved.

关键词： Machine learning

Edge-guided Representation Learning for Underwater Object Detection

学校读者我要写书评

暂无评论

arXiv 2023年

作者： Dai, Linhui Liu, Hong Song, Pinhao Tang, Hao Ding, Runwei Li, Shengquan Key Laboratory of Machine Perception Shenzhen Graduate School Peking University Shenzhen China Robotics Research Group KU Leuven Leuven Belgium Computer Vision Lab ETH Zurich Zurich Switzerland Peng Cheng Laboratory Shenzhen China

Underwater object detection (UOD) is crucial for marine economic development, environmental protection, and the planet’s sustainable development. The main challenges of this task arise from low-contrast, small objects, and mimicry of aquatic organisms. The key to addressing these challenges is to focus the model on obtaining more discriminative information. We observe that the edges of underwater objects are highly unique and can be distinguished from low-contrast or mimicry environments based on their edges. Motivated by this observation, we propose an Edge-guided Representation Learning Network, termed ERL-Net, that aims to achieve discriminative representation learning and aggregation under the guidance of edge cues. Firstly, we introduce an edge-guided attention module to model the explicit boundary information, which generates more discriminative features. Secondly, a feature aggregation module is proposed to aggregate the multi-scale discriminative features by regrouping them into three levels, effectively aggregating global and local information for locating and recognizing underwater objects. Finally, we propose a wide and asymmetric receptive field block to enable features to have a wider receptive field, allowing the model to focus on more small object information. Comprehensive experiments on three challenging underwater datasets show that our method achieves superior performance on the UOD task. Copyright © 2023, The Authors. All rights reserved.

关键词： Object detection

CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition

学校读者我要写书评

暂无评论

arXiv 2023年

作者： Dhiaf, Marwa Souibgui, Mohamed Ali Wang, Kai Liu, Yuyang Kessentini, Yousri Fornés, Alicia Rouhou, Ahmed Cheikh InstaDeep United Kingdom Computer Vision Center UAB Spain Digital Research Center of Sfax SM@RTS Tunisia State Key Laboratory of Robotics China Shenyang Institute of Automation Chinese Academy of Sciences China

Self-supervised learning has recently emerged as a strong alternative in document analysis. These approaches are now capable of learning high-quality image representations and overcoming the limitations of supervised methods, which require a large amount of labeled data. However, these methods are unable to capture new knowledge in an incremental fashion, i.e. when data is presented to the model sequentially, which is closer to the realistic scenario. In this paper, we explore the potential of continual self-supervised learning to alleviate the catastrophic forgetting problem in handwritten text recognition, as an example of sequence recognition. Our method consists in adding intermediate layers called adapters for each task, and efficiently distilling knowledge from the previous model while learning the current task. Our proposed framework is efficient in both computation and memory complexity. To demonstrate its effectiveness, we evaluate our method by transferring the learned model to diverse text recognition downstream tasks, including Latin and non-Latin scripts. As far as we know, this is the first application of continual self-supervised learning for handwritten text recognition. We attain state-of-the-art performance in English, Italian, and Russian, whilst adding only a few parameters per task. The code and trained models will be publicly available1 © 2023, CC BY.

关键词： Supervised learning