检索结果-内蒙古大学图书馆

arXiv 2023年

作者： Cao, Yihong Zhang, Hui Lu, Xiao Xiao, Zheng Yang, Kailun Wang, Yaonan College of Computer Science and Electronic Engineering Hunan University Changsha410082 China National Engineering Research Center of Robot Vision Perception and Control Technology School of Robotics Hunan University Changsha410082 China College of Engineering and Design Hunan Normal University Changsha410082 China

Domain adaptive semantic segmentation enables robust pixel-wise understanding in real-world driving scenes. Source-free domain adaptation, as a more practical technique, addresses the concerns of data privacy and storage limitations in typical unsupervised domain adaptation methods, making it especially relevant in the context of intelligent vehicles. It utilizes a well-trained source model and unlabeled target data to achieve adaptation in the target domain. However, in the absence of source data and target labels, current solutions cannot sufficiently reduce the impact of domain shift and fully leverage the information from the target data. In this paper, we propose an end-to-end source-free domain adaptation semantic segmentation method via Importance-Aware and Prototype-Contrast (IAPC) learning. The proposed IAPC framework effectively extracts domain-invariant knowledge from the well-trained source model and learns domain-specific knowledge from the unlabeled target domain. Specifically, considering the problem of domain shift in the prediction of the target domain by the source model, we put forward an importance-aware mechanism for the biased target prediction probability distribution to extract domain-invariant knowledge from the source model. We further introduce a prototype-contrast strategy, which includes a prototype-symmetric cross-entropy loss and a prototype-enhanced cross-entropy loss, to learn target intra-domain knowledge without relying on labels. A comprehensive variety of experiments on two domain adaptive semantic segmentation benchmarks demonstrates that the proposed end-to-end IAPC solution outperforms existing state-of-the-art methods. The source code is publicly available at https://***/yihong-97/Source-free-IAPC. Copyright © 2023, The Authors. All rights reserved.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Steady Tracker: Tracking a Target Stably Using a Quadrotor

Steady Tracker: Tracking a Target Stably Using a Quadrotor

引用

IEEE International Conference on robotics and Biomimetics

作者： Hongwen Li Hang Zhong Yongsheng Lv Jianjun Sha Yu Long Yaonan Wang Harbin Engineering University Harbin Qingdao Innovation and Development Center of Harbin Engineering University Qingdao China School of Robotics National Engineering Research Center for Robot Visual Perception and Control Hunan University College of Electrical and Information Engineering Hunan University Changsha China

ISBN: (纸本)9781665481106

Maneuvering target tracking of Unmanned Aerial Vehicle(UAV) in cluttered environments is a challenging issue owing to the unknown motion intention of the target and the complex moving environments. As the complexity of the environment increases, stable and secure target tracking is increasingly difficult to guarantee. To address the issue, this paper proposes a stable quadrotor tracking solution. The proposed solution contains two parts: target motion prediction and tracking path searching. The target motion prediction method predicts the future target motion based on the obtained target observations while considering observation noise and prediction errors. The tracking path searching method utilizes a sampling-based search method, using the homotopy of paths to ensure that the tracking path and the target position are in the same space. Finally, simulations, real-world experiments and statistical analysis verify the correctness and effectiveness of the proposed approach.

关键词： Target tracking Statistical analysis Prediction methods Search problems Turning Prediction algorithms Convex functions

来源：评论

学校读者我要写书评

暂无评论

Implicit Modality Mining: An End-to-End Method for Multimodal Information Extraction

引用

Journal of Electronic research and Application 2024年第2期8卷 124-139页

作者： Jinle Lu Qinglang Guo School of Cyber Science and Technology University of Science and Technology of ChinaHefei 230027Anhui ProvinceChina National Engineering Research Center for Public Safety Risk Perception and Control by Big Data(RPP) CETC Academy of Electronics and Information Technology Group Co.Ltd.China Academic of Electronics and Information TechnologyBeijing 100041China

Multimodal named entity recognition(MNER)and relation extraction(MRE)are key in social media analysis but face challenges like inefficient visual processing and non-optimal modality interaction.(1)Heavy visual embedding:the process of visual embedding is both time and computationally expensive due to the prerequisite extraction of explicit visual cues from the original image before input into the multimodal ***,these approaches cannot achieve efficient online reasoning;(2)suboptimal interaction handling:the prevalent method of managing interaction between different modalities typically relies on the alternation of self-attention and cross-attention mechanisms or excessive dependence on the gating *** explicit modeling method may fail to capture some nuanced relations between image and text,ultimately undermining the model’s capability to extract optimal *** address these challenges,we introduce Implicit Modality Mining(IMM),a novel end-to-end framework for fine-grained image-text correlation without heavy visual *** uses an Implicit Semantic Alignment module with a Transformer for cross-modal clues and an Insert-Activation module to effectively utilize these *** approach achieves state-of-the-art performance on three datasets.

关键词： Multimodal Named entity recognition Relation extraction Patch projection

来源：评论

学校读者我要写书评

暂无评论

Admittance Based robot Force control Framework for Server Board Assembly

Admittance Based Robot Force Control Framework for Server Bo...

引用

IEEE International Conference on Industrial technology (ICIT)

作者： Yunlong Ma Yaonan Wang Yiming Jiang Xianen Zhou Ge Zhu Qing Zhu Institute of Artificial Intelligence University of Science and Technology Beijing Beijing China Xiangjiang Laboratory Hunan China College of Electrical and Information Engineering Hunan University Changsha China School of Robotics Hunan University Changsha China National Engineering Research Center for Robot Vision Perception and Control Technology Hunan University Changsha China ZTE Corporation Shenzhen China

ISBN: (数字)9798350340266

ISBN: (纸本)9798350340273

In server board assembly tasks, the effect of vision-based robot assembly schemes is not ideal due to the small installation gap and the blocking of vision. Adding force sensors and force controllers can be a good solution to the above problems and increase the flexibility in the assembly process. In this paper, we propose a force control strategy applied to a server board assembly task. The method is divided into two main parts. In the first part, the zero offset of the force sensor and the load gravity are calibrated and compensated, so that the external force on the load is accurately obtained. In the second part, the admittance controller is designed to achieve compliant behavior between the robot and the environment. Finally, the experimental verification of the board insertion is carried out on the experimental platform. Experimental results verify the effectiveness and practicability of the proposed method.

关键词： Process control robot sensing systems Manipulators Servers Force sensors Admittance Force control

来源：评论

学校读者我要写书评

暂无评论

DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction

arXiv

引用

arXiv 2024年

作者： Li, Siyu Lin, Jiacheng Shi, Hao Zhang, Jiaming Wang, Song Yao, You Li, Zhiyong Yang, Kailun The School of Robotics The National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University Changsha410082 China The College of Computer Science and Electronic Engineering Hunan University Changsha410082 China The State Key Laboratory of Extreme Photonics and Instrumentation The National Engineering Research Center of Optical Instrumentation Zhejiang University Hangzhou310027 China The Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Karlsruhe76131 Germany The College of Computer Science Zhejiang University Hangzhou310027 China The USC Viterbi School of Engineering The University of Southern California Los AngelesCA90089 United States

Temporal information plays a pivotal role in Bird’s-Eye-View (BEV) driving scene understanding, which can alleviate the visual information sparsity. However, the indiscriminate temporal fusion method will cause the barrier of feature redundancy when constructing vectorized High-Definition (HD) maps. In this paper, we revisit the temporal fusion of vectorized HD maps, focusing on temporal instance consistency and temporal map consistency learning. To improve the representation of instances in single-frame maps, we introduce a novel method, DTCLMapper. This approach uses a dual-stream temporal consistency learning module that combines instance embedding with geometry maps. In the instance embedding component, our approach integrates temporal Instance Consistency Learning (ICL), ensuring consistency from vector points and instance features aggregated from points. A vectorized points pre-selection module is employed to enhance the regression efficiency of vector points from each instance. Then aggregated instance features obtained from the vectorized points preselection module are grounded in contrastive learning to realize temporal consistency, where positive and negative samples are selected based on position and semantic information. The geometry mapping component introduces Map Consistency Learning (MCL) designed with self-supervised learning. The MCL enhances the generalization capability of our consistent learning approach by concentrating on the global location and distribution constraints of the instances. Extensive experiments on well-recognized benchmarks indicate that the proposed DTCLMapper achieves state-of-the-art performance in vectorized mapping tasks, reaching 61.9% and 65.1% mAP scores on the nuScenes and Argoverse datasets, respectively. The source code is available at https://***/lynn-yu/DTCLMapper. Copyright © 2024, The Authors. All rights reserved.

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

Quantum machine learning for multiclass classification beyond kernel methods

arXiv

引用

arXiv 2024年

作者： Ding, Chao Wang, Shi Wang, Yaonan Gao, Weibo College of Electrical and Information Engineering Hunan University Changsha410082 China Division of Physics and Applied Physics School of Physical and Mathematical Sciences Nanyang Technological University Singapore637371 Singapore National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University Changsha410082 China Centre for Quantum Technologies National University of Singapore Singapore117543 Singapore The Photonics Institute Centre for Disruptive Photonic Technologies Nanyang Technological University Singapore637371 Singapore

Quantum machine learning is considered one of the current research fields with great potential. In recent years, Havlíček et al. [Nature 567, 209-212 (2019)] have proposed a quantum machine learning algorithm with quantum-enhanced feature spaces, which effectively addressed a binary classification problem on a superconducting processor and offered a potential pathway to achieving quantum advantage. However, a straightforward binary classification algorithm falls short in solving multiclass classification problems. In this paper, we propose a quantum algorithm that rigorously demonstrates that quantum kernel methods enhance the efficiency of multiclass classification in real-world applications, providing quantum advantage. To demonstrate quantum advantage, we design six distinct quantum kernels within the quantum algorithm to map input data into quantum state spaces and estimate the corresponding quantum kernel matrices. The results from quantum simulations reveal that the quantum algorithm outperforms its classical counterpart in handling six real-world multiclass classification problems. Furthermore, we leverage a variety of performance metrics to comprehensively evaluate the classification and generalization performance of the quantum algorithm. The results demonstrate that the quantum algorithm achieves superior classification and better generalization performance relative to classical counterparts. Copyright © 2024, The Authors. All rights reserved.

关键词： Quantum efficiency

来源：评论

学校读者我要写书评

暂无评论

GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction

arXiv

引用

arXiv 2024年

作者： Li, Siyu Yang, Kailun Shi, Hao Wang, Song Yao, You Li, Zhiyong The School of Robotics The National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University Changsha410082 China The State Key Laboratory of Extreme Photonics and Instrumentation Zhejiang University Hangzhou310027 China The College of Computer Science Zhejiang University Hangzhou310027 China Shanghai Supremind Technology Company Ltd. Shanghai201210 China The USC Viterbi School of Engineering The University of Southern California Los AngelesCA90089 United States

Online High-Definition (HD) maps have emerged as the preferred option for autonomous driving, overshadowing the counterpart offline HD maps due to flexible update capability and lower maintenance costs. However, contemporary online HD map models embed parameters of visual sensors into training, resulting in a significant decrease in generalization performance when applied to visual sensors with different parameters. Inspired by the inherent potential of Inverse Perspective Mapping (IPM), where camera parameters are decoupled from the training process, we have designed a universal map generation framework, GenMapping. The framework is established with a triadic synergy architecture, including principal and dual auxiliary branches. When faced with a coarse road image with local distortion translated via IPM, the principal branch learns robust global features under the state space models. The two auxiliary branches are a dense perspective branch and a sparse prior branch. The former exploits the correlation information between static and moving objects, whereas the latter introduces the prior knowledge of OpenStreetMap (OSM). The triple-enhanced merging module is crafted to synergistically integrate the unique spatial features from all three branches. To further improve generalization capabilities, a Cross-View Map Learning (CVML) scheme is leveraged to realize joint learning within the common space. Additionally, a Bidirectional Data Augmentation (BiDA) module is introduced to mitigate reliance on datasets concurrently. A thorough array of experimental results shows that the proposed model surpasses current state-of-the-art methods in both semantic mapping and vectorized mapping, while also maintaining a rapid inference speed. Moreover, in cross-dataset experiments, the generalization of semantic mapping is improved by 17.3% in mIoU, while vectorized mapping is improved by 12.1% in mAP. The source code will be publicly available at https://***/lynn-yu/GenMappin

关键词： Photomapping

来源：评论

学校读者我要写书评

暂无评论

Computational Imaging for Machine perception: Transferring Semantic Segmentation beyond Aberrations

arXiv

引用

arXiv 2022年

作者： Jiang, Qi Shi, Hao Gao, Shaohua Zhang, Jiaming Yang, Kailun Sun, Lei Ni, Huajian Wang, Kaiwei The State Key Laboratory of Extreme Photonics and Instrumentation The National Engineering Research Center of Optical Instrumentation Zhejiang University Hangzhou310027 China The Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Karlsruhe76131 Germany The School of Robotics The National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University Changsha410082 China Shanghai SUPREMIND Technology Company Ltd Shanghai201210 China

Semantic scene understanding with Minimalist Optical Systems (MOS) in mobile and wearable applications remains a challenge due to the corrupted imaging quality induced by optical aberrations. However, previous works only focus on improving the subjective imaging quality through the Computational Imaging (CI) technique, ignoring the feasibility of advancing semantic segmentation. In this paper, we pioneer the investigation of Semantic Segmentation under Optical Aberrations (SSOA) with MOS. To benchmark SSOA, we construct Virtual Prototype Lens (VPL) groups through optical simulation, generating Cityscapes-ab and KITTI-360-ab datasets under different behaviors and levels of aberrations. We look into SSOA via an unsupervised domain adaptation perspective to address the scarcity of labeled aberration data in real-world scenarios. Further, we propose Computational Imaging Assisted Domain Adaptation (CIADA) to leverage prior knowledge of CI for robust performance in SSOA. Based on our benchmark, we conduct experiments on the robustness of classical segmenters against aberrations. In addition, extensive evaluations of possible solutions to SSOA reveal that CIADA achieves superior performance under all aberration distributions, bridging the gap between computational imaging and downstream applications for MOS. The project page is at https://***/zju-jiangqi/CIADA. Copyright © 2022, The Authors. All rights reserved.

关键词： Aberrations

来源：评论

学校读者我要写书评

暂无评论

LF-VISLAM: A SLAM Framework for Large Field-of-View Cameras with Negative Imaging Plane on Mobile Agents

arXiv

引用

arXiv 2022年

作者： Wang, Ze Yang, Kailun Shi, Hao Li, Peng Gao, Fei Bai, Jian Wang, Kaiwei State Key Laboratory of Extreme Photonics and Instrumentation Zhejiang University China School of Robotics Hunan University China National Engineering Research Center of Robot Visual Perception and Control Technology Hunan University China State Key Laboratory of Industrial Control Technology Zhejiang University China Huzhou Institute of Zhejiang University Zhejiang University China

Simultaneous Localization And Mapping (SLAM) has become a crucial aspect in the fields of autonomous driving and robotics. One crucial component of visual SLAM is the Field-of-View (FoV) of the camera, as a larger FoV allows for a wider range of surrounding elements and features to be perceived. However, when the FoV of the camera reaches the negative half-plane, traditional methods for representing image feature points using (u, v, 1)T become ineffective. While the panoramic FoV is advantageous for loop closure, its benefits are not easily realized under large-attitude-angle differences where loop-closure frames cannot be easily matched by existing methods. As loop closure on wide-FoV panoramic data further comes with a large number of outliers, traditional outlier rejection methods are not directly applicable. To address these issues, we propose LF-VISLAM, a visual Inertial SLAM framework for cameras with extremely Large FoV with loop closure. A three-dimensional vector with unit length is introduced to effectively represent feature points even on the negative half-plane. The attitude information of the SLAM system is leveraged to guide the feature point detection of the loop closure. Additionally, a new outlier rejection method Copyright © 2022, The Authors. All rights reserved.

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

Single-Leader-Dual-Follower Teleoperation in Object-Holding Task with Internal Force Regulation 26

Single-Leader-Dual-Follower Teleoperation in Object-Holding ...

引用

26th International Conference on Automation and Computing, ICAC 2021

作者： Huang, Darong Jiang, Yiming Yang, Chenguang South China University of Technology School of Automation Science and Engineering Guangzhou510641 China Hunan University National Engineering Laboratory for Robot Visual Perception and Control Changsha410082 China University of the West of England Bristol Robotics Laboratory BristolBS16 1QY United Kingdom

ISBN: (纸本)9781860435577

robot teleoperation attracts growing attention of researchers in many domains. Plenty of factors contribute to the good performance of a smart teleoperation system, and one crucial factor is that it provides an environmental feedback for the operator. Hence, in this paper, we offer a solution to help operator to perceive the contact force of the tele-robot. In addition, we introduce a strategy to cooperatively teleoperate two follower robots by using only single leader1 robot, which helps to reduce the control burden for the operator, as well as enables the teleoperation system to avoid the problem caused by the mismatched degree of freedom (DoF). Moreover, an impedance model based technique is employed to regulate the holding internal force during object-holding tasks. Experiments are conducted gradually, using a Touch X device as the leader robot and two arms of the Baxter robot as the tele-robots. Experimental results validate the feasibility of the proposed teleoperation system. © 2021 Chinese Automation and Computing Society in the UK-CACSUK.

关键词： Remote control

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：