检索结果-内蒙古大学图书馆

TNIE-SLAM: Neural Implicit Surface Reconstruction for Tracking-Oriented SLAM

Journal of Intelligent and Robotic Systems: Theory and Applications 2025年第2期111卷 1-22页

作者： Gan, Baolin Zhang, Congxuan Chen, Shuaixin Chen, Zhen He, Chao Lu, Ke Lu, Feng School of Instrument Science and Optoelectronic Engineering Nanchang Hangkong University Nanchang330063 China Jiangxi Provincial Key Laboratory of Image Processing and Pattern Recognition Nanchang Hangkong University Nanchang330063 China School of Information Engineering Nanchang Hangkong University Nanchang330063 China The College of Engineering Science University of Chinese Academy of Sciences Beijing100049 China

Recent studies on simultaneous localization and mapping (SLAM) have tended to employ implicit neural representation, which can improve the efficiency and robustness of SLAM system. However, these methodologies still face challenges, such as tracking failures and low-precision mapping. In this paper, we propose a dense reconstruction visual SLAM system enhanced with closed-loop threading and local map optimization, named TNIE-SLAM. First, we propose a tracking module that utilizes the similarity of ORB feature descriptors and the feature overlap rate of the current frame to model key frames, and then we define a complete and accurate initial map based on full bundle adjustment, which addresses the issue of tracking failure due to undermapped areas. Second, we add the 2D features of the initial map to the spatiotemporal encoding module to obtain the 3D features, enabling real-time prediction and tracking of unknown areas. Finally, considering the low-precision mapping issue arising from the complex geometric shapes of objects within the scene, we propose a local map optimization module that utilizes truncated signed distance fields to model 3D features and update the spatial occupancy of boundary and contour features of objects. We test our method on the synthetic Replica dataset and the real-world ScanNet and TUM RGB-D datasets to compare with some state-of-the-art RGB-D SLAM methods, and the experimental results indicate our method performs well in both tracking and mapping accuracy, surpassing the existing dense neural RGB-D SLAM methods. © The Author(s) 2025.

关键词： Surface reconstruction

来源：评论

学校读者我要写书评

暂无评论

Hybrid Data-Free Knowledge Distillation

arXiv

引用

arXiv 2024年

作者： Tang, Jialiang Chen, Shuo Gong, Chen School of Computer Science and Engineering Nanjing University of Science and Technology China Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education China Jiangsu Key Laboratory of Image and Video Understanding for Social Security China Center for Advanced Intelligence Project RIKEN Japan Department of Automation Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University China

Data-free knowledge distillation aims to learn a compact student network from a pre-trained large teacher network without using the original training data of the teacher network. Existing collection-based and generation-based methods train student networks by collecting massive real examples and generating synthetic examples, respectively. However, they inevitably become weak in practical scenarios due to the difficulties in gathering or emulating sufficient real-world data. To solve this problem, we propose a novel method called Hybrid Data-Free Distillation (HiDFD), which leverages only a small amount of collected data as well as generates sufficient examples for training student networks. Our HiDFD comprises two primary modules, i.e., the teacher-guided generation and student distillation. The teacher-guided generation module guides a Generative Adversarial Network (GAN) by the teacher network to produce high-quality synthetic examples from very few real-world collected examples. Specifically, we design a feature integration mechanism to prevent the GAN from overfitting and facilitate the reliable representation learning from the teacher network. Meanwhile, we drive a category frequency smoothing technique via the teacher network to balance the generative training of each category. In the student distillation module, we explore a data inflation strategy to properly utilize a blend of real and synthetic data to train the student network via a classifier-sharing-based feature alignment technique. Intensive experiments across multiple benchmarks demonstrate that our HiDFD can achieve state-of-the-art performance using 120 times less collected data than existing methods. Code is available at https://***/tangjialiang97/HiDFD. Copyright © 2024, The Authors. All rights reserved.

关键词： Students

来源：评论

学校读者我要写书评

暂无评论

Online Lidar-Camera Extrinsic Parameters Self-Checking

SSRN

引用

SSRN 2023年

作者： Wei, Pengjin Yan, Guohang You, Xin Fang, Kun Ma, Tao Yang, Jie Liu, Wei Department of Automation Shanghai Jiao Tong University 800 Dongchuan RD Minhang District China Autonomous Driving Group Shanghai AI Laboratory 129 Longwen RD Xuhui District China Institute of Image Processing and Pattern Recognition China

With the development of neural networks and the increasing popularity of automatic driving, the calibration of the LiDAR and the camera has attracted more and more attention. This calibration task is multi-modal, where the rich color and texture information captured by the camera and the accurate three-dimensional spatial information from the LiDAR is incredibly significant for downstream tasks. In real-world applications, as smart cars roll off the production line, their LiDAR and camera systems are meticulously calibrated, while in the rest of the car life period, the poses of the LiDARs and cameras no longer get continually supervised to ensure the security. To this end, this paper proposes a self-checking algorithm to judge whether the extrinsic parameters are well-calibrated during the car life period by introducing a binary classification network based on the fused information from the camera and the LiDAR. Moreover, since there is no such dataset for the task in this work, we further generate a new dataset branch from the KITTI dataset tailored for the task. Our experiments on the proposed dataset branch demonstrate the performance of our method. To the best of our knowledge, this is the first work to address the significance of continually checking the calibrated extrinsic parameters for autonomous driving. The dataset and code are available at https://***/OpenCalib/LiDAR2camera_self-check © 2023, The Authors. All rights reserved.

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

Face Forgery Detection Algorithm Based on Improved MobileViT Network

Face Forgery Detection Algorithm Based on Improved MobileViT...

引用

6th International Conference on Intelligent Computing and Signal processing (ICSP)

作者： Tiantian Wang Xiaoqi Lu School of Information Engineering Inner Mongolia University of Science and Technology Baotou China Inner Mongolia Key Laboratory of Pattern Recognition and Intelligent Image Processing Baotou China Institute of Information Engineering Inner Mongolia University of Technology Hohhot China

DeepFakes blur the boundaries between reality and forgery, resulting in the collapse of exiting credit system, causing immeasurable consequences for national security and social order. Through analysis of existing face forgery techniques, it is found that most generation techniques rely on random noise distribution, and global information will be lost after up sampling. Therefore, we propose a deepfake detection algorithm based on improved MobileViT, which uses CNN local space biasing and the global space representation of the Transformer network to learn the local features and global representation of forged faces, respectively. Coordinate attention is introduced to obtain directional perception and position sensitive information, making the model locate synthetic traces of fake faces better and fusion local and global representation more effectively. For the improved generalization of the model, with the GELU activation function to solve the problem of neuron death. Our model achieved 96.2% on FF++(C23) datasets, and 93.7%,94.1%,96.3%,87.9% on DF, F2F, FS, and NT datasets, respectively. Comparing with previous methods, our model has shown detection robustness and better generalization.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Gait recognition Algorithm GCGait based on Skeleton

Gait Recognition Algorithm GCGait based on Skeleton

引用

Information Technology,Big Data and Artificial Intelligence (ICIBA), International Conference on IEEE International Conference on

作者： Ru Ma Jikai Zhang Mengyu Jia Xiaoqi Lv School of Information Engineering Inner Mongolia University of Science and Technology Baotou China Inner Mongolia Key Laboratory of Pattern Recognition and Intelligent Image Processing Baotou China School of Information Engineering Inner Mongolia University of Technology Hohhot China

To solve the problem of low accuracy of gait recognition in complex scenes, a novel skeleton-based gait recognition algorithm, GCGait, is proposed. Taking human posture as the input of gait feature, the interference caused by wearing changes and other factors is reduced. To extract sufficient input features, multi-branch input is used in the early stage of the model. By introducing the multi-attention mechanism, the network can learn the semantic information of the non-directly connected joints, excavate the most discriminative features from complex videos, and further improve the recognition performance. In order to reduce the influence of cross view, the fusion loss function is used in the experiment. Experimental results show that the average recognition rate of the proposed algorithm on the CASIA-B dataset is improved by 5.2%, and the average recognition accuracy on the OU-MVLP dataset is increased by 66.3%, which proves the effectiveness of the proposed method.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Mushroom Classification Based on Deep Residual Network

Mushroom Classification Based on Deep Residual Network

引用

pattern recognition and Machine Learning (PRML), IEEE International Conference on

作者： Ju Feng Xufeng Ling Yubo Wang Jie Yang School of Artificial Intelligence Shanghai Normal University Tianhua College Shanghai China Shanghai Acoustics Laboratory Chinese Academy of Sciences Shanghai China Institute of Image Processing and Pattern Recognition and Institute of Medical Robotics Shanghai Jiaotong University Shanghai China

Due to the similarity in mushroom features and the difficulty in distinguishing between poisonous and nonpoisonous varieties, mushrooms pose a threat to human health. To address the challenge of mushroom classification and identification, this paper proposes a mushroom classification method based on residual networks. Firstly, a network architecture with multiple residual blocks is designed, and it is trained using an image dataset. Then, a transfer learning strategy is employed to initialize the network parameters from a pre-trained model, followed by fine-tuning to adapt to the mushroom classification task. Finally, multiple testing experiments are conducted to evaluate the effectiveness of the proposed method. The experimental results demonstrate excellent performance of the proposed method in mushroom classification tasks. Compared to traditional feature extraction methods, it can better capture the details and texture features of mushrooms, thereby improving classification accuracy. In conclusion, the mushroom classification method based on residual networks exhibits high accuracy and generalization capability. This method has potential applications in the field of mushroom classification, aiding in the better identification and differentiation of poisonous mushrooms, thereby protecting human health.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Gait Planning and Motion Control Based on Vrep Simulation for Quadruped Robot

Gait Planning and Motion Control Based on Vrep Simulation fo...

引用

WRC Symposium on Advanced Robotics and Automation (WRC SARA)

作者： Linqi Zhou Zhihua Chen Jun Liu Zhi Liu Yumeng Chen Liting Zhang key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition and MOE Key Lab of Nondestructive Testing Technology Nanchang Hangkong University Nanchang China State Key Laboratory of Intelligent Control and Decision of Complex Systems School of Automation Beijing Institute of Technology Beijing China

Gait planning of quadruped robots plays an important role in achieving less walking, including dynamic and static gait. In this article, a static and dynamic gait control method based on center of gravity stability margin is proposed. Firstly, the robot model and kinematics modeling are introduced. Secondly, the robot’s foot static and dynamic gait were planned and the foot trajectory was designed. Finally, two types of gait of the robot were simulated using Vrep simulation software, and the differences in stability and speed between the coordinated gait with speed and stability in the static and dynamic gait of a 12 degree of freedom robot were analyzed, verifying the effectiveness of the gait control method proposed in this paper.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Remote Sensing image Object Detection Method with Feature Denoising Fusion Module

Remote Sensing Image Object Detection Method with Feature De...

引用

IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

作者： Penghui Chen Qishen Li Qiufeng Li Zhongyu Wu School of Information Engineering Nanchang Hangkong University Nanchang Jiangxi China Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition Nanchang Jiangxi China School of Software Nanchang Hangkong University Nanchang Jiangxi China Key Laboratory of Nondestructive Testing (Ministry of Education) Nanchang Hangkong University Nanchang Jiangxi China

Remote sensing object detection is an important research area in computer vision, widely applied in both military and civilian domains. However, challenges in remote sensing image object detection such as large image sizes, complex backgrounds, and significant variations in target scales are prevalent. To address these issues, this paper proposes a new Feature Denoising and Fusion Module (FDFM) aimed at enhancing the accuracy and robustness of object detection. This module comprises a Multi-Scale Denoising Submodule(MDS) and an Attention Optimization Submodule(AOS). The Multi-Scale Denoising Module aims to suppress lower-level texture noise by utilizing higher-level semantic features before the fusion process, reducing the impact of lower-level noise on subsequent multi-scale feature fusion. Meanwhile, the Attention Optimization Module seeks to enhance the precision of self-attention computations within the Multi-Scale Denoising Module without increasing the parameter count. The efficacy of this method was evaluated on public datasets DOTA, VisDrone, VOC and COCO, showing improvements in comparison to baseline models.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Object Detector based on Enhanced Multi-scale Feature Fusion Pyramid Network

Object Detector based on Enhanced Multi-scale Feature Fusion...

引用

IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

作者： Luan Zhao Xiaofeng Zhang Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition Nanchang Hangkong University Nanchang China

ISBN: (数字)9781728180281

ISBN: (纸本)9781728180298

Constructing the pyramidal architecture for the feature is currently a very effective way to obtain feature information of objects at different scales. Although the feature pyramid can realize the recognition and detection of multi-scale objects in the object detection task well, it still has some limitations. Since the feature information of different levels is often not from the same layer of the network, it is difficult to obtain the feature of different objects information at a certain scale from a certain level feature map of the pyramid network. To solve this problem, we present a novel object detection architecture, named Enhanced Multi-scale Feature Fusion Pyramid Network (EMFFPNet). Our network consists of Enhanced Multi-scale Feature Fusion Module (EMFFM) and Predictor Optimization Module (POM). In EMFFM, Features at different levels can be fused into the Enhanced features as outputs, which are more representative and deterministic. In order to enable the enhanced features to play their respective roles in the pyramid network, we assign different weights to fusion features of different levels in POM. We perform the experiments on the COCO detection benchmark. The experimental results indicate that the performance of our model is much better than the state-of-the-art model.

关键词： Object detection Predictive models Feature extraction Task analysis Information technology Optimization Standards

来源：评论

学校读者我要写书评

暂无评论

SiamORPN: Enabling Orthogonality between Object and Background in Siamese Object Tracking

SiamORPN: Enabling Orthogonality between Object and Backgrou...

引用

International Conference on Tools for Artificial Intelligence (ICTAI)

作者： Kai Huang Chaolin Pan Jun Chu Lu Leng Jun Miao Junjiang Wu Lingfeng Wang Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition Nanchang Hangkong University Nanchang China School of Information Science and Technology Beijing University of Chemical Technology Beijing China

Siamese-based trackers currently are the dominant tracking paradigm due to the balance between speed and performance. However, it is prone to drift and tracking failure when the environment is complex and similar objects interfere. While the Siamese-based trackers perform the correlation operation, the responses of the target object and background appear in different channels, i.e., the feature spaces of the target object and background have some orthogonality. However, when meeting background clutters and similar objects interfere, this orthogonality becomes weaker and the wrong classification contribution of the object and the background reduces the stability of the learned similarity function, leading to many misclassified pixels in the heatmaps. In this work, we proposed a SiamORPN to solve the above issues. It is incorporated at two levels: an Orthogonal Region Proposal Network (ORPN) and an Adaptive Pixel-wise Aggregation (APA) module. Specifically, for ORPN, the orthogonality between the object and the background maximizes the inter-class inertia. Moreover, the ORPN introduces the orthogonal module to enhance this orthogonality. For APA, it introduces two lightweight networks to predict the weights of all pixels in different heatmaps and the weights of all pixels in different regression offsets. Experiments on challenging benchmarks, including OTB2015, VOT2016, VOT2018, GOT-10k test set, UAV123, LaSOT, and TrackingNet, demonstrate the proposed SiamORPN outperforms many SOTA trackers and achieves leading performance. The inference speed at GTX1080Ti can reach about 32 FPS, meeting the real-time requirements.

关键词： Heating systems Target tracking Correlation Adaptive systems Benchmark testing Real-time systems Proposals

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：