检索结果-内蒙古大学图书馆

IEEE Transactions on intelligent Vehicles 2024年 1-19页

作者： Cao, Yue Shangguan, Wei Visser, Arnoud Chen, Junjie Chai, Linguo Cai, Baigen School of Automation and Intelligence Beijing Jiaotong University Beijing China School of Automation and Intelligence and State Key Laboratory of Rail Traffic Control and Safety Beijing Jiaotong University Beijing China Intelligent Robotics and Computer Vision Lab of the Informatics Institute Faculty of Science University of Amsterdam The Netherlands

Detecting surrounding situations and reacting accordingly to avoid collisions remains a challenging task for autonomous driving. This task requires predicting the trajectories of surrounding agents and assessing the potential risk of future situations, which can be difficult to achieve solely through onboard vehicle devices. Therefore, this paper proposes a cooperative architecture for trajectory prediction and risk assessment conducted on roadside devices (RSUs) to assist Connected and Autonomous Vehicles (CAVs). Firstly, we develop a segmentbased prediction model (SegNet) tailored to hub signalized intersections. Intersections are divided into multiple segments, and the Curvilinear coordinates are utilized to indicate the geometric road features. The model leverages individual interaction cues in the ego segment and group features in the merging segments, while also incorporating traffic signal information to generate multimodal prediction results. In terms of risk assessment, we utilize the prediction results to provide hierarchical assistance, such as risk values, risk maps, and reference trajectories. Offline experimental results demonstrate that our SegNet model achieves competitive and well-balanced performance compared to stateof-the-art methods on the CitySim Database, with more accurate and smooth prediction trajectories. Through real-time CARLA and SUMO co-simulation, the performance of assisted CAVs indicates that they can safely and effectively navigate with the support of the proposed architecture. IEEE

关键词： Real time systems

来源：评论

学校读者我要写书评

暂无评论

Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation

arXiv

引用

arXiv 2024年

作者： Meraz, Md. Ansari, Md Afzal Javed, Mohammed Chakraborty, Pavan Center of Intelligent Robotics Lab Department of Information Technology Computer Vision & Biometrics Lab Department of Information Technology Indian Institute of Information Technology Allahabad U.P Prayagraj India

In recent years, the challenge of 3D shape analysis within point cloud data has gathered significant attention in computer vision. Addressing the complexities of effective 3D information representation and meaningful feature extraction for classification tasks remains crucial. This paper presents Point-GR, a novel deep learning architecture designed explicitly to transform unordered raw point clouds into higher dimensions while preserving local geometric features. It introduces residual-based learning within the network to mitigate the point permutation issues in point cloud data. The proposed Point-GR network significantly reduced the number of network parameters in Classification and PartSegmentation compared to baseline graph-based networks. Notably, the Point-GR model achieves a state-of-the-art scene segmentation mean IoU of 73.47% on the S3DIS benchmark dataset, showcasing its effectiveness. Furthermore, the model shows competitive results in Classification and Part-Segmentation tasks. © 2024, CC BY.

关键词： Graph neural networks

来源：评论

学校读者我要写书评

暂无评论

Image saliency detection method based on multi-feature maps fusion 4

Image saliency detection method based on multi-feature maps ...

引用

4th International Conference on computer Graphics, Image, and Virtualization, ICCGIV 2024

作者： Li, Xiaoli Liu, Yunpeng Zhao, Huaici Shenyang Institute of Automation Chinese Academy of Sciences Shenyang China Institutes for Robotics and Intelligent Manufacturing Chinese Academy of Sciences Shenyang China University of Chinese Academy of Sciences Beijing China Key Laboratory of Opto-Electronic Information Processing Shenyang China The Key Lab of Image Understanding and Computer Vision Shenyang China Shenyang Jianzhu University Shenyang China

ISBN: (纸本)9781510683242

In this research, we introduce an innovative saliency detection algorithm, comprising three essential steps. Firstly, leveraging fully convolutional networks with aggregation interaction modules, we generate an initial saliency map. Secondly, we extract hand-craft and deep features to express the image, then use manifold ranking method to construct saliency maps. Ultimately, by integrating the outcomes from preceding stages, we generate the final saliency map. Experimental findings demonstrate that our method surpasses twelve cutting-edge saliency detection techniques in terms of precision, recall, F-measure, and MAE value metrics." © 2024 SPIE.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Benchmarking Graph Representations and Graph Neural Networks for Multivariate Time Series Classification

arXiv

引用

arXiv 2025年

作者： Yang, Wennuo Wu, Shiling Zhou, Yuzhi Luo, Cheng He, Xilin Xie, Weicheng Shen, Linlin Song, Siyang Computer Vision Institute School of Computer Science & Software Engineering Shenzhen University China Shenzhen Institute of Artificial Intelligence and Robotics for Society China Guangdong Provincial Key Laboratory of Intelligent Information Processing China HBUG Lab University of Exeter United Kingdom

Multivariate Time Series Classification (MTSC) enables the analysis if complex temporal data, and thus serves as a cornerstone in various real-world applications, ranging from healthcare to finance. Since the relationship among variables in MTS usually contain crucial cues, a large number of graph-based MTSC approaches have been proposed, as the graph topology and edges can explicitly represent relationships among variables (channels), where not only various MTS graph representation learning strategies but also different Graph Neural Networks (GNNs) have been explored. Despite such progresses, there is no comprehensive study that fairly benchmarks and investigates the performances of existing widely-used graph representation learning strategies/GNN classifiers in the application of different MTSC tasks. In this paper, we present the first benchmark which systematically investigates the effectiveness of the widely-used three node feature definition strategies, four edge feature learning strategies and five GNN architecture, resulting in 60 different variants for graph-based MTSC. These variants are developed and evaluated with a standardized data pipeline and training/validation/testing strategy on 26 widely-used suspensor MTSC datasets. Our experiments highlight that node features significantly influence MTSC performance, while the visualization of edge features illustrates why adaptive edge learning outperforms other edge feature learning methods. The code of the proposed benchmark is publicly available at https://***/CVI-yangwn/*** Codes 68T10 © 2025, CC BY.

关键词： Graph neural networks

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Terahertz Image Restoration Based on CycleGan 3

Unsupervised Terahertz Image Restoration Based on CycleGan

引用

3rd International Conference on Defence Technology, ICDT 2022

作者： Su, Zhipeng Zhang, Yixiong Qi, Feng Shi, Jianghong School of Informatics Xiamen University National Demonstrative Software School Xiamen361005 China Shenyang Institute of Automation Chinese Academy of Sciences Shenyang110016 China Institutes for Robotics and Intelligent Manufacturing Chinese Academy of Sciences Shenyang110016 China Key Laboratory of Opto-Electronic Information Process Shenyang110016 China Key Laboratory of Image Understanding and Computer Vision Shenyang110016 China

Terahertz (THz) is considered as one of the key technologies for sixth generation communications, military, medical imaging and industrial inspection. THz images are susceptible to degradation due to system noise and point spread functions during transmission. The existing deep learning methods use ground truth and input images for supervised training that can recover THz images very well. But it's difficult to obtain labeled THz data in practical application. In this paper, we propose an attentional adversarial cycle generation network for THz image restoration (CycleTHz) based on CycleGan to address this problem. The CycleTHz generates clean images firstly by an attention-guided generation network and then discriminates the quality of the generators by an attention discriminator. In addition, RGB color loss is used for image channels for constraint. To the best of our knowledge, this is the first THz dataset to be trained using an unsupervised approach. Extensive experiments show that the proposed method improves the PSNR and SSIM by 43.4% and 101.7% compared with CycleGan, which is a benchmark method for the unsupervised development in THz image restoration. The code is available at https://***/hellogry/UnsupervisedCycleTHz © Published under licence by IOP Publishing Ltd.

关键词： Image reconstruction

来源：评论

学校读者我要写书评

暂无评论

Fast Candidate Region Extraction for SAR Ship Target 37

Fast Candidate Region Extraction for SAR Ship Target

引用

37th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2022

作者： Zhang, Panpan Luo, Haibo Xu, Zheng He, Miao Shenyang Institute of Automation Shenyang110016 China Institutes for Robotics and Intelligent Manufacturing Shenyang110016 China University of Chinese Academy of Sciences Beijing100049 China Key Laboratory of Opto-Electronic Information Processing Shenyang110016 China The Key Lab of Image Understanding and Computer Vision Shenyang110016 China

ISBN: (纸本)9781665465366

At present, deep learning technology is widely used in ship target detection in synthetic aperture radar (SAR) images. However, high-resolution remote sensing SAR images cover a larger area and have larger image sizes. To be able to use the deep learning model for training and testing, the image needs to be cropped to the appropriate size. In a high-resolution SAR ship image, the ship target usually takes only a small part of the whole image. As a result, only part of the cropped image contains the target, and most of the rest are background regions. This will cause a lot of computational redundancy in the model inference stage. To solve this problem, a fast candidate region extraction algorithm is proposed in this paper for ship target extraction. The algorithm consists of three parts: firstly, extract the saliency map of the SAR ship image, secondly, perform median filtering on the segmented image, and thirdly, extract candidate regions. The superiority of the algorithm was demonstrated by experiments on the AIR-SARShip-1.0 dataset. © 2022 IEEE.

关键词： Synthetic aperture radar

来源：评论

学校读者我要写书评

暂无评论

A Point-to-distribution Degeneracy Detection Factor for LiDAR SLAM using Local Geometric Models

A Point-to-distribution Degeneracy Detection Factor for LiDA...

引用

IEEE International Conference on robotics and Automation (ICRA)

作者： Sehua Ji Weinan Chen Zerong Su Yisheng Guan Jiehao Li Hong Zhang Haifei Zhu Biomimetic and Intelligent Robotics Lab (BIRL) School of Electromechanical Engineer Guangdong University of Technology Guangzhou China JT-Innovation (Guangdong) Intelligent Technology Co. Ltd. Guangdong Key Laboratory of Modern Control Technology Institute of Intelligent Manufacturing Guangdong Academy of Sciences Guangzhou China College of Engineering South China Agricultural University China Shenzhen Key Laboratory of Robotics and Computer Vision Southern University of Science and Technology China

ISBN: (数字)9798350384574

ISBN: (纸本)9798350384581

Limited by the working principles, LiDAR-SLAM systems suffer from the degeneration phenomenon in environments such as long corridors and tunnels, due to the lack of sufficient geometric features for frame-to-frame matching. The accuracy and sensitivity of existing degeneracy detection methods need to be further improved. In this paper, we propose a novel method for degeneracy detection using local geometric models based on point-to-distribution matching. To obtain an accurate description of local geometric models, an adaptive adjustment of voxel segmentation according to the point cloud distribution and density is designed. The codes of the proposed method is open-source and available at https://***/jisehua/***. Experiments with public datasets and self-build robots were conducted to evaluate the methods. The results exhibit that our proposed method achieves higher accuracy than the other existing approaches. Applying our proposed method is beneficial for improving the robustness of the LiDAR-SLAM systems.

关键词： Point cloud compression Accuracy Simultaneous localization and mapping Sensitivity Laser radar Geometric modeling Noise

来源：评论

学校读者我要写书评

暂无评论

Scene Consistency Representation Learning for Video Scene Segmentation

arXiv

引用

arXiv 2022年

作者： Wu, Haoqian Chen, Keyu Luo, Yanan Qiao, Ruizhi Ren, Bo Liu, Haozhe Xie, Weicheng Shen, Linlin Computer Vision Institute Shenzhen University China Tencent YouTu Lab Shenzhen Institute of Artificial Intelligence and Robotics for Society China Guangdong Key Laboratory of Intelligent Information Processing China KAUST Saudi Arabia

A long-term video, such as a movie or TV show, is composed of various scenes, each of which represents a series of shots sharing the same semantic story. Spotting the correct scene boundary from the long-term video is a challenging task, since a model must understand the storyline of the video to figure out where a scene starts and ends. To this end, we propose an effective Self-Supervised Learning (SSL) framework to learn better shot representations from unlabeled long-term videos. More specifically, we present an SSL scheme to achieve scene consistency, while exploring considerable data augmentation and shuffling methods to boost the model generalizability. Instead of explicitly learning the scene boundary features as in the previous methods, we introduce a vanilla temporal model with less inductive bias to verify the quality of the shot features. Our method achieves the state-of-the-art performance on the task of Video Scene Segmentation. Additionally, we suggest a more fair and reasonable benchmark to evaluate the performance of Video Scene Segmentation methods. The code is made available. © 2022, CC BY.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers

arXiv

引用

arXiv 2024年

作者： Xie, Jinxia Zhong, Bineng Mo, Zhiyi Zhang, Shengping Shi, Liangtao Song, Shuxiang Ji, Rongrong Key Laboratory of Education Blockchain and Intelligent Technology Ministry of Education China Guangxi Key Lab of Multi-Source Information Mining & Security China Guangxi Normal University Guilin541004 China Guangxi Key Laboratory of Machine Vision and Intelligent Control Wuzhou University China School of Computer Science and Technology Harbin Institute of Technology China Media Analytics and Computing Lab School of Informatics Xiamen University China

The rich spatio-temporal information is crucial to capture the complicated target appearance variations in visual tracking. However, most top-performing tracking algorithms rely on many hand-crafted components for spatio-temporal information aggregation. Consequently, the spatio-temporal information is far away from being fully explored. To alleviate this issue, we propose an adaptive tracker with spatio-temporal transformers (named AQATrack), which adopts simple autoregressive queries to effectively learn spatio-temporal information without many hand-designed components. Firstly, we introduce a set of learnable and autoregressive queries to capture the instantaneous target appearance changes in a sliding window fashion. Then, we design a novel attention mechanism for the interaction of existing queries to generate a new query in current frame. Finally, based on the initial target template and learnt autoregressive queries, a spatio-temporal information fusion module (STM) is designed for spatio-temporal formation aggregation to locate a target object. Benefiting from the STM, we can effectively combine the static appearance and instantaneous changes to guide robust tracking. Extensive experiments show that our method significantly improves the tracker’s performance on six popular tracking benchmarks: LaSOT, LaSOText, TrackingNet, GOT-10k, TNL2K, and *** and models will be here. Copyright © 2024, The Authors. All rights reserved.

关键词： Benchmarking

来源：评论

学校读者我要写书评

暂无评论

Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers

Autoregressive Queries for Adaptive Tracking with Spatio-Tem...

引用

Conference on computer vision and Pattern Recognition (CVPR)

作者： Jinxia Xie Bineng Zhong Zhiyi Mo Shengping Zhang Liangtao Shi Shuxiang Song Rongrong Ji Key Laboratory of Education Blockchain and Intelligent Technology Ministry of Education Guangxi Normal University Guilin China Guangxi Key Lab of Multi-Source Information Mining & Security Guangxi Normal University Guilin China Guangxi Key Laboratory of Machine Vision and Intelligent Control Wuzhou University School of Computer Science and Technology Harbin Institute of Technology Media Analytics and Computing Lab School of Informatics Xiamen University

ISBN: (数字)9798350353006

ISBN: (纸本)9798350353013

The rich spatio-temporal information is crucial to capture the complicated target appearance variations in visual tracking. However, most top-performing tracking algorithms rely on many hand-crafted components for spatio-temporal information aggregation. Consequently, the spatio-temporal information is far away from being fully explored. To alleviate this issue, we propose an adaptive tracker with spatio-temporal transformers (named AQA-Track), which adopts simple autoregressive queries to effectively learn spatio-temporal information without many hand-designed components. Firstly, we introduce a set of learnable and autoregressive queries to capture the instantaneous target appearance changes in a sliding window fashion. Then, we design a novel attention mechanism for the interaction of existing queries to generate a new query in current frame. Finally, based on the initial target template and learnt autoregressive queries, a spatio-temporal information fusion module (STM) is designed for spatiotemporal formation aggregation to locate a target object. Benefiting from the STM, we can effectively combine the static appearance and instantaneous changes to guide robust tracking. Extensive experiments show that our method significantly improves the tracker's performance on six popular tracking benchmarks: LaSOT, LaSOT ext , TrackingNet, GOT-10k, TNL2K, and UAV123. Code and models will be https://***/orgs/GXNU-Zhonglab.

关键词： Adaptation models Visualization computer vision Target tracking Computational modeling Transformers Market research

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：