检索结果-内蒙古大学图书馆

5th IEEE International Conference on data science in Cyberspace, DSC 2020

作者： Zheng, Changmeng Peng, Qi Xu, Xuemiao South China University of Technology Key Laboratory of Big Data and Intelligent Robot School of Software Engineering Guangzhou China South China University of Technology School of Computer Science Engineering Guangzhou China

ISBN: (纸本)9781728195582

Effectively monitoring ships and discovering abnormal ship trajectory in time is necessary for marine traffic supervision. The basic work of discovering the ship's abnormal trajectory is to predict the ship's navigation dynamically. Previous works in ship trajectory prediction are basically concern on single-source data, for example, the AIS data. These methods ignore the relations between different sources which may improve the performance of predicting ship trajectory. We propose a neural sequence model based on heterogeneous multisource fusion for ship trajectory completion and prediction. Our method makes better utilization of AIS, GPS and ARPA radar information to predict ship trajectory precisely. We construct a dataset which contains about 8 million ship trajectory samples and the experiments demonstrate that our multi-source fusion model gains promising results. © 2020 IEEE.

关键词： Ships

来源：评论

学校读者我要写书评

暂无评论

Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph

arXiv

引用

arXiv 2024年

作者： Chen, Ziyang Zhang, Yongjun Li, Wenting Wang, Bingshu Zhao, Yong Chen, C.L. Philip College of Computer Science the State Key Laboratory of Public Big Data Guizhou University Guiyang550025 China School of Information Engineering Guizhou University of Commerce Guiyang550021 China School of Software Northwestern Polytechnical University Xi’an710129 China Key Laboratory of Integrated Microsystems Shenzhen Graduate School Peking University Shenzhen518055 China School of Computer Science and Engineering South China University of Technology Guangzhou510641 China

Real-world applications of stereo matching, such as autonomous driving, place stringent demands on both safety and accuracy. However, learning-based stereo matching methods inherently suffer from the loss of geometric structures in certain feature channels, creating a bottleneck in achieving precise detail matching. Additionally, these methods lack interpretability due to the black-box nature of deep learning. In this paper, we propose MoCha-V2, a novel learning-based paradigm for stereo matching. MoCha-V2 introduces the Motif Correlation Graph (MCG) to capture recurring textures, which are referred to as "motifs" within feature channels. These motifs reconstruct geometric structures and are learned in a more interpretable way. Subsequently, we integrate features from multiple frequency domains through wavelet inverse transformation. The resulting motif features are utilized to restore geometric structures in the stereo matching process. Experimental results demonstrate the effectiveness of MoCha-V2. MoCha-V2 achieved 1st place on the Middlebury benchmark at the time of its release. Code is available at here. Copyright © 2024, The Authors. All rights reserved.

关键词： Stereocenters

来源：评论

学校读者我要写书评

暂无评论

data Stream Classification Based on Extreme Learning Machine: A Review

引用

big data Research 2022年 30卷

作者： Zheng, Xiulin Li, Peipei Wu, Xindong Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology) Ministry of Education China School of Computer Science and Information Engineering Hefei University of Technology Hefei 230601 China Mininglamp Academy of Sciences Mininglamp Technology Beijing 100084 China

Many daily applications are generating massive amount of data in the form of stream at an ever higher speed, such as medical data, clicking stream, internet record and banking transaction, etc. In contrast to the traditional static data, data streams are of some inherent properties, to name a few, infinite length, concept drift, multiple labels and concept evolution. Among all the data mining tasks, classification is one of the basic topics in data stream mining and has gained more and more attentions among different research communities. Extreme Learning Machine (ELM) has drawn much interests in data classification due to its high efficiency, universal approximation capability, generalization ability, and simplicity, which have greatly inspired the development of many ELM-based algorithms and their applications during the past decades. In this paper, we mainly provide a comprehensive review on ELM theoretical research and its variants in data stream classification, and categorize these algorithms from different perspectives. Firstly, we briefly introduce the basic principles of ELM and its characteristics. Secondly, we give an overview of different ELM variants to address the particular issues of data stream classification. Thirdly, we present an overview of different strategies to optimize the ELM, which have further improved the stability, accuracy and generalization ability of ELM, and briefly introduce some practical applications of ELM in data stream classification. Finally, we conduct several groups of experiments to compare the performance of ELM based models addressing the focused issues. Also, the open issues and prospects of ELM models used for stream classification are discussed, which are worthwhile to be further studied in the future. © 2022

关键词： Classification data stream Extreme learning machine

来源：评论

学校读者我要写书评

暂无评论

Robust Low-rank Deep Feature Recovery in CNNs: Toward Low Information Loss and Fast Convergence

Robust Low-rank Deep Feature Recovery in CNNs: Toward Low In...

引用

IEEE International Conference on data Mining (ICDM)

作者： Jiahuan Ren Zhao Zhang Jicong Fan Haijun Zhang Mingliang Xu Meng Wang School of Computer Science and Information Engineering Hefei University of Technology Hefei China Key Laboratory of Knowledge Engineering with Big Data (Ministry of Education) & Intelligent Interconnected Systems Laboratory of Anhui Province Hefei University of Technology Hefei China School of Data Science The Chinese University of Hong Kong (Shenzhen) & Shenzhen Research Institute of Big Data Shenzhen China Harbin Institute of Technology (Shenzhen) Shenzhen China School of Information Engineering Zhengzhou University Zhengzhou China

ISBN: (纸本)9781665423991

Convolutional Neural Networks (CNNs)-guided deep models have obtained impressive performance for image representation, however the representation ability may still be restricted and usually needs more epochs to make the model converge in training, due to the useful information loss during the convolution and pooling operations. We therefore propose a general feature recovery layer, termed Low-rank Deep Feature Recovery (LDFR), to enhance the representation ability of the convolutional features by seamlessly integrating low-rank recovery into CNNs, which can be easily extended to all existing CNNs-based models. To be specific, to recover the lost information during the convolution operation, LDFR aims at learning the low-rank projections to embed the feature maps onto a low-rank subspace based on some selected informative convolutional feature maps. Such low-rank recovery operation can ensure all convolutional feature maps to be reconstructed easily to recover the underlying subspace with more useful and detailed information discovered, e.g., the strokes of characters or the texture information of clothes can be enhanced after LDFR. In addition, to make the learnt low-rank subspaces more powerful for feature recovery, we design a fusion strategy to obtain a generalized subspace, which averages over all learnt sub-spaces in each LDFR layer, so that the convolutional feature maps in test phase can be recovered effectively via low-rank embedding. Extensive results on several image datasets show that existing CNNs-based models equipped with our LDFR layer can obtain better performance.

关键词： Training Image recognition Convolution Conferences Image representation data mining Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

An improved LED junction temperature test method based on FCM 5

An improved LED junction temperature test method based on FC...

引用

2022 5th International Conference on Power Electronics and Control Engineering, ICPECE 2022

作者： Zhang, Jie Zhang, Junchao Xu, Bingshe Fu, Qiang Wang, Chen Zhao, Haodi College of Electrical and Power Engineering Taiyuan University of Technology Shanxi Taiyuan030024 China Key Laboratory of Interface Science and Engineering of New Materials. Ministry of Education Taiyuan University of Technology Shanxi Taiyuan030024 China Shanxi Electric Drive and Internet of Things Engineering Research Center Shanxi Taiyuan030024 China Intelligent City Lighting Data Sharing and Pub. Serv. Platform Eng. Res. Ctr. of 1331 Proj. in Shanxi Prov. Based on Big Data Shanxi Taiyuan030024 China Shanxi Intelligent Digital Lighting Union Laboratory Shanxi Taiyuan030024 China Shanxi Intelligent Digital Cultural Tourism Industry Research Institute Shanxi Taiyuan030024 China

High junction temperature (Tj) of LED will affect the performance and life of LED, so this paper proposes a high-power LED junction temperature test method. Mainly aiming at the problem that the accuracy of the forward current method (FCM) will be affected by the calibration procedure, equipment aging, and technical differences when batch testing LED Tj, this paper uses the relationship between relative current and Tj to calibrate the procedure based on this method, to limit the negative affection of sample heterogeneity. Ten LED samples were tested in the experiment, and the Tj errors under two test methods were obtained. The results showed that the improved method was more accurate than the traditional method. Then, the Tj error of the improved method was compared with that of the forward voltage method, and the error of the new method was smaller. © Published under licence by IOP Publishing Ltd.

关键词： Light emitting diodes

来源：评论

学校读者我要写书评

暂无评论

A hybrid approach using human posture and contour for gait recognition under body occlusion 7

A hybrid approach using human posture and contour for gait r...

引用

7th International Conference on Information science and Control Engineering, ICISCE 2020

作者： Ma, Yue Wei, Chenghao Long, Hao Key Laboratory of Impression Evidence Examination and Identification Technology Ministry of Public Security China Shenzhen University Big Data Institute College of Computer Science and Software Engineering China

ISBN: (纸本)9781728164069

This paper presents a hybrid method using a Gaitset network for gait recognition. We firstly use Alphpose model to obtain key points of human posture to improve the extraction of human contour under body occlusion. The human contour map is obtained by pose2Seg model with the key points of human posture. Both human posture and body contour are used as input of the network. By using the hybrid method, it can effectively improve the accuracy and robustness of gait recognition with the upper body information under occlusion. In scenes of human walking under body occlusion, our experimental results show that the accuracy of the proposed method in self-built database can reach 84.6% accuracy. © 2020 IEEE.

关键词： Gait analysis

来源：评论

学校读者我要写书评

暂无评论

Learning from multi-dimensional partial labels 29

Learning from multi-dimensional partial labels

引用

29th International Joint Conference on Artificial Intelligence, IJCAI 2020

作者： Wang, Haobo Liu, Weiwei Zhao, Yang Hu, Tianlei Chen, Ke Chen, Gang Key Lab of Intelligent Computing Based Big Data of Zhejiang Province Zhejiang University China College of Computer Science and Technology Zhejiang University China School of Computer Science Wuhan University China

ISBN: (纸本)9780999241165

Multi-dimensional classification (MDC) has attracted much attention from the community. Though most studies consider fully annotated data, in real practice obtaining fully labeled data in MDC tasks is usually intractable. In this paper, we propose a novel learning paradigm: MultiDimensional Partial Label Learning (MDPL) where the ground-truth labels of each instance are concealed in multiple candidate label sets. We first introduce the partial hamming loss for MDPL that incurs a large loss if the predicted labels are not in candidate label sets, and provide an empirical risk minimization (ERM) framework. Theoretically, we rigorously prove the conditions for ERM learnability of MDPL in both independent and dependent cases. Furthermore, we present two MDPL algorithms under our proposed ERM framework. Comprehensive experiments on both synthetic and real-world datasets validate the effectiveness of our proposals. © 2020 Inst. Sci. inf., Univ. Defence in Belgrade. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MATNet: MRI Super-Resolution with Multiple Attention Mechanisms

MATNet: MRI Super-Resolution with Multiple Attention Mechani...

引用

New Trends in Computational Intelligence (NTCI), International Conference on

作者： Longfeng Shen Fenglan Qin Yingjie Zhang Qiong Wang Bowen Hu Wei Zhao Anhui Engineering Research Center for Intelligent Computing and Application on Cognitive Behavior (ICACB) Hefei Comprehensive National Science Center Institute of Artificial Integence Hefei China School of Computer Science and Technology Huaibei Normal University Huaibei Anhui China Anhui Big-Data Research Center on University Management Huaibei Anhui China People's Hospital of Huaibei City Anhui China

High-resolution magnetic resonance imaging (MRI) provides a clear anatomical structure for diagnosis, however, its high cost makes it unsuitable in practice. On the contrary, low-resolution MRI cannot provide fine structural information but is resource and time efficient. Super-resolution methods are high-resolution images obtained from low-resolution MRI. However, existing MRI super-resolution methods suffer from the following defects: (1) CNN-based super-resolution methods lack the ability to build relationships for remote features. (2) While models using transformer based on self-attentive mechanism have achieved a breakthrough in super-resolution, the single self-attentive model fails to fully utilize the useful features of the input image. Thus, in this paper, an MRI super-resolution network (MATNet) based on a multi-attention mechanism is proposed that can utilize multiple useful image features from different dimensions for image super-resolution, improved super-resolution performance. To enable the network to obtain noise-free image features before sampling the image, we also added a denoising module to it. Finally, we prove the effectiveness of the model using the IXI public dataset.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Hierarchical Local-Global Transformer for Temporal Sentence Grounding

arXiv

引用

arXiv 2022年

作者： Fang, Xiang Liu, Daizong Zhou, Pan Xu, Zichuan Li, Ruixuan Hubei Engineering Research Center on Big Data Security School of Cyber Science and Engineering Huazhong University of Science and Technology Wuhan430074 China Wangxuan Institute of Computer fTechnology Peking University No. 128 Zhongguancun North Street Beijing100080 China School of software Dalian University of Technology Dalian116024 China School of Computer Science and Technology Huazhong University of Science and Technology Wuhan430074 China

This paper studies the multimedia problem of temporal sentence grounding (TSG), which aims to accurately determine the specific video segment in an untrimmed video according to a given sentence query. Traditional TSG methods mainly follow the top-down or bottom-up framework and are not end-to-end. They severely rely on time-consuming post-processing to refine the grounding results. Recently, some transformer-based approaches are proposed to efficiently and effectively model the fine-grained semantic alignment between video and query. Although these methods achieve significant performance to some extent, they equally take frames of the video and words of the query as transformer input for correlating, failing to capture their different levels of granularity with distinct semantics. To address this issue, in this paper, we propose a novel Hierarchical Local-Global Transformer (HLGT) to leverage this hierarchy information and model the interactions between different levels of granularity and different modalities for learning more fine-grained multi-modal representations. Specifically, we first split the video and query into individual clips and phrases to learn their local context (adjacent dependency) and global correlation (long-range dependency) via a temporal transformer. Then, a global-local transformer is introduced to learn the interactions between the local-level and global-level semantics for better multimodal reasoning. Besides, we develop a new cross-modal cycle-consistency loss to enforce interaction between two modalities and encourage the semantic alignment between them. Finally, we design a brand-new cross-modal parallel transformer decoder to integrate the encoded visual and textual features for final grounding. Extensive experiments on three challenging datasets (ActivityNet Captions, Charades-STA and TACoS) show that our proposed HLGT achieves a new state-of-the-art performance, demonstrating its effectiveness and computational efficiency. Copyright ©

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

MLS3RDUH: Deep unsupervised hashing via manifold based local semantic similarity structure reconstructing 29

MLS3RDUH: Deep unsupervised hashing via manifold based local...

引用

29th International Joint Conference on Artificial Intelligence, IJCAI 2020

作者： Tu, Rong-Cheng Mao, Xian-Ling Wei, Wei Department of Computer Science and Technology Beijing Institute of Technology China CETC Big Data Research Institute Co. Ltd. Guiyang55002 China Zhijiang Lab Hangzhou China School of Computer Science Huazhong University of Science and Technology China

ISBN: (纸本)9780999241165

Most of the unsupervised hashing methods usually map images into semantic similarity-preserving hash codes by constructing local semantic similarity structure as guiding information, i.e., treating each point similar to its k nearest neighbours. However, for an image, some of its k nearest neighbours may be dissimilar to it, i.e., they are noisy datapoints which will damage the retrieval performance. Thus, to tackle this problem, in this paper, we propose a novel deep unsupervised hashing method, called MLS3RDUH, which can reduce the noisy datapoints to further enhance retrieval performance. Specifically, the proposed method first defines a novel similarity matrix by utilising the intrinsic manifold structure in feature space and the cosine similarity of datapoints to reconstruct the local semantic similarity structure. Then a novel log-cosh hashing loss function is used to optimize the hashing network to generate compact hash codes by incorporating the defined similarity as guiding information. Extensive experiments on three public datasets show that the proposed method outperforms the state-of-the-art baselines. © 2020 Inst. Sci. inf., Univ. Defence in Belgrade. All rights reserved.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：