Latent Dirichlet allocation (LDA) is a topic model widely used for discovering hidden semantics in massive text corpora. Collapsed Gibbs sampling (CGS), as a widely-used algorithm for learning the parameters of LDA, has the risk of privacy leakage. Specifically, word count statistics and updates of latent topics in CGS, which are essential for parameter estimation, could be employed by adversaries to conduct effective membership inference attacks (MIAs). Till now, there are two kinds of methods exploited in CGS to defend against MIAs: adding noise to word count statistics and utilizing inherent privacy. These two kinds of methods have their respective limitations. Noise sampled from the Laplacian distribution sometimes produces negative word count statistics, which render terrible parameter estimation in practice. The inherent privacy could only provide weak guaranteed privacy when defending against MIAs. It is promising to propose an effective framework to obtain accurate parameter estimations with guaranteed differential privacy. The key issue of obtaining accurate parameter estimations when introducing differential privacy in CGS is making good use of the privacy budget such that a precise noise scale is derived. It is the first time that Rényi differential privacy (RDP) has been introduced into CGS, and we propose RDP-LDA, an effective framework for analyzing the privacy loss of any differentially private CGS. RDP-LDA can be used to derive a tighter upper bound on privacy loss than the overestimated results of existing differentially private CGS obtained by ε-DP. In RDP-LDA, we propose a novel truncated-Gaussian mechanism that keeps word count statistics positive. We also propose distribution perturbation, which could provide more rigorous guaranteed privacy than utilizing inherent privacy. Experiments validate that our proposed methods produce more accurate parameter estimation under the JS-divergence metric, and that MIAs against them achieve lower precision and recall.
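As a rough illustration of the count-perturbation idea described above, the sketch below adds truncated (non-negative) Gaussian noise to a topic-word count matrix. The function name, the rejection-sampling loop, and the noise scale `sigma` are illustrative assumptions, not the paper's actual RDP-LDA mechanism or its calibration.

```python
import numpy as np

def truncated_gaussian_counts(counts, sigma, rng=None):
    """Perturb non-negative word count statistics with Gaussian noise,
    re-drawing any entry that would become negative (illustrative only;
    the real RDP-LDA mechanism and its noise calibration differ)."""
    rng = np.random.default_rng() if rng is None else rng
    noisy = counts + rng.normal(0.0, sigma, size=counts.shape)
    # Re-sample noise for entries pushed below zero so that the released
    # statistics stay valid (non-negative) for CGS parameter estimation.
    while np.any(noisy < 0):
        mask = noisy < 0
        noisy[mask] = counts[mask] + rng.normal(0.0, sigma, size=mask.sum())
    return noisy

# Example: a small topic-word count matrix (topics x vocabulary).
counts = np.array([[5.0, 0.0, 2.0], [1.0, 3.0, 0.0]])
print(truncated_gaussian_counts(counts, sigma=1.0))
```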
Part-of-Speech (POS) tagging is a basic task in the field of natural language processing. This paper builds a POS tagger based on an improved hidden Markov model, by employing word clustering and syntactic parsing information. Firstly, in order to overcome the defects of the classical HMM, a new statistical model, the Markov family model (MFM), was introduced. Secondly, to solve the problem of data sparseness, we propose a bottom-up hierarchical word clustering algorithm. Then we combine syntactic parsing with POS tagging. The POS tagging experiments show that the improved POS tagging model has higher performance than hidden Markov models (HMMs) under the same testing conditions; the precision is enhanced from 94.642% to 97.235%.
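For readers unfamiliar with HMM-based tagging, the following minimal Viterbi decoder shows the classical baseline that the improved model builds on. The toy tag set and probability tables are made-up values, and none of this reflects the Markov family model or the clustering and parsing extensions described above.

```python
import math

def viterbi(words, tags, start_p, trans_p, emit_p):
    """Standard HMM Viterbi decoding: return the most likely tag sequence
    for `words` (log probabilities avoid numerical underflow)."""
    V = [{t: (math.log(start_p[t]) + math.log(emit_p[t].get(words[0], 1e-6)), None)
          for t in tags}]
    for w in words[1:]:
        V.append({t: max(((V[-1][p][0] + math.log(trans_p[p][t])
                           + math.log(emit_p[t].get(w, 1e-6))), p)
                         for p in tags)
                  for t in tags})
    # Backtrack from the best final state.
    best = max(tags, key=lambda t: V[-1][t][0])
    path = [best]
    for layer in reversed(V[1:]):
        path.append(layer[path[-1]][1])
    return list(reversed(path))

# Toy example with two tags and hand-picked probabilities (illustrative only).
tags = ["N", "V"]
start_p = {"N": 0.6, "V": 0.4}
trans_p = {"N": {"N": 0.3, "V": 0.7}, "V": {"N": 0.8, "V": 0.2}}
emit_p = {"N": {"dogs": 0.4, "bark": 0.1}, "V": {"dogs": 0.05, "bark": 0.5}}
print(viterbi(["dogs", "bark"], tags, start_p, trans_p, emit_p))  # ['N', 'V']
```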
The cross-domain knowledge diffusion from science to policy is a prevalent phenomenon that demands academic attention. To investigate the characteristics of cross-domain knowledge diffusion from science to policy, this study suggests using the citations of policies to scientific articles as a basis for quantifying the diffusion strength, breadth, and speed. The study reveals that the strength and breadth of cross-domain knowledge diffusion from scientific papers to policies conform to a power-law distribution, while the speed follows a log-normal distribution. Moreover, the papers with the highest diffusion strength and breadth and the fastest diffusion speed are predominantly from world-renowned universities, scholars, and top journals. The papers with the highest diffusion strength and breadth are mostly from the social sciences, especially economics, while those with the fastest diffusion speed are mainly from the medical and life sciences, followed by the social sciences. The findings indicate that cross-domain knowledge diffusion from science to policy follows the Matthew effect, whereby individuals or institutions with high academic achievements are more likely to achieve successful cross-domain knowledge diffusion. Furthermore, papers in the field of economics tend to have higher cross-domain knowledge diffusion strength and breadth, while those in the medical and life sciences have faster cross-domain knowledge diffusion speed.
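As a rough sketch of how such indicators could be computed from policy-citation records, the snippet below counts citing policies (strength), distinct policy-issuing bodies (breadth), and the lag to the first policy citation (speed). The record layout and these specific operationalizations are illustrative assumptions, not the study's exact definitions.

```python
from datetime import date

# Hypothetical policy-citation records for one paper: (policy_body, citation_date).
citations = [
    ("WHO", date(2021, 3, 1)),
    ("EU Commission", date(2022, 7, 15)),
    ("WHO", date(2023, 1, 10)),
]
published = date(2020, 6, 1)

strength = len(citations)                                # number of citing policies
breadth = len({body for body, _ in citations})           # distinct policy-issuing bodies
speed = min((d - published).days for _, d in citations)  # days until first policy citation

print(strength, breadth, speed)
```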
A network of many sensors and a base station deployed over a region is considered. Each sensor has a transmission range, an interference range and a carrier sensing range, which are r, αr and βr, respectively. In this paper, we study the minimum-latency conflict-aware many-to-one data aggregation scheduling problem: given the locations of the sensors and the base station, a subset of all sensors, and the parameters r, α and β, find a schedule in which the data of each sensor in the subset can be transmitted to the base station with no conflicts, such that the latency is minimized. We design an algorithm based on maximal independent sets, which has a latency bound of (a+19b)R + Δb − a + 5 time slots, where a and b are two constant integers depending on α and β, Δ is the maximum degree of the network topology, and R is a trivial lower bound on the latency. Since Δ contributes only an additive factor instead of a multiplicative one, our algorithm achieves a nearly constant approximation ratio of (a+19b).
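To give a concrete picture of the maximal-independent-set building block the algorithm relies on, here is a simple greedy MIS construction over a disk-graph model of the sensors. The coordinates, the range, and the greedy pick order are illustrative; the actual scheduling layers and conflict rules are as defined in the paper.

```python
import math

def maximal_independent_set(positions, r):
    """Greedy maximal independent set on the communication graph in which
    two sensors are adjacent iff their distance is at most r (illustrative)."""
    remaining = set(positions)
    mis = []
    while remaining:
        u = min(remaining)                 # any deterministic pick yields a MIS
        mis.append(u)
        remaining = {v for v in remaining
                     if math.dist(u, v) > r}   # drop u and all of its neighbours
    return mis

# Toy deployment: sensor coordinates with transmission range r = 1.0.
sensors = [(0.0, 0.0), (0.5, 0.0), (2.0, 0.0), (2.0, 2.0)]
print(maximal_independent_set(sensors, r=1.0))
```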
Recent research has demonstrated how the widespread adoption of collaborative tagging systems yields emergent semantics. In recent years, much has been learned about how to harvest the data produced by taggers for engineering light-weight ontologies. For example, existing measures of tag similarity and tag relatedness have proven to be crucial stepping stones for making latent semantic relations in tagging systems explicit. However, little progress has been made on other issues, such as understanding the different levels of tag generality (or tag abstractness), which is essential for, among others, identifying hierarchical relationships between concepts. In this paper we aim to address this gap. Starting from a review of linguistic definitions of word abstractness, we first use several large-scale ontologies and taxonomies as grounded measures of word generality, including Yago, WordNet, DMOZ and Wikitaxonomy. Then, we introduce and apply several folksonomy-based methods to measure the level of generality of given tags. We evaluate these methods by comparing them with the grounded measures. Our results suggest that the generality of tags in social tagging systems can be approximated with simple measures. Our work has implications for a number of problems related to social tagging systems, including search, tag recommendation, and the acquisition of light-weight ontologies from tagging data.
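As one simple folksonomy-based proxy for tag generality in the spirit described above, the snippet below computes the entropy of each tag's co-occurrence distribution: more general tags tend to co-occur with a broader, more even set of other tags. This particular measure and the toy posts are illustrative assumptions rather than the paper's evaluated methods.

```python
import math
from collections import Counter

# Toy tag assignments: each post is the set of tags one user attached to a resource.
posts = [
    {"music", "jazz"}, {"music", "rock"}, {"music", "guitar"},
    {"jazz", "saxophone"}, {"rock", "guitar"},
]

def cooccurrence_entropy(tag, posts):
    """Entropy of the distribution of tags co-occurring with `tag`
    (used here as a crude generality score: higher = more general)."""
    co = Counter(t for post in posts if tag in post for t in post if t != tag)
    total = sum(co.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in co.values())

for tag in ["music", "jazz", "saxophone"]:
    print(tag, round(cooccurrence_entropy(tag, posts), 3))
```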
Despite its success, similarity-based collaborative filtering suffers from some limitations, such as scalability, sparsity and recommendation accuracy. Previous work has shown that incorporating a trust mechanism into traditional collaborative filtering recommender systems can alleviate these limitations. We argue that trust-based recommender systems face a novel recommendation attack which is different from the profile injection attacks in traditional recommender systems. To the best of our knowledge, there has not been any prior study on recommendation attacks in a trust-based recommender system. We analyze the attack problem, and find that "victim" nodes play a significant role in the attack. Hence, we propose a data provenance method to trace malicious users and identify the "victim" nodes as distrusted users of the recommender system. A study of the defense method is conducted with a dataset crawled from the Epinions website.
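The provenance idea can be pictured with a very small sketch: given a set of identified malicious users, scan the recorded trust edges and mark the users who placed trust in those profiles as "victim" (distrusted) nodes. The data layout, the direction of the trust edge, and this one-hop rule are assumptions for illustration, not the paper's actual provenance method.

```python
# Hypothetical trust graph: trust_edges[u] = set of users that u trusts.
trust_edges = {
    "alice": {"attacker1", "carol"},
    "bob": {"attacker1"},
    "carol": {"dave"},
    "dave": set(),
}
malicious = {"attacker1"}

# Users who established a trust relationship with a malicious profile are
# flagged as "victim" nodes and treated as distrusted by the recommender.
victims = {u for u, trusted in trust_edges.items() if trusted & malicious}
print(victims)  # {'alice', 'bob'}
```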
Dear editor, This letter presents an unsupervised feature selection method based on machine learning. Feature selection is an important component of artificial intelligence and machine learning, which can effectively alleviate the curse of dimensionality problem. However, most of the labeled data is expensive to obtain.
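A minimal example of the unsupervised setting, assuming nothing about the letter's specific method: rank features by their variance computed from unlabeled data and keep the top k, which requires no labels at all. The toy matrix and the variance criterion are illustrative.

```python
import numpy as np

def select_top_variance_features(X, k):
    """Unsupervised feature selection baseline: keep the k columns of X
    with the largest variance (no labels required)."""
    variances = X.var(axis=0)
    keep = np.sort(np.argsort(variances)[::-1][:k])
    return keep, X[:, keep]

# Toy unlabeled data: 4 samples x 3 features; the middle feature is constant.
X = np.array([[1.0, 5.0, 0.2],
              [2.0, 5.0, 0.9],
              [3.0, 5.0, 0.4],
              [4.0, 5.0, 0.8]])
idx, X_reduced = select_top_variance_features(X, k=2)
print(idx, X_reduced.shape)  # [0 2] (4, 2)
```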
Recently, there has been growing interest in the applications of wireless sensor networks. Given a query point, which is a value, find a set of K nodes whose values are nearest to this point. We call this query the v...
Data partitioning techniques are pivotal for optimal data placement across storage devices, thereby enhancing resource utilization and overall system performance. However, the design of effective partition schemes faces multiple challenges, including considerations of the cluster environment, storage device characteristics, optimization objectives, and the balance between partition quality and computational overhead. Additionally, dynamic environments necessitate robust partition detection mechanisms. This paper presents a comprehensive survey structured around partition deployment environments, outlining the distinguishing features and applicability of various partitioning strategies while delving into how these challenges are addressed. We discuss partitioning features pertaining to database schema, table data, workload, and runtime information. We then delve into the partition generation process, segmenting it into initialization and optimization stages. A comparative analysis of partition generation and update algorithms is provided, emphasizing their suitability for different scenarios and optimization objectives. Finally, we illustrate the applications of partitioning in prevalent database products and suggest potential future research directions and challenges. This survey aims to foster the implementation, deployment, and updating of high-quality partitions for specific system scenarios.
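To make the basic notion of a partition scheme concrete, the snippet below assigns rows of a table to partitions by hash and by range on a chosen key. The key choice, partition count, and range boundaries are illustrative examples of the decisions the surveyed techniques optimize, not any specific system's scheme.

```python
import bisect

rows = [{"id": i, "amount": a} for i, a in enumerate([5, 42, 17, 99, 63, 8])]

def hash_partition(rows, key, n_parts):
    """Assign each row to one of n_parts partitions by hashing its key."""
    parts = [[] for _ in range(n_parts)]
    for row in rows:
        parts[hash(row[key]) % n_parts].append(row)
    return parts

def range_partition(rows, key, boundaries):
    """Assign each row to a partition by where its key falls among sorted boundaries."""
    parts = [[] for _ in range(len(boundaries) + 1)]
    for row in rows:
        parts[bisect.bisect_right(boundaries, row[key])].append(row)
    return parts

print([len(p) for p in hash_partition(rows, "id", 3)])
print([len(p) for p in range_partition(rows, "amount", [20, 60])])
```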
This paper presents exploratory subgroup analytics on ubiquitous data: We propose subgroup discovery and assessment approaches for obtaining interesting descriptive patterns and provide a novel graph-based analysis app...