Search Results - Inner Mongolia University Library

IEEE International Conference on Data Mining (ICDM)

Authors: Yi He, Xu Yuan, Nian-Feng Tzeng, Xindong Wu. Affiliations: Center for Advanced Computer Studies, University of Louisiana Lafayette; Key Laboratory of Knowledge Engineering with Big Data, Hefei University of Technology; Mininglamp Academy of Sciences, Mininglamp Technology

ISBN: (digital) 9781728183169

ISBN: (print) 9781728183176

Predictive modeling of networked data finds many real-world applications, such as fraud detection in social networks, drug discovery in biomedical networks, paper topic classification in citation networks, and so forth. Although advanced machine learning approaches can help build reasonably accurate predictive models, their applicability is severely hindered by data labeling tasks, which are onerous, time-consuming, and error-prone. In this paper, we propose a novel active learning paradigm for networked data, named topology-and-content-aware (TACA) active learning, which aims to minimize the number of labels while achieving a desirable level of model accuracy. Overall, TACA advances existing work in two respects: (1) TACA makes no assumption about the network property, whereas most existing methods perform effectively only on a locally consistent network in which linked nodes are expected to share the same labels, and (2) TACA generates queries without relying on model performance, thereby yielding robust predictive results even when noise exists in the queried labels. Both theoretical and empirical evidence is presented, substantiating the effectiveness and optimism of our approach.

Keywords: Social networking (online); Neural networks; Predictive models; Data models; Labeling; Noise measurement; Task analysis

Who knows more? The role of structural hole spanners in accurate information identification on social media

Pacific Basin Finance Journal, 2025, Vol. 93

Authors: Guo, Man; Long, Wen; Zhong, Yanqiang; Zhang, Wei. Affiliations: School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China; Research Center on Fictitious Economy & Data Science, Chinese Academy of Sciences, Beijing 100190, China; Key Laboratory of Big Data Mining & Knowledge Management, Chinese Academy of Sciences, Beijing 100190, China; Department of Information Systems and Management Engineering, Southern University of Science and Technology, Shenzhen 518055, China

Based on structural hole theory, this study explores how the ability to provide accurate information differs among Chinese social media users with different characteristics. Using over 20 million interaction records from 2.19 million users, we construct a social network and identify structural hole spanners and ordinary users. As key players in a network, structural hole spanners are typically located at the intersections of different groups or information sources. They differ from ordinary users in the way they gather and disseminate information. The empirical results indicate that the accuracy of market judgements by structural hole spanners is at least 12.38% higher than that of ordinary users. Strategy simulations show that using information from structural hole spanners to make investment decisions can achieve a 116% cumulative return during the backtesting period, nearly four times that of ordinary users. This information advantage arises primarily because opinion distance and information uniqueness play significant roles in improving information accuracy. This study not only deepens the understanding of information dissemination characteristics among heterogeneous users on social media but also provides empirical support for the application of structural hole theory in finance, with important theoretical and practical implications. © 2024

Keywords: Accurate information; Social media; Stock market; Structural hole spanners

Balanced Tree Partitioning with Succinct Logic

IEEE International Conference on Big Knowledge (ICBK)

Authors: Xindong Wu, Shaojing Sheng, Peng Zhou. Affiliations: Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei, China; School of Computer Science and Technology, Anhui University, Hefei, China

ISBN: (digital) 9781728181561

ISBN: (print) 9781728181578

As a widely used data structure, graphs are good at characterizing data with internal associations, such as social and biological data. Tree-structured data are a special case and are widely used in many real-world applications, such as organizational structure analysis and genealogical knowledge graph reasoning. For example, in kinship knowledge graph analysis, when a genealogical tree is particularly large (more than 25 levels and 45,000 nodes), it is a great challenge to partition this large tree into a specified number of subtrees with succinct logic and a balanced number of nodes. Therefore, in this paper, we propose the tree partitioning algorithm (TPA) to achieve a balanced partition with succinct logic for large-scale tree-structured data. TPA first extracts all related nodes from a massive graph database and then constructs the convergent subgraph into a complete tree with a specified root node. Specifically, several virtual nodes are added for generation-skipping connected nodes to achieve correct node numbering and partitioning. Finally, a graph partitioning algorithm is executed on the complete tree to obtain a specified number of subtrees with succinct logic and balanced node counts. Experiments conducted on four real-world datasets verify the effectiveness of TPA.

Keywords: Partitioning algorithms; Data visualization; Convergence; Data analysis; Distributed databases; Big data

An efficient semismooth Newton method for adaptive sparse signal recovery problems

arXiv, 2021

Authors: Ding, Yanyun; Zhang, Haibin; Li, Peili; Xiao, Yunhai. Affiliations: Department of Operations Research and Information Engineering, Beijing University of Technology, Beijing 100124, China; School of Statistics, Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, East China Normal University, Shanghai 200062, China; School of Mathematics and Statistics, Henan University, Kaifeng 475000, China

It is known that compressive sensing can establish stable sparse recovery results from highly undersampled data under a restricted isometry property condition. In reality, however, numerous problems are coherent, and the vast majority of conventional methods may not work well. Recently, it was shown that using the difference between the l1- and l2-norms as a regularization always has superior performance. In this paper, we propose an adaptive lp-l1−2 model in which the lp-norm with p ≥ 1 measures the data fidelity and the l1−2 term measures the sparsity. The proposed model can deal with different types of noise and extract the sparse property even under highly coherent conditions. We use a proximal majorization-minimization technique to handle the nonconvex regularization term and then employ a semismooth Newton method to solve the corresponding convex relaxation subproblem. We prove that the sequence generated by the semismooth Newton method admits a fast local convergence rate to the subproblem solution under some technical assumptions. Finally, we conduct numerical experiments to demonstrate the superiority of the proposed model and the progressiveness of the proposed algorithm. Copyright © 2021, The Authors. All rights reserved.

Keywords: Compressed sensing

XCrossNet: Feature structure-oriented learning for click-through rate prediction

arXiv, 2021

Authors: Yu, Runlong; Ye, Yuyang; Liu, Qi; Wang, Zihan; Yang, Chunfeng; Hu, Yucheng; Chen, Enhong. Affiliations: Anhui Province Key Laboratory of Big Data Analysis and Application, School of Computer Science and Technology, University of Science and Technology of China, Hefei, China; Management Science and Information Systems, Rutgers Business School, Rutgers University, Newark, United States; MOE Key Laboratory of Computational Linguistics, School of Electronics Engineering and Computer Science, Peking University, Beijing, China; Tencent Inc, Shenzhen, China

Click-Through Rate (CTR) prediction is a core task in today's commercial recommender systems. Feature crossing, the main line of research on CTR prediction, has proven a promising way to enhance predictive performance. Even though various models are able to learn feature interactions without manual feature engineering, they rarely attempt to learn separate representations for different feature structures. In particular, they mainly focus on modeling cross sparse features but neglect to specifically represent cross dense features. Motivated by this, we propose a novel Extreme Cross Network, abbreviated XCrossNet, which aims at learning dense and sparse feature interactions in an explicit manner. As a feature structure-oriented model, XCrossNet leads to a more expressive representation and a more precise CTR prediction, and it is not only explicit and interpretable but also time-efficient and easy to implement. Experimental studies on the Criteo Kaggle dataset show significant improvement of XCrossNet over state-of-the-art models in both effectiveness and efficiency. © 2021, CC BY.

Keywords: Forecasting

Dictionary Pair-based Data-Free Fast Deep Neural Network Compression

IEEE International Conference on Data Mining (ICDM)

Authors: Yangcheng Gao, Zhao Zhang, Haijun Zhang, Mingbo Zhao, Yi Yang, Meng Wang. Affiliations: School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China; Key Laboratory of Knowledge Engineering with Big Data (Ministry of Education) & Intelligent Interconnected Systems Laboratory of Anhui Province, Hefei University of Technology, Hefei, China; Harbin Institute of Technology (Shenzhen), Xili University Town, Shenzhen, China; City University of Hong Kong, Hong Kong SAR; Centre for Artificial Intelligence, University of Technology Sydney, Sydney, NSW, Australia

ISBN: (print) 9781665423991

Deep neural network (DNN) compression can effectively reduce the memory footprint of deep networks, so that a deep model can be deployed on portable devices. However, most existing model compression methods, e.g., vector quantization or pruning, are time-consuming, which makes them ill-suited to real-world applications that need fast online computation. In this paper, we therefore explore how to accelerate the model compression process by reducing the computation cost. We then propose a new deep model compression method, termed Dictionary Pair-based Data-Free Fast DNN Compression, which aims at reducing the memory consumption of DNNs without extra training and can greatly improve compression efficiency. Specifically, our proposed method performs tensor decomposition on the DNN model with a fast dictionary pair learning-based reconstruction approach, which can be deployed on different layers (e.g., convolutional and fully connected layers). Given a pre-trained DNN model, we first divide the parameters (i.e., weights) of each layer into a series of partitions for dictionary pair-based fast reconstruction, which can potentially discover more fine-grained information and makes parallel model compression possible. Then, dictionaries that occupy less memory are learned to reconstruct the weights. Extensive experiments on popular DNNs (i.e., VGG-16, ResNet-18 and ResNet-50) showed that our proposed weight compression method can significantly reduce the memory footprint and speed up the compression process, with less performance loss.

Keywords: Deep learning; Training; Performance evaluation; Dictionaries; Tensors; Costs; Computational modeling

Corrigendum to “Do we measure novelty when we analyze unusual combinations of cited references? A validation study of bibliometric novelty indicators based on F1000Prime data” [Journal of Informetrics 13/4 (2019) 100979]

Journal of Informetrics, 2024, Vol. 18, No. 3

Authors: Lutz Bornmann, Alexander Tekles, Helena H. Zhang, Fred Y. Ye. Affiliations: Science Policy and Strategy Department, Administrative Headquarters of the Max Planck Society, Hofgartenstr. 8, 80539 Munich, Germany; University of Passau, Innstr. 41, 94032 Passau, Germany; Jiangsu Key Laboratory of Data Engineering and Knowledge Service, School of Information Management, Nanjing University, Nanjing 210023, China

A majorized-generalized alternating direction method of multipliers for convex composite programming

arXiv, 2021

Authors: Qin, Congying; Xiao, Yunhai; Li, Peili. Affiliations: School of Mathematics and Statistics, Henan University, Kaifeng 475000, China; Henan Engineering Research Center for Artificial Intelligence Theory and Algorithms, Henan University, Kaifeng 475000, China; School of Statistics, Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, East China Normal University, Shanghai 200062, China

Linearly constrained convex composite programming problems whose objective function contains two blocks, each of the form nonsmooth+smooth, arise frequently in many fields of application. If both smooth terms are quadratic, the problem can be solved efficiently by a proximal alternating direction method of multipliers (ADMM) based on the symmetric Gauss-Seidel (sGS) technique. However, in the non-quadratic case the sGS technique can no longer be used, so the separable nonsmooth+smooth structure has to be ignored. In this paper, we present a generalized ADMM and, in particular, use a majorization technique to make the corresponding subproblems more amenable to efficient computation. Under some appropriate conditions, we prove its global convergence for the relaxation factor in (0, 2). We apply the algorithm to a class of simulated convex composite optimization problems and to a type of sparse inverse covariance matrix estimation problem, and the results illustrate the effectiveness of the algorithm. Copyright © 2021, The Authors. All rights reserved.

Keywords: Covariance matrix

Deep Reinforcement Learning with Transformers for Text Adventure Games

IEEE Symposium on Computational Intelligence and Games, CIG

Authors: Yunqiu Xu, Ling Chen, Meng Fang, Yang Wang, Chengqi Zhang. Affiliations: Centre for Artificial Intelligence, University of Technology Sydney, Sydney, Australia; Tencent Robotics X; Key Laboratory of Knowledge Engineering with Big Data (Ministry of Education), Hefei University of Technology, China

ISBN: (digital) 9781728145334

ISBN: (print) 9781728145341

In this paper, we study transformers for text-based games. As a promising replacement for recurrent modules in Natural Language Processing (NLP) tasks, the transformer architecture can be treated as a powerful state representation generator for reinforcement learning. However, the vanilla transformer, with its huge number of weight parameters, is neither effective nor efficient to train. Unlike existing research that encodes states using LSTMs or GRUs, we develop a novel lightweight transformer-based representation generator featuring reordered layer normalization, weight sharing and block-wise aggregation. The experimental results show that our proposed model not only solves single games with far fewer interactions, but also achieves better generalization on a set of unseen games. Furthermore, our model outperforms state-of-the-art agents in a variety of man-made games.

Keywords: Games; Generators; Task analysis; Reinforcement learning; Logic gates; Buildings; Computer architecture

Cloud detection from visual band of satellite image based on variance of fractal dimension

Journal of Systems Engineering and Electronics, 2019, Vol. 30, No. 3, pp. 485-491

Authors: TIAN Pingfang, GUANG Qiang, LIU Xing. Affiliations: School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China; Key Laboratory of Intelligent Information Processing and Real-Time Industrial System in Hubei Province, Wuhan 430065, China; Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan 430065, China; Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content, National Press and Publication Administration, Beijing 100038, China

The cloud cover ratio is a very important factor affecting the quality of a satellite image; cloud detection from satellite images is therefore a necessary step in assessing image quality. This study develops cloud detection from the visual band of a satellite image. Firstly, we consider the differences between cloud and ground, including high grey level, good continuity of grey level, area of the cloud region, and the variance of local fractal dimension (VLFD) of the cloud region, and propose a single cloud region detection method. Secondly, by introducing a reference satellite image and comparing the variance in the dimensions corresponding to the reference and the tested images, we describe a method that detects multiple cloud regions and determines whether or not cloud exists in an image. The performance of the proposed method is demonstrated on several Ikonos images.

Keywords: cloud detection; visual image; satellite image; variance of local fractal dimension (VLFD)
