检索结果-内蒙古大学图书馆

Increasing Interpretation of web topic detection via Prototype Learning From Sparse Poisson Deconvolution

IEEE TRANSACTIONS ON CYBERNETICS 2019年第3期49卷 1072-1083页

作者： Pang, Junbiao Hu, Anjing Huang, Qingming Tian, Qi Yin, Baocai Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing 100124 Peoples R China Univ Chinese Acad Sci Chinese Acad Sci Beijing 100049 Peoples R China Chinese Acad Sci Inst Comp Technol Beijing 100190 Peoples R China Univ Texas San Antonio Dept Comp Sci San Antonio TX 78249 USA Dalian Univ Technol Adv Invocat Ctr Future Internet Technol Dalian 116024 Peoples R China

Organizing webpages into interesting topics is one of the key steps to understand the trends from multimodal web data. The sparse, noisy, and less-constrained user-generated content results in inefficient feature representations. These descriptors unavoidably cause that a detected topic still contains a certain number of the false detected webpages, which further make a topic be less coherent, less interpretable, and less useful. In this paper, we address this problem from a viewpoint interpreting a topic by its prototypes, and present a two-step approach to achieve this goal. Following the detection-by-ranking approach, a sparse Poisson deconvolution is proposed to learn the intratopic similarities between webpages. To find the prototypes, leveraging the intratopic similarities, top-k diverse yet representative prototype webpages are identified from a submodularity function. Experimental results not only show the improved accuracies for the web topic detection task, but also increase the interpretation of a topic by its prototypes on two public datasets.

关键词： Poisson deconvolution prototype learning (PL) sparsity submodularity topic interpretation web topic detection

来源：评论

学校读者我要写书评

暂无评论

Robust Latent Poisson Deconvolution From Multiple Features for web topic detection

引用

IEEE TRANSACTIONS ON MULTIMEDIA 2016年第12期18卷 2482-2493页

作者： Pang, Junbiao Tao, Fei Zhang, Chunjie Zhang, Weigang Huang, Qingming Yin, Baocai Beijing Univ Technol Coll Metropolitan Transportat Beijing Key Lab Multimedia & Intelligent Software Beijing 100124 Peoples R China Univ Chinese Acad Sci Sch Comp & Control Engn Beijing 100049 Peoples R China Harbin Inst Technol Sch Comp Sci & Technol Weihai 264209 Peoples R China Univ Chinese Acad Sci Chinese Acad Sci Beijing 100049 Peoples R China Chinese Acad Sci Inst Comp Technol Beijing 100190 Peoples R China Dalian Univ Technol Adv Invocat Ctr Future Internet Technol Dalian 116024 Peoples R China Beijing Univ Technol Beijing 100124 Peoples R China

Detecting "hot" topics from the enormous user-generated content (UGC) data on web poses two main difficulties that the conventional approaches can barely handle: 1) poor feature representations from noisy images or short texts, and 2) uncertain roles of modalities where the visual content is either highly or weakly relevant to the textual cues due to the less-constrained UGC. In this paper, following the detection-by-ranking approach, we address above challenges by learning a robust latent representation from multiple, noisy and a high probability of the complementary features. Both the textual features and the visual ones are encoded into a k-nearest neighbor hybrid similarity graph (HSG), where nonnegative matrix factorization using random walk is introduced to generate topic candidates. An efficient fusion of multiple HSGs is then done by a latent poisson deconvolution, which consists of a poisson deconvolution with sparse basis similarity for each edge. Experiments show significantly improved accuracy of the proposed approach in comparison with the state-of-the-art methods on two public datasets.

关键词： K-nearest neighbor similarity graph latent poisson deconvolution (LPD) multi-view learning (MVL) user-generated content (UGC) web topic detection

来源：评论

学校读者我要写书评

暂无评论

Towards scalable topic detection on web via simulating Lévy walks nature of topics in similarity space

引用

INFORMATION SCIENCES 2025年 690卷

作者： Pang, Junbiao Huang, Qingming Beijing Univ Technol Fac Informat Technol 100 Pingleyuan Rd Beijing 100124 Peoples R China Univ Chinese Acad Sci Sch Comp & Control Engn 19 Yuquan Rd Beijing 100049 Peoples R China

Organizing a few webpages from social media into hot topics is one of the key steps to understand trends on web. Discovering popular yet hot topics from web faces a sea of noise webpages which never evolve into popular topics. In this paper, we discover that the similarity values between webpages in a popular topic contain the statistically similar features observed in L & eacute;vy walks. Consequently, we present a simple, novel, yet very powerful Explore-Exploit (EE) approach to group topics by simulating L & eacute;vy walks nature in the similarity space. The proposed EE-based topic clustering is an effective and efficient method which is a solid move towards handling a sea of noise webpages. Experiments on two public data sets demonstrate that our approach is not only comparable to the State-Of-The-Art (SOTA) methods in terms of effectiveness but also significantly outperforms the SOTA methods in terms of efficiency.

关键词： User-generated content web topic detection L & eacute vy walks Explore-exploit Noise robust clustering

来源：评论

学校读者我要写书评

暂无评论

Unsupervised web topic detection Using A Ranked Clustering-Like Pattern Across Similarity Cascades

引用

IEEE TRANSACTIONS ON MULTIMEDIA 2015年第6期17卷 843-853页

作者： Pang, Junbiao Jia, Fei Zhang, Chunjie Zhang, Weigang Huang, Qingming Yin, Baocai Beijing Univ Technol Coll Metropolitan Transportat Beijing Key Lab Multimedia Beijing 100124 Peoples R China Beijing Univ Technol Coll Metropolitan Transportat Intelligent Software Technol Beijing 100124 Peoples R China Univ Chinese Acad Sci Sch Comp & Control Engn Beijing 100049 Peoples R China Harbin Inst Technol Weihai Sch Comp Sci & Technol Weihai 264209 Peoples R China Chinese Acad Sci Univ Chinese Acad Sci Beijing 100049 Peoples R China Chinese Acad Sci Inst Comp Technol Beijing 100190 Peoples R China

Despite the massive growth of social media on the Internet, the process of organizing, understanding, and monitoring user generated content (UGC) has become one of the most pressing problems in today's society. Discovering topics on the web from a huge volume of UGC is one of the promising approaches to achieve this goal. Compared with classical topic detection and tracking in news articles, identifying topics on the web is by no means easy due to the noisy, sparse, and less-constrained data on the Internet. In this paper, we investigate methods from the perspective of similarity diffusion, and propose a clustering-like pattern across similarity cascades (SCs). SCs are a series of subgraphs generated by truncating a similarity graph with a set of thresholds, and then maximal cliques are used to capture topics. Finally, a topic-restricted similarity diffusion process is proposed to efficiently identify real topics from a large number of candidates. Experiments demonstrate that our approach outperforms the state-of-the-art methods on three public data sets.

关键词： Maximal clique Poisson deconvolution similarity cascade (SC) unsupervised ranking web topic detection

来源：评论

学校读者我要写书评

暂无评论

web topic detection USING A RANKED CLUSTERING-LIKE PATTERN ACROSS SIMILARITY CASCADES

WEB TOPIC DETECTION USING A RANKED CLUSTERING-LIKE PATTERN A...

引用

IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

作者： Jia, Fei Pang, Junbiao Zhang, Weigang Li, Guorong Zhang, Chunjie Huang, Qingming Liu, Yugui Univ Chinese Acad Sci Sch Comp & Control Engn Beijing Peoples R China Beijing Univ Technol Coll Metropolitan Transportat Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China Chinese Acad Sci Inst Comp Tech Key Lab Intell Info Proc Beijing 100864 Peoples R China Harbin Inst Technol Sch Comp Sci & Technol Harbin Peoples R China

ISBN: (纸本)9781479947614

In multi-media and social media communities, web topic detection poses two main difficulties that conventional approaches can barely handle: 1) there are large inter-topic variations among web topics;2) supervised information is rare to identify the real topics. In this paper, we address these problems from the similarity diffusion perspective among objects on web, and present a clustering-like pattern across similarity cascades (SCs). SCs are a series of subgraphs generated by truncating a weighted graph with a set of thresholds, and then maximal cliques are used to describe the topic candidates. Poisson deconvolution is adopted to efficiently identify the real topics from these topic candidates. Experiments demonstrate that our approach outperforms the state-of-the-arts on two datasets. In addition, we report accuracy v.s. false positives per topic (FPPT) curves for performance evaluation. To our knowledge, this is the first complete evaluation of web topic detection at the topic-wise level, and it establishes a new benchmark for this problem.

关键词： web topic detection maximal cliques unsupervised ranking Poisson process similarity cascade

来源：评论

学校读者我要写书评

暂无评论

web topic detection USING A RANKED CLUSTERING-LIKE PATTERN ACROSS SIMILARITY CASCADES

WEB TOPIC DETECTION USING A RANKED CLUSTERING-LIKE PATTERN A...

引用

IEEE International Conference on Multimedia and Expo

作者： Fei Jia Junbiao Pang Weigang Zhang Guorong Li Chunjie Zhang Qingming Huang Yugui Liu School of Computer and Control Engineering University of Chinese Academy of Sciences Beijing Key Laboratory of Multimedia and Intelligent Software Technology College of Metropolitan Transportation Beijing University of Technology School of Computer Science and Technology Harbin Institute of Technology

ISBN: (纸本)9781479947607

关键词： web topic detection Maximal cliques Unsupervised ranking Poisson process Similarity cascade

来源：评论

学校读者我要写书评

暂无评论

Accelerating topic detection on web for a Large-Scale Data Set via Stochastic Poisson Deconvolution 25th

Accelerating Topic Detection on Web for a Large-Scale Data S...

引用

25th International Conference on MultiMedia Modeling (MMM)

作者： Lin, Jinzhong Pang, Junbiao Su, Li Liu, Yugui Huang, Qingming Univ Chinese Acad Sci Sch Comp & Control Engn Beijing Peoples R China Beijing Univ Technol Fac Informat Technol Beijing Peoples R China Chinese Acad Sci Inst Comp Technol Beijing Peoples R China

ISBN: (纸本)9783030057107;9783030057091

Organizing webpages into hot topics is one of the key steps to understand the trends from multi-modal web data. To handle this pressing problem, Poisson Deconvolution (PD), a state-of-the-art method, recently is proposed to rank the interestingness of web topics on a similarity graph. Nevertheless, in terms of scalability, PD optimized by expectation-maximization is not sufficiently efficient for a large-scale data set. In this paper, we develop a Stochastic Poisson Deconvolution (SPD) to deal with the large-scale web data sets. Experiments demonstrate the efficacy of the proposed approach in comparison with the state-of-the-art methods on two public data sets and one large-scale synthetic data set.

关键词： Large-scale Poisson Deconvolution Unsupervised ranking web topic detection Surrogate function

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：