检索结果-内蒙古大学图书馆

Automatic Text Summarization Method Based on Improved textrank algorithm and K-Means Clustering

KNOWLEDGE-BASED SYSTEMS 2024年 287卷

作者： Liu, Wenjun Sun, Yuyan Yu, Bao Wang, Hailan Peng, Qingcheng Hou, Mengshu Guo, Huan Wang, Hai Liu, Cheng Univ Elect Sci & Technol China UESTC Sch Comp Sci & Engn Chengdu 611731 Peoples R China Xihua Univ Sch Comp & Software Engn Chengdu 610039 Peoples R China Chengdu Technol Univ Sch Big Data & Artificial Intelligence Chengdu 611730 Peoples R China 30th Res Inst China Elect Technol Grp Corp Sci & Technol Commun Secur Lab Chengdu 610041 Peoples R China

Automatic text summarization is to obtain a summary by compressing the text while retaining its important information. Then users can obtain the important content of the text by reading the summary. In the research literatures, the extraction summary method is widely used and is also one type of the main research methods of summary methods. However, this extraction summary method still has some problems. The selection of the initial cluster center has not been carefully determined, and the sentence redundancy summarized is high in articles with complex sentences. In order to solve the above problems, this paper proposes an automatic text summarization method based on improved textrank algorithm and K -Means clustering. This method combines the improved BM25 model and the textrank algorithm to calculate the BM25 similarity between sentences and obtain the TR scores of sentences. The TR scores are used to select the initial center of clustering based on similarity difference judgment and maximum judgment. The final summary is obtained by combining the cluster scores and sentence scores. The experimental results show that the proposed method in this paper has better evaluation indicators containing ROUGE -1, ROUGE -2 and ROUGE -L than other comparison algorithms including Lead -3, textrank and MBM25EMB on the DUC2004 dataset. In conclusion, the proposed method in this paper improves the accuracy of automatic text summarization and reduce the redundancy from documents.

关键词： Text Summarization Sentence Vector K -means Clustering Word Embedding textrank algorithm

来源：评论

学校读者我要写书评

暂无评论

Extractive summarization of telugu documents using textrank algorithm 4

Extractive summarization of telugu documents using textrank ...

引用

4th International Conference on IoT in Social, Mobile, Analytics and Cloud, ISMAC 2020

作者： Manjari, K Usha Institute of Aeronautical Engineering Department of Computer Science and Engineering Hyderabad India

ISBN: (纸本)9781728154640

Reading large and lengthy documents is a tedious and time-consuming task. A summary of the same document gives us an overall idea of what the document is all about. Automated summaries can be generated using various algorithms. The summary can be generated for single or multiple document inputs. Here the proposed system carries out extractive summarization of multiple Telugu text documents. The algorithm applied here is text rank algorithm. © 2020 IEEE.

关键词： Extractive Multi-document Telugu Text summarization textrank algorithm

来源：评论

学校读者我要写书评

暂无评论

A large-scale group decision making method with text mining and probabilistic linguistic complementation for energy transition path assessment

引用

RENEWABLE ENERGY 2025年 239卷

作者： Wang, Yaping Gao, Jianwei Liu, Huihui North China Elect Power Univ Sch Econ & Management Beijing 102206 Peoples R China North China Elect Power Univ Beijing Key Lab New Energy & Low Carbon Dev Beijing 102206 Peoples R China Chinese Acad Sci Inst Sci & Dev Beijing 100080 Peoples R China

Selecting a scientific, practical and efficient energy transition path is the key to solving the main contradiction in the energy industry. Considering text mining and probabilistic linguistic information complementation, a largescale group decision-making method is developed. Firstly, text mining technology is used to extract big data of public behavioral preference, so as to establish the evaluation criteria system of energy transition paths, and a criterion weighting model is proposed according to affinity coefficient and textrank algorithm. Then, experts are clustered based on the social trust network analysis, so that the missing probabilistic linguistic information of expert are completed. Next, the clusters with overlapping features or isolated nodes are optimized via the principle of minimum deviation, and the expert weights are modified by combining information similarity. Finally, the alternatives are ranked by the S-hyperbolic absolute risk aversion utility function. The proposed method is applied to the practical problem of evaluating China's energy transition paths under the dual-carbon goal, with "enhancing the use of clean energy" as the optimal path. The validity and practicability of the model is demonstrated through the multidimensional sensitivity analysis, and insights and suggestions are given in this field.

关键词： Large-scale group decision making textrank algorithm Social trust network analysis Probabilistic linguistic Energy transition path assessment

来源：评论

学校读者我要写书评

暂无评论

Multi-document hybrid text summarization with bi-LSTM RNN for Telugu language

引用

SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES 2024年第2期49卷 1-12页

作者： Babu, G. L. Anand Badugu, Srinivasu Anurag Univ Dept Informat Technol Hyderabad 500088 Telangana India Stanley Coll Engn & Technol Women Dept Comp Sci & Engn Hyderabad 500001 Telangana India

One of the most popular south Indian languages in India is the Telugu language which is currently spoken by 84 million native Telugu speakers in Andhra Pradesh and Telangana. With the rapid growth of the Telugu digital content, the need for the automatic text summarizer is arisen to provide short text from huge text documents. Extractive text summarization model generates only significant sentences. Abstractive text summarization method requires more training time. In this paper, a novel hybrid model is proposed for generating text summaries by combining extractive and abstractive approach to reduce the training time. For extractive method textrank algorithm is utilized and for abstractive method attention-based sequence to sequence model with bidirectional long short-term memory (Bi-LSTM) is utilized. Moreover, coverage mechanism is included into the proposed hybrid approach to reduce the repetition in summaries and to improve the quality of summaries. The performance of the proposed hybrid model is evaluated by the ROUGE toolkit in terms of F-measure, recall and precision. The results of the proposed model are compared with other existing models which shows that the proposed hybrid model outperforms other existing text summarization models for Telugu Language.

关键词： textrank algorithm sequence to sequence model bidirectional long short-term memory (Bi-LSTM) attention mechanism coverage mechanism ROUGE

来源：评论

学校读者我要写书评

暂无评论

ONTO-TDM domain ontology population for a specific discipline

引用

APPLIED ONTOLOGY 2024年第3期19卷 265-285页

作者： Abdoune, Rosana Lazib, Lydia Dahmani-Bouarab, Farida Fernandez-Breis, Jesualdo Tomas Mouloud Mammeri Univ Tizi Ouzou LARI Lab Tizi Algeria Mouloud Mammeri Univ Tizi Ouzou Comp Sci Dept Tizi Ouzou Algeria Univ Murcia Dept Informat & Sistemas IMIB Pascual Parrilla CEIR Campus Mare Nostrum Murcia 30100 Spain

Ontologies play a vital role in organizing and constructing knowledge across various domains, enabling effective knowledge management and sharing. The development of domain-specific ontologies, such as the ONTO-TDM ontology for teaching domain modeling, is essential for providing a comprehensive and standardized representation of knowledge within a given discipline. However, to maximize the usefulness and relevance of such ontologies, it is crucial to automate their population with domain-specific information, reducing manual work and ensuring scalability. This paper presents a novel method for ontology population by extracting and integrating relevant information from diverse sources. The method combines the textrank algorithm with Word2Vec to enhance keyword extraction, capturing both semantic meaning and textual importance. Keywords are then annotated and used to train a machine learning classifier, which aids in integrating new instances into the ontology. Experiments show that the proposed method achieves a precision of 63.33%, a recall of 61.29% and an F1-score of 62.28%, significantly improving keyword extraction and ontology population accuracy compared to existing methods. This validates the method's effectiveness in semi-automatically extracting relevant instances from diverse data sources, enhancing the efficiency and accuracy of ontology population, and advancing automated knowledge management in domain-specific contexts.

关键词： Ontology population ONTO-TDM ontology textrank algorithm Word2Vec keywords extraction machine learning classifier

来源：评论

学校读者我要写书评

暂无评论

Automated Keyword Extraction and Summarization for Romanian Texts 24

Automated Keyword Extraction and Summarization for Romanian ...

引用

International Conference on Automation, Quality and Testing, Robotics (AQTR)

作者： Lupea, M. I. Mocan, C. M. Nandra, C. I. Chifu, E. S. Tech Univ Cluj Napoca Comp Sci Dept Cluj Napoca Romania

ISBN: (纸本)9798350361940;9798350361933;9798350361919

This research explores the theoretical and practical aspects of two fundamental tasks in Natural Language Processing: keyword extraction and extractive summarization, with a focus on the Romanian language. The study investigates the textrank algorithm's application for identifying key terms and generating extractive summaries from texts in Romanian. The investigation reveals the algorithm's language independence, with minimal preprocessing requirements. The findings underscore the significance of automated text processing tools in enhancing information retrieval and document organization in Romanian. This study contributes to advancing Natural Language Processing methodologies and tools for Romanian language applications.

关键词： Natural Language Processing Keyword Extraction Extractive Summarization textrank algorithm

来源：评论

学校读者我要写书评

暂无评论

textrank Keyword Extraction Method Weighted by Multivariate Quantitative Indexes

TextRank Keyword Extraction Method Weighted by Multivariate ...

引用

International Conference on algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI)

作者： Luan, Xin Gao, Wenya Chen, Ming Song, Dalei Ocean Univ China Dept Informat Sci & Engn Qingdao 266100 Peoples R China Qingdao West Coast Dev Grp Co LTD Qingdao 266500 Peoples R China Ocean Univ China Coll Engn Qingdao 266100 Peoples R China

ISBN: (数字)9781510651890

ISBN: (纸本)9781510651890;9781510651883

In the process of keyword extraction, news text has its uniqueness. Keywords extraction of news text not only needs to pay attention to the difference of quantitative indexes of words, but also needs to consider the influence of phrases. In order to improve the keyword extraction effect of news texts, this paper constructs a keyword graph based on textrank, improves the probability transition matrix by combining four quantitative indicators of node frequency, location, span and part of speech, realizing the weight difference of words. Considering the influence of word segmentation technology on phrases extraction, the reconstruction of phrases is completed according to the law of recombination and the concept of combinatorial entropy is defined to realize the filtering of reconstructed phrases. According to the statistical quantitative index of phrases, the linear weighted value is assigned to the reconstructed phrases, and finally, the TopN words or phrases are selected as keywords according to their weight value. Experimental results show that the proposed algorithm is not only superior to the traditional textrank and TF-IDF algorithms, but also has great advantages compared with the improved PositionRank and MyWPMWRank algorithms, the F value of which can be increased by 9.75% at most, which effectively improves the keywords extraction effect of news text.

关键词： news text keyword extraction textrank algorithm multivariate feature quantitative indexes phrase combinatorial entropy

来源：评论

学校读者我要写书评

暂无评论

News Aggregator and Efficient Summarization System

引用

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS 2020年第6期11卷 636-641页

作者： Mohamed, Alaa Ibrahim, Marwan Yasser, Mayar Ayman, Mohamed Gamil, Menna Hassan, Walaa Misr Int Univ Fac Comp Sci Cairo Egypt

News Aggregator is simply an online software which collects new stories and events around the world from various sources all in one place. News aggregator plays a very important role in reducing time consumption, as all of the news that would be explored through more than one website will be placed only in a single location. Also, summarizing this aggregated content absolutely will save reader's time. A proposed technique used called the textrank algorithm that showed promising results for summarization. This paper presents the main goal of this project which is developing a news aggregator able to aggregate relevant articles of a certain input keyword or key-phrase. Summarizing the relevant articles after enhancing the text to give the reader understandable and efficient summary.

关键词： News aggregator text summarization text enhancement textrank algorithm

来源：评论

学校读者我要写书评

暂无评论

Text Keyword Extraction Based on Meta-Learning Strategy

Text Keyword Extraction Based on Meta-Learning Strategy

引用

International Conference on Big Data and Artificial Intelligence (BDAI)

作者： Yuan, Man Zou, Chenhong Northeast Petr Univ Sch Comp Sci & Informat Technol Daqing Peoples R China

ISBN: (纸本)9781538661369

Keyword extraction is an important content in many fields and it is a key step to achieve document retrieval, information retrieval, scientific and technological literature indexing, news reading, text clustering and classification, Machine Translation and so on. In order to improved the accuracy of keyword extraction for text, we put forward a framework of keyword extraction based on meta-learning. This framework not only integrates the keyword extraction algorithm selection, parameter adjustment algorithm, but also integrates a variety of algorithms. Experimental results show that keyword extraction based on meta learning is not only simple, but also significantly improved the accuracy of keyword extraction.

关键词： keyword extraction TF-IDF algorithm LDA algorithm textrank algorithm Meta learning

来源：评论

学校读者我要写书评

暂无评论

Research on the Evaluation System of Liquor Product Based on the Product Reviews

Research on the Evaluation System of Liquor Product Based on...

引用

China Marketing International Conference 2018 (2018中国市场营销国际学术年会)

作者： HAN Xiao-yun HUANG Di-yuan YU Wei-ping Business School Sichuan University Research Institute for Interdisciplinary Sciences Shanghai University of Finance and Economics

In this study,the data mining crawler technology was used to obtain the information of top 100 liquor list and more than 13 thousand related post-purchase review,liquor products was divided into six grades according to the *** textrank algorithm to work out the key words of reviews and their weights of each grade's products,then combined with the characteristics of online shopping consumption to establish an evaluation indicators of net purchase of liquor *** paper classifies the key words of each grade of liquor products by the indicators established before,thus acquiring the evaluation system of six grades'liquor by the *** on the results,the paper explores the differences in the concerns of consumers buying different liquor products at different prices and puts forward the corresponding marketing management suggestions.

关键词： Liquor Product Reviews textrank algorithm Features Extraction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：