检索结果-内蒙古大学图书馆

NB⁺: An improved naive bayesian algorithm

KNOWLEDGE-BASED SYSTEMS 2011年第5期24卷 563-569页

作者： Appavu alias Balamurugan Rajaram, Ramasamy Pramala, S. Rajalakshmi, S. Jeyendran, C. Prakash, J. Dinesh Surya Thiagarajar Coll Engn Dept Informat Technol Madurai 15 Tamil Nadu India

A novel algorithm named NB which is an extended version of the traditional naive bayesian algorithm has been presented in this paper. An exception occurs when there is an equal probability for the class label value in the naive bayesian algorithm. The approach aims to suggest a solution with the help of a partial matching method. Consequently, the classification accuracy has drastically improved. Experimental evaluation has been done on various databases to show that NB+ algorithm outperforms the traditional naive bayesian algorithm. (C) 2010 Elsevier B.V. All rights reserved.

关键词： Data mining Classification naive bayesian algorithm Conflicting data Lazy learning

来源：评论

学校读者我要写书评

暂无评论

A Method of Text Classification Combining naive Bayes and the Similarity Computing algorithms

A Method of Text Classification Combining Naive Bayes and th...

引用

Asia Pacific Web Conference on Web Technologies and Applications (APWeb)

作者： Hong, Yinghan Mai, Guizhen Zeng, Hui Guo, Cai Hanshan Normal Univ Chaozhou 521041 Gugangdong Peoples R China Guangdong Univ Technol Guangzhou 510006 Gugangdong Peoples R China

ISBN: (纸本)9783319281216;9783319281209

Text classification is one of the main issues in the big data analysis and research. In present, however, there is a lack of a universal algorithm model that can fulfill the requirement of both accuracy and efficiency of text classification. This paper proposes a method of text classification, which combines the naive Bayes and the similarity computing algorithm. Firstly, the text information is cut into several word segmentation vectors by the Paoding Analyzer;then the bayesian algorithm is employed to conduct the first-level directory classification to the text information;after that, the improved similarity computing algorithm is adopted to carry out the second-level directory classification. Finally, the algorithm model is tested with actual data, and the results are compared with those of bayesian algorithm and similarity computing algorithm respectively. The results show that the proposed method achieves a higher precision rate.

关键词： naive bayesian algorithm Similarity computing algorithm Precision

来源：评论

学校读者我要写书评

暂无评论

Feature weighted naive Bayes algorithm for information retrieval of enterprise systems

引用

ENTERPRISE INFORMATION SYSTEMS 2014年第1期8卷 107-120页

作者： Wang, Li Ji, Ping Qi, Jing Shan, Siqing Bi, Zhuming Deng, Weiguo Zhang, Naijing Beihang Univ Sch Econ & Management Beijing 100191 Peoples R China Inst Forest Resource Informat Beijing Peoples R China Indiana Univ Purdue Univ Dept Engn Ft Wayne IN 46818 USA Bank China Informat Ctr Beijing 10094 Peoples R China

Automated information retrieval is critical for enterprise information systems to acquire knowledge from the vast amount of data sets. One challenge in information retrieval is text classification. Current practices rely heavily on the classical naive Bayes algorithm due to its simplicity and robustness. However, results from this algorithm are not always satisfactory. In this article, the limitations of the naive Bayes algorithm are discussed, and it is found that the assumption on the independence of terms is the main reason for an unsatisfactory classification in many real-world applications. To overcome the limitations, the dependent factors are considered by integrating a term frequency-inverse document frequency (TF-IDF) weighting algorithm in the naive Bayes classification. Moreover, the TF-IDF algorithm itself is improved so that both frequencies and distribution information are taken into consideration. To illustrate the effectiveness of the proposed method, two simulation experiments were conducted, and the comparisons with other classification methods have shown that the proposed method has outperformed other existing algorithms in terms of precision and index recall rate.

关键词： enterprise information systems (EIS) information retrieval data mining text classification naive bayesian algorithm term frequency-inverse document frequency (TF-IDF)

来源：评论

学校读者我要写书评

暂无评论

The Improved naive bayesian WEB Text Classification algorithm

The Improved Naive Bayesian WEB Text Classification Algorith...

引用

1st International Symposium on Computer Network and Multimedia Technology

作者： Bai, Ping Li, Junqing Wuhan Univ Sci & Engn Coll Business Adm Wuhan 430073 Peoples R China Wuhan Univ Technol Sch Comp Sci & Technol Wuhan Peoples R China

ISBN: (纸本)9781424452729

It is a very important task that how to classify Web pages automatically and effectively in accordance with the given model for machine learning. The traditional operation modes, including artificial way and semiautomatic way, form category abstracts after domain experts' personnel inspection and then put the results into a particular class library according to the scheduled requirements. An improved naive bayesian WEB text classification algorithm is proposed in this paper. The common bayesian classifier assumes that all the items are equally important while in this paper the terms in each title are considered to be more important than others. Experiments showed that, the improved naive bayesian algorithm is more precise in the text classification.

关键词： Text Classification naive bayesian algorithm Theme text

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：