检索结果-内蒙古大学图书馆

2011 IEEE International Conference on Computer Science and Automation Engineering(CSAE 2011)

作者： Wang Chun-hong Nan Li-Li Ren Yao-Peng Computer Science and Technology ,Yun cheng University, Yun Cheng,044000

The text clustering based on Vector Space Model has problems, such as high-dimensional and sparse, unable to solve synonym and polyseme etc. And meanwhile, k-means clustering algorithm has shortcomings, which depends on the initial clustering center and needs to fix the number of clusters in advance. Aiming at these problems, in this paper, a text clustering algorithm based on Latent Semantic Analysis and Optimization is proposed. This algorithm can not only overcome the problems of Vector Space Model, but also can avoid the shortcomings of k-means algorithm. And compared with the text clustering algorithm based on Latent Semantic Analysis and the text clustering algorithm based on Vector Space Model and optimization, our algorithm is proved which can preferably improve the effect of text clustering, and upgrade the precision ratio and recall ration of text.

关键词： text clustering Vector Space Model Latent Semantic Analysis clustering optimization k-means clustering algorithm

来源：评论

学校读者我要写书评

暂无评论

Applying Hybrid kEPSO clustering to Web Pages 10

Applying Hybrid KEPSO Clustering to Web Pages

引用

48th ACM Annual Southeast Regional Conference (SE)

作者： Moh, Teng-Sheng Sabnis, Ameya San Jose State Univ Dept Comp Sci San Jose CA 95192 USA

ISBN: (纸本)9781450300643

Various optimization methods are used along with the standard clustering algorithms to make the clustering process simpler and quicker. In this paper we propose a new hybrid technique of clustering known as k-Evolutionary Particle Swarm Optimization (kEPSO) based on the concept of Particle Swarm Optimization (PSO). The proposed algorithm uses the k-means algorithm as the first step and the Evolutionary Particle Swarm Optimization (EPSO) algorithm as the second step to perform clustering. The experiments were performed using the clustering benchmark data. This method was compared with the standard k-means and EPSO algorithms. The results show that this method produced compact results and performed faster than other clustering algorithms. Later, the algorithm was used to cluster web pages. The web pages were clustered by first cleaning the unnecessary data and then labeling the obtained web pages to categorize them.

关键词： clustering algorithms k-means clustering algorithm Particle Swarm Optimization Data clustering Evolutionary Particle Swarm Optimization algorithm

来源：评论

学校读者我要写书评

暂无评论

Internet Public Opinion Hotspot Detection Research Based on k-means algorithm

Internet Public Opinion Hotspot Detection Research Based on ...

引用

1st International Conference on Swarm Intelligence

作者： Liu, Hong Li, Xiaojun Zhejiang Gongshang Univ Coll Comp & Informat Engn Hangzhou 310018 Zhejiang Peoples R China

ISBN: (纸本)9783642134975

Internet is becoming a spreading platform for the public opinion. It is important to grasp the internet public opinion (IPO) in time and understand the trends of their opinion correctly. Text mining plays a fundamental role in a number of information management and retrieval tasks. This paper studies internet public opinion hotspot detection using text mining approaches. First, we create an algorithm to obtain vector space model for all of text document. Second, this algorithm is combined with k-means clustering algorithm to develop unsupervised text mining approach. We use the proposed text mining approach to group the internet public opinion into various clusters, with the center of each representing a hotspot public opinion within the current time span. Through the result of the experiment, it shows that the efficiency and effectiveness of the algorithm using.

关键词： Internet public opinion k-means clustering algorithm vector space model text classification

来源：评论

学校读者我要写书评

暂无评论

Optimal Locations of Remote Radio Units in CoMP Systems for Energy Efficiency

Optimal Locations of Remote Radio Units in CoMP Systems for ...

引用

IEEE 72nd Vehicular Technology Conference Fall 2010

作者： Zhang, Congqing Zhang, Tiankui Zeng, Zhimin Cuthbert, Laurie Xiao, Lin Beijing Univ Posts & Telecommun Sch Informat & Telecommun Engn Beijing 100088 Peoples R China Univ London Sch Elect Engn & Comp Sci London WC1E 7HU England

ISBN: (纸本)9781424435746

In coordinated multi-point transmission (CoMP) systems, the optimal remote radio unit (RRU) location is analyzed theoretically and a RRU location design scheme for energy efficiency in practical scenarios is given. An average minimum access distance criterion is given for RRU location optimization. By minimizing the average distance between users and RRU, the optimal RRU distribution can be obtained when users are located uniformly in the cell. Taking into account the fact that user distribution will not be completely uniform in a practical environment, the k-means clustering algorithm is used to get the optimized RRU deployment in a practical user distribution. Simulation results show that the uplink transmission power can be greatly reduced with the RRU optimized location design in both the uniform and non-uniform user distribution.

关键词： CoMP energy efficiency RRU location optimization site design k-means clustering algorithm

来源：评论

学校读者我要写书评

暂无评论

Ant Colony Optimization for the k-means algorithm in Image Segmentation 10

Ant Colony Optimization for the K-means Algorithm in Image S...

引用

48th ACM Annual Southeast Regional Conference (SE)

作者： Hung, Chih-Cheng Sun, Mojia Southern Polytech State Univ 1100 S Marietta Pkwy Marietta GA 30060 USA

ISBN: (纸本)9781450300643

In this paper the ant colony optimization (ACO) is used in the k-means algorithm for improving the image segmentation. The learning mechanism of this algorithm is formulated by using the ACO meta-heuristic. As the pheromone dominates the exploration of ants for problem solutions, preliminary experiments on pheromone's update are reported. Two methods for defining and updating pheromone values are proposed and tested: one with the spatial coordinate distances and the other without using such a distance. The ACO improves the k-means algorithm by making it less dependent on the initial parameters.

关键词： k-means clustering algorithm Ant colony optimization Swarm Intelligence

来源：评论

学校读者我要写书评

暂无评论

Semi-supervised Rough Cost/Benefit Decisions

引用

FUNDAMENTA INFORMATICAE 2009年第2期94卷 233-244页

作者： Lingras, Pawan Chen, Min Miao, Duoqian St Marys Univ Dept Math & Comp Sci Halifax NS B3H 3C3 Canada Tongji Univ Sch Elect & Informat Engn Shanghai 201804 Peoples R China Jiao Tong Univ Minist Educ China Key Lab Embedded Syst & Serv Comp Shanghai 201804 Peoples R China

Most of the business decisions are based on cost and benefit considerations. Data mining techniques that make it possible for the businesses to incorporate financial considerations will be more meaningful to the decision makers. Decision theoretic framework has been helpful in providing a better understanding of classification models. This study describes a semi-supervised decision theoretic rough set model. The model is based on an extension of decision theoretic model proposed by Yao. The proposal is used to model financial cost/benefit scenarios for a promotional campaign in a real-world retail store.

关键词： Rough sets Rough approximation Probability Decision theory Cost/benefit analysis k-means clustering algorithm

来源：评论

学校读者我要写书评

暂无评论

An initialization method for the k-means algorithm using neighborhood model

引用

COMPUTERS & MATHEMATICS WITH APPLICATIONS 2009年第3期58卷 474-483页

作者： Cao, Fuyuan Liang, Jiye Jiang, Guang Shanxi Univ Sch Comp & Informat Technol Taiyuan 030006 Shanxi Peoples R China Minist Educ Key Lab Computat Intelligence & Chinese Informat Taiyuan 030006 Peoples R China Chinese Acad Sci Key Lab Intelligent Informat Proc Inst Comp Technol Beijing 100190 Peoples R China

As a simple clustering method, the traditional k-means algorithm has been widely discussed and applied in pattern recognition and machine learning. However, the k-means algorithm could not guarantee unique clustering result because initial cluster centers are chosen randomly. In this paper, the cohesion degree of the neighborhood of an object and the coupling degree between neighborhoods of objects are defined based on the neighborhood-based rough set model. Furthermore, a new initialization method is proposed, and the corresponding time complexity is analyzed as well. We study the influence of the three norms on clustering, and compare the clustering results of the k-means with the three different initialization methods. The experimental results illustrate the effectiveness of the proposed method. (C) 2009 Elsevier Ltd. All rights reserved.

关键词： Neighborhood model Initial cluster centers Cohesion degree Coupling degree k-means clustering algorithm

来源：评论

学校读者我要写书评

暂无评论

Application of an unsupervised artificial neural network technique to multivariant surface water quality data

引用

ECOLOGICAL RESEARCH 2009年第1期24卷 163-173页

作者： Cinar, Oezer Merdun, Hasan Kahramanmaras Sutcu Imam Univ Fac Engn & Architecture Dept Environm Engn TR-46060 Kahramanmaras Turkey Kahramanmaras Sutcu Imam Univ Fac Agr Dept Agr Engn TR-46060 Kahramanmaras Turkey

Surface water contamination from agricultural and urban runoff and wastewater discharges from industrial and municipal activities is of major concern to people worldwide. Classical models can be insufficient to visualise the results because the water quality variables used to describe dynamic pollution sources are complex, multivariable, and nonlinearly related. Artificial intelligence techniques with the ability to analyse multivariant water quality data by means of a sophisticated visualisation capacity can offer an alternative to current models. In this study, the kohonen self-organising feature maps (SOM) neural network was initially applied to analyse the complex nonlinear relationships among multivariable surface water quality variables using the component planes of the variables to determine the complex behaviour of water quality parameters. The dependencies between water quality variables were extracted and interpreted using the pattern analysis visualised in component planes. For further investigation, the k-means clustering algorithm was used to determine the optimal number of clusters by partitioning the maps and utilising the Davies-Bouldin clustering index, leading to seven groups or clusters corresponding to water quality variables. The results reveal that the concentrations of Na, k, Cl, NH4-N, NO2-N, o-PO4, component planes of organic matter (pV), and dissolved oxygen (DO) were significantly affected by seasonal changes, and that the SOM technique is an efficient tool with which to analyse and determine the complex behaviour of multidimensional surface water quality data. These results suggest that this technique could also be applied to other environmentally sensitive areas such as air and groundwater pollution.

关键词： kohonen self-organising feature maps clustering k-means clustering algorithm Surface water quality Variable dependencies

来源：评论

学校读者我要写书评

暂无评论

引用

作者： Veni, Rushikesh University of Nevada Las Vegas

学位级别：M.S.C.S.

Document clustering or unsupervised document classification is an automated process of grouping documents with similar content. A typical technique uses a similarity function to compare documents. In the literature, many similarity functions such as dot product or cosine measures are proposed for the comparison operator. For the thesis, we evaluate the effects a similarity function may have on clustering. We start by representing a document and a query, both as a vector of high-dimensional space corresponding to the keywords followed by using an appropriate distance measure in k-means to compute similarity between the document vector and the query vector to form clusters. Based on these clusters we decide the best distance metric for the document set used. Next, we compute time complexities for different similarity functions for the same model and document set based on the number of iterations and number of clusters.

关键词： Canberra distances Chi-Square Data mining Distances Document clustering Euclidean distances Execution time Information retrieval k-means clustering algorithm Similarity functions

来源：评论

学校读者我要写书评

暂无评论

Improved k-means clustering algorithm for exploring local protein sequence motifs representing common structural property

IEEE TRANSACTIONS ON NANOBIOSCIENCE

引用

IEEE TRANSACTIONS ON NANOBIOSCIENCE 2005年第3期4卷 255-265页

作者： Zhong, W Altun, G Harrison, R Tai, PC Pan, Y Georgia State Univ Dept Comp Sci Atlanta GA 30303 USA Georgia State Univ Dept Biol Atlanta GA 30303 USA

Information about local protein sequence motifs is very important to the analysis of biologically significant conserved regions of protein sequences. These conserved regions can potentially determine the diverse conformation and activities of proteins. In this work, recurring sequence motifs of proteins are explored with an improved k-means clustering algorithm on a new dataset. The structural similarity of these recurring sequence clusters to produce sequence motifs is studied in order to evaluate the relationship between sequence motifs and their structures. To the best of our knowledge, the dataset used by our research is the most updated dataset among similar studies for sequence motifs. A new greedy initialization method for the k-means algorithm is proposed to improve traditional k-means clustering techniques. The new initialization method tries to choose suitable initial points, which are well separated and have the potential to form high-quality clusters. Our experiments indicate that the improved k-means algorithm satisfactorily increases the percentage of sequence segments belonging to clusters with high structural similarity. Careful comparison of sequence motifs obtained by the improved and traditional algorithms also suggests that the improved k-means clustering algorithm may discover some relatively weak and subtle sequence motifs, which are undetectable by the traditional k-means algorithms. Many biochemical tests reported in the literature show that these sequence motifs are biologically meaningful. Experimental results also indicate that the improved k-means algorithm generates more detailed sequence motifs representing common structures than previous research. Furthermore, these motifs are universally conserved sequence patterns across protein families, overcoming some weak points of other popular sequence motifs. The satisfactory result of the experiment suggests that this new k-means algorithm may be applied to other areas of bioinformatics resea

关键词： k-means clustering algorithm protein structure sequence motif

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：