Search results - Inner Mongolia University Library

Some refinements of rough k-means clustering

Pattern Recognition, 2006, vol. 39, no. 8, pp. 1481-1491

Author: Peters, Georg (Munich Univ Appl Sci, Dept Comp Sci Math, D-80335 Munich, Germany)

Lingras et al. proposed a rough cluster algorithm and successfully applied it to web mining. In this paper we analyze their algorithm with respect to its objective function, numerical stability, the stability of the clusters, and other properties. Based on this analysis, a refined rough cluster algorithm is presented. The refined algorithm is applied to synthetic, forest, and microarray gene expression data. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.

Keywords: cluster algorithms; rough k-means; soft computing; data analysis; forest data; bioinformatics data

Optimistic bias in the assessment of high dimensional classifiers with a limited dataset

International Joint Conference on Neural Networks (IJCNN)

Authors: Chen, Weijie; Brown, David G. (US FDA, Silver Spring, MD 20993, USA)

ISBN: 9781424496365 (print)

It is commonly recognized that using the same dataset for training and testing a classifier introduces optimistic bias in estimating classifier performance. However, bias of the same kind may still exist even when independent datasets are used for training and testing. This problem is especially important in the setting of a high dimensional feature space and limited data. Bioinformatics data is typically characterized by a tremendous amount of data per patient but from a limited number of patients. Often the entire data set is utilized in a "pre-training" stage during which the feature set is winnowed to a manageable number and the parameters of the training algorithm are established. Subsequently the data is split into training and test sets; however, bias has already been introduced into the classifier development process. We investigate the significance of this bias by performing simulated gene expression experiments. We find that, for data with moderate intrinsic separability and modest sample size, any observed separation is due to selection bias introduced in the aforementioned pre-training process. For greater intrinsic separability, correct data hygiene, i.e., complete separation of development and validation data, yields a positive result, but one far less impressive than that mistakenly obtained using incomplete data separation.

Keywords: Breast cancer; Classification algorithms; Covariance matrix; Measurement; Signal to noise ratio; Testing; Training; bioinformatics; bioinformatics data; classifier development process; gene expression; genetics; high dimensional classifier; optimistic bias; pattern classification; training algorithm
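
The pre-training leak described in the abstract above can be illustrated with a small simulation. This is a minimal sketch and not the authors' experiment: the pure-noise data, the mean-difference feature ranking, the nearest-centroid classifier, and every name and size below are assumptions chosen only for illustration. Because the features carry no class information, a clean protocol should score near chance, while selecting features on the full dataset before splitting should look better than chance.

```python
import random

def make_noise(rng, n_per_class, n_features):
    """Pure-noise data: no feature carries any class information (assumption)."""
    n = 2 * n_per_class
    X = [[rng.gauss(0.0, 1.0) for _ in range(n_features)] for _ in range(n)]
    y = [0] * n_per_class + [1] * n_per_class
    return X, y

def class_means(X, y, rows, feats):
    """Per-class mean of each feature in `feats`, computed from `rows` only."""
    means = {}
    for label in (0, 1):
        members = [i for i in rows if y[i] == label]
        means[label] = [sum(X[i][j] for i in members) / len(members) for j in feats]
    return means

def select_features(X, y, rows, k):
    """Keep the k features with the largest absolute class-mean difference."""
    feats = list(range(len(X[0])))
    m = class_means(X, y, rows, feats)
    score = [abs(a - b) for a, b in zip(m[0], m[1])]
    return sorted(feats, key=lambda j: score[j], reverse=True)[:k]

def nearest_centroid_accuracy(X, y, train, test, feats):
    """Fit class centroids on `train`, report accuracy on `test`."""
    m = class_means(X, y, train, feats)
    correct = 0
    for i in test:
        dist = {c: sum((X[i][j] - mu) ** 2 for j, mu in zip(feats, m[c]))
                for c in (0, 1)}
        pred = 0 if dist[0] <= dist[1] else 1
        correct += (pred == y[i])
    return correct / len(test)

def run(seed=0, reps=30, n_per_class=20, n_features=500, k=10):
    """Average test accuracy for leaky vs. clean feature selection."""
    rng = random.Random(seed)
    half = n_per_class // 2
    # Fixed stratified split: first half of each class trains, the rest tests.
    train = list(range(half)) + list(range(n_per_class, n_per_class + half))
    test = [i for i in range(2 * n_per_class) if i not in train]
    leaky, clean = 0.0, 0.0
    for _ in range(reps):
        X, y = make_noise(rng, n_per_class, n_features)
        # Leaky: features chosen using training AND test rows.
        leaky += nearest_centroid_accuracy(
            X, y, train, test, select_features(X, y, train + test, k))
        # Clean: features chosen using training rows only.
        clean += nearest_centroid_accuracy(
            X, y, train, test, select_features(X, y, train, k))
    return leaky / reps, clean / reps

leaky_acc, clean_acc = run()
print(f"selection on all data: {leaky_acc:.2f}; "
      f"selection on training data only: {clean_acc:.2f}")
```

The only difference between the two arms is which rows `select_features` is allowed to see; the classifier is trained on the training rows in both cases, which mirrors the abstract's point that splitting after feature selection does not remove the bias.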

Dynamic Partial Reconfiguration implementation of the SVM/KNN multi-classifier on FPGA for bioinformatics application

37th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC)

Authors: Hussain, Hanaa M. (PAAET, Coll Technol Studies, Elect Engn Technol Dept, Shuwaikh 70654, Kuwait); Benkrid, Khaled (Univ Edinburgh, Sch Engn & Elect, Kings Bldg, Mayfield Rd, Edinburgh EH9 3JL, Midlothian, Scotland); Seker, Huseyin (Northumbria Univ, Dept Comp Sci & Digital Technol, Fac Engn & Environm, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England)

ISBN: 9781424492701 (print)

Bioinformatics data tend to be high-dimensional and thus impose significant computational demands. To overcome the limitations of conventional computing methods, several alternative high-performance computing solutions have been proposed, such as Graphical Processing Units (GPUs) and Field Programmable Gate Arrays (FPGAs). The latter have been shown to be efficient and high-performing. In recent years, FPGAs have benefited from the dynamic partial reconfiguration (DPR) feature, which adds the flexibility to alter specific regions within the chip. This work proposes combining FPGAs and DPR to build a dynamic multi-classifier architecture for processing bioinformatics data. In bioinformatics, applying different classification algorithms to the same dataset is desirable in order to obtain comparable, more reliable, consensus decisions, but it can take a long time on a conventional PC. The DPR implementations of two common classifiers, support vector machines (SVMs) and K-nearest neighbor (KNN), are combined to form a multi-classifier FPGA architecture that can use a specific region of the FPGA as either an SVM or a KNN classifier. This multi-classifier DPR implementation achieved at least an approximately 8x reduction in reconfiguration time over the single non-DPR classifier implementation, and occupied less space and fewer hardware resources than having both classifiers. The proposed architecture can be extended to work as an ensemble classifier.

Keywords: bioinformatics; field programmable gate arrays; support vector machines; DPR feature; FPGA; K-nearest neighbor; KNN classifier; SVM-KNN multiclassifier; bioinformatics application; bioinformatics data; classification algorithm; dynamic partial reconfiguration implementation; graphical processing units; support vector machine; Classification algorithms; Computer architecture; Hardware; Training
