An efficient low-level word image representation plays a crucial role in general cursive word recognition. This paper proposes a novel representation scheme, where a word image can be represented as two sequences of f...
Steering an autonomous vehicle requires continual adaptation of behavior to the various situations the vehicle is in. This paper describes research that implements such adaptation and optimization b...
Several cost-sensitive boosting algorithms have been reported as effective methods for dealing with the class imbalance problem. Misclassification costs, which reflect the different levels of class identification importance...
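As a rough illustration of the idea behind cost-sensitive boosting, the sketch below shows a generic cost-weighted, AdaBoost-style reweighting step; the cost values and the exact update form are illustrative assumptions, not the specific algorithms this abstract surveys.

```python
# A generic sketch of a cost-sensitive AdaBoost-style weight update, meant
# only to show how misclassification costs can enter boosting. The cost
# dictionary and the update form are assumptions for illustration.
import numpy as np

def cost_sensitive_reweight(weights, y_true, y_pred, alpha, costs):
    """Scale each example's boosting weight by its class-dependent cost.

    weights : current example weights (sum to 1)
    alpha   : weight of the current weak learner
    costs   : per-class misclassification cost, e.g. {0: 1.0, 1: 5.0}
    """
    cost_vec = np.array([costs[c] for c in y_true])
    mis = (y_true != y_pred).astype(float)
    # Misclassified examples are up-weighted more strongly when their class
    # carries a higher misclassification cost.
    new_weights = weights * np.exp(alpha * cost_vec * mis)
    return new_weights / new_weights.sum()
```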
ISBN: (Print) 0889865078
The self-organizing map (SOM) is a common methodology used to capture and represent data patterns and is increasingly playing a significant role in the development of neural networks. The primary objective of an SOM is to determine an approximate representation of data with an unknown probability distribution, from a multi-dimensional input space, using a lower-dimensional neural network. The approximation by the network corresponds to the topological structure inherent in the data distribution. The classical SOM, and many of its variations such as the growing grid, construct the network based on randomly selected pieces of the input space, where the number of pieces increases over time. We give an overview of a parallel algorithm for the SOM (ParaSOM), which instead examines the entire input in each step, leading to a more accurate representation of input patterns after only a fraction of the iterations, albeit requiring significantly more time. Both the growing grid and ParaSOM, unlike the classical SOM, do not maintain a fixed number of neurons. Instead, their networks may grow and increase in density to match the input space. We present a comparison of results generated by implementations of ParaSOM and the growing grid, making apparent their considerable performance differences despite having the growth feature in common.
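For reference, the classical SOM update that ParaSOM departs from can be sketched as follows; the grid size, learning-rate schedule, and neighborhood function below are illustrative assumptions, not the ParaSOM algorithm itself.

```python
# A minimal sketch of the classical SOM training loop: one randomly drawn
# sample per step, a best-matching unit, and a decaying Gaussian neighborhood.
# All hyperparameters are illustrative assumptions.
import numpy as np

def train_som(data, grid_shape=(10, 10), iters=1000, lr0=0.5, sigma0=3.0, seed=0):
    rng = np.random.default_rng(seed)
    h, w = grid_shape
    dim = data.shape[1]
    weights = rng.random((h, w, dim))              # neuron codebook vectors
    # Grid coordinates, used to measure neighborhood distance on the map.
    coords = np.stack(np.meshgrid(np.arange(h), np.arange(w), indexing="ij"), axis=-1)
    for t in range(iters):
        x = data[rng.integers(len(data))]          # classical SOM: one random sample per step
        # Best-matching unit (BMU): neuron whose weight vector is closest to x.
        dists = np.linalg.norm(weights - x, axis=-1)
        bmu = np.unravel_index(np.argmin(dists), (h, w))
        # Learning rate and neighborhood radius decay over time.
        lr = lr0 * np.exp(-t / iters)
        sigma = sigma0 * np.exp(-t / iters)
        # Gaussian neighborhood around the BMU on the grid.
        grid_dist = np.linalg.norm(coords - np.array(bmu), axis=-1)
        influence = np.exp(-(grid_dist ** 2) / (2 * sigma ** 2))[..., None]
        weights += lr * influence * (x - weights)  # pull neighbors toward the sample
    return weights
```

ParaSOM, as described above, would instead process the entire input in each step rather than one randomly selected sample.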
A feature selection method for text classification based on information gain ranking, improved by removing redundant terms using a mutual information measure and an inclusion index, is proposed. We report an experiment to st...
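A rough sketch of the kind of pipeline described above, information-gain ranking followed by a mutual-information redundancy filter, is given below; the thresholds, the dense term-count input, and the omission of the inclusion index are simplifying assumptions for illustration.

```python
# Information-gain ranking followed by a pairwise mutual-information
# redundancy filter. X is assumed to be a dense array of discrete term
# counts; top_k and the redundancy threshold are illustrative assumptions.
import numpy as np
from sklearn.feature_selection import mutual_info_classif
from sklearn.metrics import mutual_info_score

def select_terms(X, y, top_k=500, redundancy_threshold=0.7):
    # Rank terms by information gain with respect to the class labels
    # (estimated here as I(term; class)).
    ig = mutual_info_classif(X, y, discrete_features=True)
    ranked = np.argsort(ig)[::-1][:top_k]
    selected = []
    for term in ranked:
        # Drop a candidate term if it is too strongly dependent on an
        # already-selected one (pairwise mutual information).
        redundant = any(
            mutual_info_score(X[:, term], X[:, kept]) > redundancy_threshold
            for kept in selected
        )
        if not redundant:
            selected.append(term)
    return selected
```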
Data mining is one of the most important areas in the 21st century, with many wide-ranging applications. These include medicine, finance, commerce and engineering. Pattern mining is amongst the most important and challenging techniques employed in data mining. Patterns are collections of items which satisfy certain properties. Emerging patterns are those whose frequencies change significantly from one dataset to another. They represent strong contrast knowledge and have been shown to be very successful for constructing accurate and robust classifiers. In this paper, we examine various kinds of contrast patterns. We also investigate efficient pattern mining techniques and discuss how to exploit patterns to construct effective classifiers.
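The defining property of an emerging pattern, a large growth in support from one dataset to another, can be illustrated with a small brute-force sketch; the support and growth-rate thresholds are illustrative assumptions, and practical miners avoid this exhaustive enumeration.

```python
# Brute-force mining of emerging patterns by growth rate between two
# transaction collections. Exponential in the number of items, so this is
# only meant to illustrate the definition.
from itertools import combinations

def support(itemset, transactions):
    """Fraction of transactions containing every item in the itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def emerging_patterns(d1, d2, items, max_len=3, min_support=0.05, min_growth=5.0):
    """Itemsets whose support grows from d1 to d2 by at least min_growth."""
    patterns = []
    for k in range(1, max_len + 1):
        for itemset in combinations(sorted(items), k):
            s1, s2 = support(itemset, d1), support(itemset, d2)
            if s2 < min_support:
                continue
            growth = float("inf") if s1 == 0 else s2 / s1
            if growth >= min_growth:
                patterns.append((itemset, growth))
    return patterns

# Example: contrast two tiny transaction collections.
d1 = [{"a", "b"}, {"b"}, {"a", "c"}]
d2 = [{"a", "b", "c"}, {"b", "c"}, {"c"}]
print(emerging_patterns(d1, d2, items={"a", "b", "c"}))
```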
This paper uses a set of 3D geometric measures to characterize lung nodules as malignant or benign. Based on a sample of 36 nodules, 29 benign and 7 malignant, these measures are analyzed with a tec...
ISBN: (Print) 0769522971
The k-nearest neighbor (KNN) classification is a simple and effective classification approach. However, improving the performance of the classifier is still attractive. Combining multiple classifiers is an effective technique for improving accuracy. There are many general combining algorithms, such as Bagging, Boosting, or Error Correcting Output Coding, that significantly improve classifiers such as decision trees, rule learners, or neural networks. Unfortunately, these combining methods do not improve nearest neighbor classifiers. In this paper, we first present a new approach to combining multiple KNN classifiers based on different distance functions, in which we apply multiple distance functions to improve the performance of the k-nearest neighbor classifier. Second, we develop a combining method in which the weights of the distance functions are learnt by a genetic algorithm. Finally, combining classifiers in error-correcting output coding is discussed. The proposed algorithms seek to increase generalization accuracy when compared to the basic k-nearest neighbor algorithm. Experiments have been conducted on some benchmark datasets from the UCI Machine Learning Repository. The results show that the proposed algorithms improve the performance of the k-nearest neighbor classification.
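A minimal sketch of the first idea, combining KNN classifiers built on different distance functions through weighted voting, is shown below; the metric list, k, and uniform default weights are assumptions, and the paper's GA-learnt weights would take the place of the `weights` argument.

```python
# Combine k-nearest neighbor classifiers that use different distance metrics
# by (optionally weighted) majority vote. Metrics, k, and the default equal
# weights are illustrative assumptions.
import numpy as np
from collections import Counter
from sklearn.neighbors import KNeighborsClassifier

class MultiDistanceKNN:
    def __init__(self, k=5, metrics=("euclidean", "manhattan", "chebyshev"), weights=None):
        self.members = [KNeighborsClassifier(n_neighbors=k, metric=m) for m in metrics]
        self.weights = weights or [1.0] * len(self.members)

    def fit(self, X, y):
        for member in self.members:
            member.fit(X, y)
        return self

    def predict(self, X):
        votes = np.array([member.predict(X) for member in self.members])
        out = []
        for col in votes.T:                          # one column per test point
            tally = Counter()
            for label, w in zip(col, self.weights):  # weighted vote over member predictions
                tally[label] += w
            out.append(tally.most_common(1)[0][0])
        return np.array(out)
```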
ISBN: (Print) 3540286535
Feature selection refers to the problem of selecting those input features that are most predictive of a given outcome; a problem encountered in many areas such as machine learning, pattern recognition and signal processing. In particular, solutions to this have found successful application in tasks that involve datasets containing huge numbers of features (in the order of tens of thousands), which would be impossible to process further. Recent examples include text processing and web content classification. Rough set theory has been used as such a dataset pre-processor with much success, but current methods are inadequate at finding minimal reductions, the smallest sets of features possible. This paper proposes a technique that considers this problem from a propositional satisfiability perspective. In this framework, minimal subsets can be located and verified. An initial experimental investigation is conducted, comparing the new method with a standard rough set-based feature selector.
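To make the notion of a minimal reduction concrete, the brute-force sketch below searches feature subsets in order of increasing size and returns the first one that is consistent with the class labels; the paper instead encodes this search as a propositional satisfiability problem, so this is an illustration of the objective, not the proposed method.

```python
# Brute-force search for a minimal consistent feature subset (a reduct in
# rough set terms). Exponential in the number of features; data is assumed
# to be a discrete-valued numpy array.
from itertools import combinations
import numpy as np

def is_consistent(X, y, subset):
    """True if no two objects agree on `subset` but carry different labels."""
    seen = {}
    for row, label in zip(X, y):
        key = tuple(row[list(subset)])
        if key in seen and seen[key] != label:
            return False
        seen[key] = label
    return True

def minimal_reduct(X, y):
    n_features = X.shape[1]
    for size in range(1, n_features + 1):
        for subset in combinations(range(n_features), size):
            if is_consistent(X, y, subset):
                return subset          # first consistent subset of smallest size
    return tuple(range(n_features))
```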
Naive Bayes is one of the most efficient and effective inductive learning algorithms for machine learning and data mining. Its competitive performance in classification is surprising, because the conditional independence assumption on which it is based is rarely true in real-world applications. An open question is: what is the true reason for the surprisingly good performance of Naive Bayes in classification? In this paper, we propose a novel explanation for the good classification performance of Naive Bayes. We show that, essentially, the dependence distribution plays a crucial role. Here dependence distribution means how the local dependence of an attribute distributes in each class, evenly or unevenly, and how the local dependences of all attributes work together, consistently (supporting a certain classification) or inconsistently (canceling each other out). Specifically, we show that no matter how strong the dependences among attributes are, Naive Bayes can still be optimal if the dependences distribute evenly in classes, or if the dependences cancel each other out. We propose and prove a sufficient and necessary condition for the optimality of Naive Bayes. Further, we investigate the optimality of Naive Bayes under the Gaussian distribution. We present and prove a sufficient condition for the optimality of Naive Bayes, in which the dependences among attributes exist. This provides evidence that dependences may cancel each other out. Our theoretical analysis can be used in designing learning algorithms. In fact, a major class of learning algorithms for Bayesian networks are conditional independence-based (or CI-based), which are essentially based on dependence. We design a dependence-distribution-based algorithm by extending the Chow-Liu algorithm, a widely used CI-based algorithm. Our experiments show that the new algorithm outperforms the Chow-Liu algorithm, which also provides empirical evidence to support our new explanation.
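The conditional independence assumption under discussion can be seen directly in a textbook Gaussian Naive Bayes classifier, where per-attribute log-likelihoods are simply summed within each class; the sketch below is that standard formulation, not the paper's extended Chow-Liu algorithm.

```python
# A compact Gaussian Naive Bayes classifier. The sum of per-attribute
# log-likelihoods is exactly the conditional independence assumption
# discussed above. Standard textbook formulation, included for illustration.
import numpy as np

class NaiveBayesGaussian:
    def fit(self, X, y):
        self.classes = np.unique(y)
        self.priors, self.means, self.vars = {}, {}, {}
        for c in self.classes:
            Xc = X[y == c]
            self.priors[c] = len(Xc) / len(X)
            self.means[c] = Xc.mean(axis=0)
            self.vars[c] = Xc.var(axis=0) + 1e-9   # small floor avoids division by zero
        return self

    def predict(self, X):
        scores = []
        for c in self.classes:
            # log P(c) + sum_i log P(x_i | c), assuming attributes independent given c.
            log_lik = -0.5 * (np.log(2 * np.pi * self.vars[c])
                              + (X - self.means[c]) ** 2 / self.vars[c]).sum(axis=1)
            scores.append(np.log(self.priors[c]) + log_lik)
        return self.classes[np.argmax(np.stack(scores, axis=1), axis=1)]
```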