Search results - Inner Mongolia University Library

Feature selection via dependence maximization

Journal of Machine Learning Research, 2012, Volume 13, Issue 1

Authors: Le Song; Alex Smola; Arthur Gretton; Justin Bedo; Karsten Borgwardt. Affiliations: Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA; Yahoo! Research, Santa Clara, CA; Gatsby Computational Neuroscience Unit, London, UK, and Intelligent Systems Group, Max Planck Institutes, Tübingen, Germany; Statistical Machine Learning Program, National ICT Australia, Canberra, ACT, Australia, and Australian National University, Canberra, ACT, Australia; Machine Learning and Computational Biology Research Group, Max Planck Institutes, Tübingen, Germany

We introduce a framework for feature selection based on dependence maximization between the selected features and the labels of an estimation problem, using the Hilbert-Schmidt Independence Criterion. The key idea is that good features should be highly dependent on the labels. Our approach leads to a greedy procedure for feature selection. We show that a number of existing feature selectors are special cases of this framework. Experiments on both artificial and real-world data show that our feature selector works well in practice.

Keywords: Hilbert space embedding of distribution; Hilbert-Schmidt independence criterion; feature selection; independence measure; kernel methods
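
The dependence-maximization idea in the abstract can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: it assumes the standard biased empirical estimate HSIC = tr(KHLH) / (m - 1)^2, a Gaussian RBF kernel on the features, a delta kernel on the labels, and a greedy forward search (the abstract only says "greedy", so the search direction is an assumption). The names rbf_gram, center, hsic and greedy_select are hypothetical.

```python
import math

def rbf_gram(rows, gamma=1.0):
    """Gaussian RBF Gram matrix for a list of equal-length feature vectors."""
    return [[math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))
             for v in rows] for u in rows]

def center(K):
    """Double-center a Gram matrix: H K H with H = I - (1/m) 1 1^T."""
    m = len(K)
    row = [sum(r) / m for r in K]
    col = [sum(K[i][j] for i in range(m)) / m for j in range(m)]
    tot = sum(row) / m
    return [[K[i][j] - row[i] - col[j] + tot for j in range(m)]
            for i in range(m)]

def hsic(K, L):
    """Biased empirical HSIC, tr(K H L H) / (m - 1)^2, for symmetric K and L."""
    m = len(K)
    Kc, Lc = center(K), center(L)
    total = sum(Kc[i][j] * Lc[i][j] for i in range(m) for j in range(m))
    return total / (m - 1) ** 2

def greedy_select(X, y, k, gamma=1.0):
    """Greedy forward selection (an assumed variant): add the feature that
    maximizes HSIC between the selected columns and the labels."""
    L = [[1.0 if a == b else 0.0 for b in y] for a in y]  # delta kernel on labels
    selected, remaining = [], list(range(len(X[0])))
    while len(selected) < k and remaining:
        def score(j):
            cols = selected + [j]
            rows = [[x[c] for c in cols] for x in X]
            return hsic(rbf_gram(rows, gamma), L)
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data: feature 0 equals the label, feature 1 is balanced across labels.
y = [i % 2 for i in range(20)]
X = [[float(y[i]), (i // 2) / 10.0] for i in range(20)]
print(greedy_select(X, y, 1))
```

On this toy data the first feature carries all of the label information, so it is the one a dependence-based selector should prefer.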

Using two-stage conditional word frequency models to model word burstiness and motivating TF-IDF

11th International Conference on Artificial Intelligence and Statistics, AISTATS 2007

Author: Sunehag, Peter. Affiliation: Statistical Machine Learning Program, National ICT Australia, Locked Bag 8001, ACT 2601, Australia

Several authors have recently studied the problem of creating exchangeable models for natural languages that exhibit word burstiness. Word burstiness means that a word that has appeared once in a text should be more likely to appear again than it was to appear in the first place. In this article the different existing methods are compared theoretically through a unifying framework. New models that do not satisfy the exchangeability assumption, but whose probability revisions depend only on the word counts of what has previously appeared, are introduced within this framework. We will refer to these models as two-stage conditional presence/abundance models since they, just like some recently introduced models for the abundance of rare species in ecology, separate the issue of presence from the issue of abundance when present. We will see that the widely used TF-IDF heuristic for information retrieval follows naturally from these models by calculating a cross-entropy. We will also discuss a connection between TF-IDF and file formats that separate presence from abundance given presence.
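
For orientation, the TF-IDF heuristic mentioned in the abstract can be written in its common textbook form, weight = tf × log(N / df). The sketch below shows only that form; it does not reproduce the paper's cross-entropy derivation, and the function name tf_idf and the toy documents are illustrative assumptions.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {word: weight} dict per document, with
    weight = term frequency * log(N / document frequency)."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(doc))  # count each word once per document
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({w: tf[w] * math.log(n / df[w]) for w in tf})
    return weights

docs = [["kernel", "kernel", "method"], ["method", "graph"]]
for w in tf_idf(docs):
    print(w)
```

A word that occurs in every document gets weight zero, while a repeated word that is rare across documents is weighted up, which is the presence-versus-abundance split the abstract alludes to.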

Hyperparameter learning for graph based semi-supervised learning algorithms

20th Annual Conference on Neural Information Processing Systems, NIPS 2006

Authors: Zhang, Xinhua; Lee, Wee Sun. Affiliations: Statistical Machine Learning Program, National ICT Australia, Canberra, Australia; CSL, RSISE, ANU, Canberra, Australia; Department of Computer Science, National University of Singapore, 3 Science Drive 2, Singapore 117543, Singapore

ISBN: 9780262195683 (print)

Semi-supervised learning algorithms have been successfully applied in many applications with scarce labeled data, by utilizing the unlabeled data. One important category is graph based semi-supervised learning algorithms, for which the performance depends considerably on the quality of the graph, or its hyperparameters. In this paper, we deal with the less explored problem of learning the graphs. We propose a graph learning method for the harmonic energy minimization method; this is done by minimizing the leave-one-out prediction error on labeled data points. We use a gradient based method and design an efficient algorithm which significantly accelerates the calculation of the gradient by applying the matrix inversion lemma and using careful pre-computation. Experimental results show that the graph learning method is effective in improving the performance of the classification algorithm.

Keywords: learning algorithms
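
The quantity being minimized in the abstract, leave-one-out error of the harmonic solution as a function of a graph hyperparameter, can be illustrated as follows. This is a rough sketch under stated assumptions: a dense Gaussian-weighted graph with a single bandwidth sigma, the harmonic function computed by iterative neighbor averaging rather than in closed form, and two candidate bandwidths compared directly instead of the paper's gradient-based optimization with the matrix inversion lemma. All function names are hypothetical.

```python
import math

def rbf_weights(points, sigma):
    """Dense graph weights w_ij = exp(-||x_i - x_j||^2 / sigma^2), zero diagonal."""
    n = len(points)
    return [[0.0 if i == j else
             math.exp(-sum((a - b) ** 2
                           for a, b in zip(points[i], points[j])) / sigma ** 2)
             for j in range(n)] for i in range(n)]

def harmonic(W, labels, iters=500):
    """Harmonic function on the graph: labeled nodes are clamped, each
    unlabeled node is repeatedly set to the weighted mean of its neighbors."""
    n = len(W)
    f = [labels.get(i, 0.5) for i in range(n)]
    for _ in range(iters):
        for i in range(n):
            if i not in labels:
                f[i] = sum(W[i][j] * f[j] for j in range(n)) / sum(W[i])
    return f

def loo_error(points, labels, sigma):
    """Mean squared leave-one-out error over the labeled nodes."""
    W = rbf_weights(points, sigma)
    err = 0.0
    for i, y in labels.items():
        held_out = {k: v for k, v in labels.items() if k != i}
        err += (harmonic(W, held_out)[i] - y) ** 2
    return err / len(labels)

# Two well-separated clusters on a line; four of six points are labeled.
points = [[0.0], [0.1], [0.2], [1.0], [1.1], [1.2]]
labels = {0: 0.0, 2: 0.0, 3: 1.0, 5: 1.0}
for sigma in (0.3, 10.0):
    print(sigma, loo_error(points, labels, sigma))
```

A bandwidth that respects the cluster structure gives a much lower leave-one-out error than one that makes the graph nearly uniform, which is why the error is a usable training signal for the graph.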

Simpler knowledge-based support vector machines

23rd International Conference on Machine Learning, ICML 2006

Authors: Le, Quoc V.; Smola, Alex J.; Gärtner, Thomas. Affiliations: RSISE, Australian National University, 0200 ACT, Australia; Statistical Machine Learning Program, National ICT Australia, 0200 ACT, Australia; Fraunhofer AIS.KD, Schloß Birlinghoven, 53754 Sankt Augustin, Germany

ISBN: 1595933832 (print)

If appropriately used, prior knowledge can significantly improve the predictive accuracy of learning algorithms or reduce the amount of training data needed. In this paper we introduce a simple method to incorporate prior knowledge in support vector machines by modifying the hypothesis space rather than the optimization problem. The optimization problem is amenable to solution by the constrained concave convex procedure, which finds a local optimum. The paper discusses different kinds of prior knowledge and demonstrates the applicability of the approach in some characteristic experiments.

Keywords: Support vector machines

Class prediction from time series gene expression profiles using dynamical systems kernels

11th Pacific Symposium on Biocomputing 2006, PSB 2006

Authors: Borgwardt, Karsten M.; Vishwanathan, S.V.N.; Kriegel, Hans-Peter. Affiliations: Institute for Computer Science, Ludwig-Maximilians-University, Oettingenstr. 67, 80538 Munich, Germany; Statistical Machine Learning Program, National ICT Australia, Canberra, ACT 0200, Australia

ISBN: 9812564632 (print)

We present a kernel-based approach to the classification of time series of gene expression profiles. Our method takes into account the dynamic evolution over time as well as the temporal characteristics of the data. More specifically, we model the evolution of the gene expression profiles as a Linear Time Invariant (LTI) dynamical system and estimate its model parameters. A kernel on dynamical systems is then used to classify these time series. We successfully test our approach on a published dataset to predict response to drug therapy in Multiple Sclerosis patients. For pharmacogenomics, our method offers a huge potential for advanced computational tools in disease diagnosis, and disease and drug therapy outcome prognosis.

Keywords: Dynamical systems

Hyperparameter learning for graph based semi-supervised learning algorithms

Proceedings of the 20th International Conference on Neural Information Processing Systems

Authors: Xinhua Zhang; Wee Sun Lee. Affiliations: Statistical Machine Learning Program, National ICT Australia, Canberra, Australia, and CSL, RSISE, ANU, Canberra, Australia; Department of Computer Science, National University of Singapore, Singapore

Semi-supervised learning algorithms have been successfully applied in many applications with scarce labeled data, by utilizing the unlabeled data. One important category is graph based semi-supervised learning algorithms, for which the performance depends considerably on the quality of the graph, or its hyperparameters. In this paper, we deal with the less explored problem of learning the graphs. We propose a graph learning method for the harmonic energy minimization method; this is done by minimizing the leave-one-out prediction error on labeled data points. We use a gradient based method and design an efficient algorithm which significantly accelerates the calculation of the gradient by applying the matrix inversion lemma and using careful pre-computation. Experimental results show that the graph learning method is effective in improving the performance of the classification algorithm.

Learnability of probabilistic automata via oracles

16th International Conference on Algorithmic Learning Theory, ALT 2005

Authors: Guttman, Omri; Vishwanathan, S.V.N.; Williamson, Robert C. Affiliations: Statistical Machine Learning Program, National ICT Australia; Australian National University, Canberra, ACT, Australia

ISBN: 354029242X (print)

Efficient learnability using the state merging algorithm is known for a subclass of probabilistic automata termed μ-distinguishable. In this paper, we prove that state merging algorithms can be extended to efficiently learn a larger class of automata. In particular, we show learnability of a subclass which we call μ₂-distinguishable. Using an analog of the Myhill-Nerode theorem for probabilistic automata, we analyze μ-distinguishability and generalize it to μₚ-distinguishability. By combining new results from property testing with the state merging algorithm we obtain KL-PAC learnability of the new automata class. © Springer-Verlag Berlin Heidelberg 2005.

Keywords: Automata theory

Heteroscedastic Gaussian process regression

ICML 2005: 22nd International Conference on Machine Learning

Authors: Le, Quoc V.; Smola, Alex J.; Canu, Stéphane. Affiliations: RSISE, Australian National University, ACT 0200, Australia; Statistical Machine Learning Program, National ICT Australia, ACT 0200, Australia; PSI - FRE CNRS 2645, INSA de Rouen, France

ISBN: 1595931805 (print)

This paper presents an algorithm to estimate simultaneously both the mean and the variance of a nonparametric regression problem. The key point is that we are able to estimate variance locally, unlike standard Gaussian Process regression or SVMs. This means that our estimator adapts to the local noise. The problem is cast in the setting of maximum a posteriori estimation in exponential families. Unlike previous work, we obtain a convex optimization problem which can be solved via Newton's method.

Keywords: learning systems

Step size-adapted online support vector learning

8th International Symposium on Signal Processing and its Applications, ISSPA 2005

Authors: Karatzoglou, Alexandros; Vishwanathan, S.V.N.; Schraudolph, Nicol N.; Smola, Alex J. Affiliations: Department of Statistics, Technische Universität Wien, Wiedner Hauptstraße 8-10, Austria; National ICT Australia, Statistical Machine Learning Program, Australian National University, Canberra

ISBN: 0780392434 (print)

We present an online Support Vector Machine (SVM) that uses Stochastic Meta-Descent (SMD) to adapt its step size automatically. We formulate the online learning problem as a stochastic gradient descent in Reproducing Kernel Hilbert Space (RKHS) and translate SMD to the nonparametric setting, where its gradient trace parameter is no longer a coefficient vector but an element of the RKHS. We derive efficient updates that allow us to perform the step size adaptation in linear time. We apply the online SVM framework to a variety of loss functions and in particular show how to achieve efficient online multiclass classification. Experimental evidence suggests that our algorithm outperforms existing methods. © 2005 IEEE.

Keywords: learning systems
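
The baseline that the abstract builds on, stochastic gradient descent in an RKHS, can be sketched as below. This is only the plain version with a fixed step size and a regularized hinge loss; it deliberately leaves out the SMD step-size adaptation that is the paper's contribution. The class name OnlineKernelSVM and the parameters eta, lam and gamma are assumptions made for illustration.

```python
import math

def rbf(u, v, gamma=1.0):
    """Gaussian RBF kernel between two feature tuples."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))

class OnlineKernelSVM:
    """f(x) = sum_i a_i k(s_i, x), updated one example at a time by SGD on
    hinge loss plus (lam / 2) * ||f||^2, with a fixed step size eta."""

    def __init__(self, eta=0.5, lam=0.01, gamma=1.0):
        self.eta, self.lam, self.gamma = eta, lam, gamma
        self.support = []  # list of (coefficient, point) pairs

    def predict(self, x):
        return sum(a * rbf(s, x, self.gamma) for a, s in self.support)

    def update(self, x, y):
        """One SGD step on example (x, y) with y in {-1, +1}."""
        margin = y * self.predict(x)
        # Gradient of the regularizer shrinks every existing coefficient.
        shrink = 1.0 - self.eta * self.lam
        self.support = [(a * shrink, s) for a, s in self.support]
        # Hinge loss is active only when the margin is violated.
        if margin < 1.0:
            self.support.append((self.eta * y, x))

model = OnlineKernelSVM()
for _ in range(20):
    model.update((0.0,), -1)
    model.update((2.0,), +1)
print(model.predict((0.0,)), model.predict((2.0,)))
```

In this sketch the expansion grows with every margin violation, and the step size eta has to be chosen by hand; adapting it automatically is the problem the paper addresses.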

Step size-adapted online support vector learning

International Symposium on Signal Processing and Its Applications (ISSPA)

Authors: A. Karatzoglou; S.V.N. Vishwanathan; N.N. Schraudolph; A.J. Smola. Affiliations: Department of Statistics, Technische Universität Wien, Austria; RSISE, Statistical Machine Learning Program, National ICT Australia, Australian National University, Canberra, Australia
