检索结果-内蒙古大学图书馆

Step size adaptation in reproducing kernel Hilbert space

JOURNAL OF machine learning RESEARCH 2006年 7卷 1107-1133页

作者： Vishwanathan, S. V. N. Schraudolph, Nicol N. Smola, Alex J. Natl ICT Australia Stat Machine Learning Program Canberra ACT 2601 Australia Australian Natl Univ Res Sch Informat Sci & Engn Canberra ACT 0200 Australia

This paper presents an online support vector machine (SVM) that uses the stochastic meta-descent (SMD) algorithm to adapt its step size automatically. We formulate the online learning problem as a stochastic gradient descent in reproducing kernel Hilbert space (RKHS) and translate SMD to the nonparametric setting, where its gradient trace parameter is no longer a coefficient vector but an element of the RKHS. We derive efficient updates that allow us to perform the step size adaptation in linear time. We apply the online SVM framework to a variety of loss functions, and in particular show how to handle structured output spaces and achieve efficient online multiclass classification. Experiments show that our algorithm outperforms more primitive methods for setting the gradient step size.

关键词： online SVM stochastic meta-descent structured output spaces

来源：评论

学校读者我要写书评

暂无评论

Newton-like methods for nonparametric independent component analysis

引用

13th International Conference on Neural Informational Processing

作者： Shen, Hao Hueper, Knut Smola, Alexander J. Natl ICT Australia Syst Engn & Complex Syst Res Program Canberra ACT 2612 Australia Natl ICT Australia Stat Machine Learning Res Program Canberra ACT 2612 Australia Australian Natl Univ Res Sch Informat Sci & Engn Dept Informat Engn Canberra ACT 0200 Australia Australian Natl Univ Res Sch Informat Sci & Engn Comp Sci Lab Canberra ACT 0200 Australia

ISBN: (纸本)3540464794

The performance of ICA algorithms significantly depends on the choice of the contrast function and the optimisation algorithm used in obtaining the demixing matrix. In this paper we focus on the standard linear nonparametric ICA problem from an optimisation point of view. It is well known that after a pre-whitening process, the problem can be solved via an optimisation approach on a suitable manifold. We propose an approximate Newton's method on the unit sphere to solve the one-unit linear nonparametric ICA problem. The local convergence properties are discussed. The performance of the proposed algorithms is investigated by numerical experiments.

关键词： Computational methods

来源：评论

学校读者我要写书评

暂无评论

Hyperparameter learning for graph based semi-supervised learning algorithms 06

Hyperparameter learning for graph based semi-supervised lear...

引用

Proceedings of the 20th International Conference on Neural Information Processing Systems

作者： Xinhua Zhang Wee Sun Lee Statistical Machine Learning Program National ICT Australia Canberra Australia and CSL RSISE ANU Canberra Australia Department of Computer Science National University of Singapore Singapore

Semi-supervised learning algorithms have been successfully applied in many applications with scarce labeled data, by utilizing the unlabeled data. One important category is graph based semi-supervised learning algorithms, for which the performance depends considerably on the quality of the graph, or its hyperparameters. In this paper, we deal with the less explored problem of learning the graphs. We propose a graph learning method for the harmonic energy minimization method; this is done by minimizing the leave-one-out prediction error on labeled data points. We use a gradient based method and designed an efficient algorithm which significantly accelerates the calculation of the gradient by applying the matrix inversion lemma and using careful pre-computation. Experimental results show that the graph learning method is effective in improving the performance of the classification algorithm.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Nonparametric quantile estimation

引用

JOURNAL OF machine learning RESEARCH 2006年 7卷 1231-1264页

作者： Takeuchi, Ichiro Le, Quoc V. Sears, Timothy D. Smola, Alexander J. Mie Univ Grad Sch Engn Div Comp Sci Tsu Mie 5148507 Japan Australian Natl Univ RSISE Canberra ACT 0200 Australia Natl ICT Australia Stat Machine Learning Program Canberra ACT 0200 Australia

In regression, the desired estimate of y vertical bar x is not always given by a conditional mean, although this is most common. Sometimes one wants to obtain a good estimate that satisfies the property that a proportion, tau, of y vertical bar x, will be below the estimate. For tau = 0.5 this is an estimate of the median. What might be called median regression, is subsumed under the term quantile regression. We present a nonparametric version of a quantile estimator, which can be obtained by solving a simple quadratic programming problem and provide uniform convergence statements and bounds on the quantile property of our estimator. Experimental results show the feasibility of the approach and competitiveness of our method with existing ones. We discuss several types of extensions including an approach to solve the quantile crossing problems, as well as a method to incorporate prior qualitative knowledge such as monotonicity constraints.

关键词： support vector machines kernel methods quantile estimation nonparametric techniques estimation with constraints

来源：评论

学校读者我要写书评

暂无评论

Second order cone programming approaches for handling missing and uncertain data

引用

JOURNAL OF machine learning RESEARCH 2006年 7卷 1283-1314页

作者： Shivaswamy, Pannagadatta K. Bhattacharyya, Chiranjib Smola, Alexander J. Columbia Univ New York NY 10027 USA Indian Inst Sci Dept Comp Sci & Automat Bangalore 560012 Karnataka India Natl ICT Australia Stat Machine Learning Program Canberra ACT 0200 Australia Australian Natl Univ Canberra ACT 0200 Australia

We propose a novel second order cone programming formulation for designing robust classifiers which can handle uncertainty in observations. Similar formulations are also derived for designing regression functions which are robust to uncertainties in the regression setting. The proposed formulations are independent of the underlying distribution, requiring only the existence of second order moments. These formulations are then specialized to the case of missing values in observations for both classification and regression problems. Experiments show that the proposed formulations outperform imputation.

关键词： Computer systems programming

来源：评论

学校读者我要写书评

暂无评论

Solving factored MDPs with hybrid state and action variables

引用

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH 2006年第1期27卷 153-201页

作者： Kveton, Branislav Hauskrecht, Milos Guestrin, Carlos Univ Pittsburgh Intelligent Syst Program Pittsburgh PA 15260 USA Univ Pittsburgh Dept Comp Sci Pittsburgh PA 15260 USA Carnegie Mellon Univ Machine Learning Dept Pittsburgh PA 15213 USA Carnegie Mellon Univ Dept Comp Sci Pittsburgh PA 15213 USA

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automated decision support systems. In this paper, we describe a novel hybrid factored Markov decision process (MDP) model that allows for a compact representation of these problems, and a new hybrid approximate linear programming (HALP) framework that permits their efficient solutions. The central idea of HALP is to approximate the optimal value function by a linear combination of basis functions and optimize its weights by linear programming. We analyze both theoretical and computational aspects of this approach, and demonstrate its scale-up potential on several hybrid optimization problems.

关键词： Markov processes

来源：评论

学校读者我要写书评

暂无评论

Binet-cauchy kernels

Binet-cauchy kernels

引用

18th Annual Conference on Neural Information Processing Systems, NIPS 2004

作者： Vishwanathan, S.V.N. Smola, Alexander J. National ICT Australia Machine Learning Program Canberra ACT 0200 Australia

ISBN: (纸本)0262195348

We propose a family of kernels based on the Binet-Cauchy theorem and its extension to Fredholm operators. This includes as special cases all currently known kernels derived from the behavioral framework, diffusion processes, marginalized kernels, kernels on graphs, and the kernels on sets arising from the subspace angle approach. Many of these kernels can be seen as the extrema of a new continuum of kernel functions, which leads to numerous new special cases. As an application, we apply the new class of kernels to the problem of clustering of video sequences with encouraging results.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Learnability of probabilistic automata via oracles

Learnability of probabilistic automata via oracles

引用

16th International Conference on Algorithmic learning Theory, ALT 2005

作者： Guttman, Omri Vishwanathan, S.V.N. Williamson, Robert C. Statistical Machine Learning Program National ICT Australia Australian National University Canberra ACT Australia

ISBN: (纸本)354029242X

Efficient learnability using the state merging algorithm is known for a subclass of probabilistic automata termed μ-distinguishable. In this paper, we prove that state merging algorithms can be extended to efficiently learn a larger class of automata. In particular, we show learnability of a subclass which we call μ2-distinguishable. Using an analog of the Myhill-Nerode theorem for probabilistic automata, we analyze μ-distinguishability and generalize it to μp- distinguishability. By combining new results from property testing with the state merging algorithm we obtain KL-PAC learnability of the new automata class. © Springer-Verlag Berlin Heidelberg 2005.

关键词： Automata theory

来源：评论

学校读者我要写书评

暂无评论

Large-scale multiclass transduction

Large-scale multiclass transduction

引用

2005 Annual Conference on Neural Information Processing Systems, NIPS 2005

作者： Gärtner, Thomas Le, Quoc V. Burton, Simon Smola, Alex J. Vishwanathan, Vishy Fraunhofer AIS.KD 53754 Sankt Augustin Germany Statistical Machine Learning Program NICTA ANU Canberra ACT Australia

ISBN: (纸本)9780262232531

We present a method for performing transductive inference on very large datasets. Our algorithm is based on multiclass Gaussian processes and is effective whenever the multiplication of the kernel matrix or its inverse with a vector can be computed sufficiently fast. This holds, for instance, for certain graph and string kernels. Transduction is achieved by variational inference over the unlabeled data subject to a balancing constraint.

关键词： Bacteriophages

来源：评论

学校读者我要写书评

暂无评论

Kernel methods for missing variables

Kernel methods for missing variables

引用

10th International Workshop on Artificial Intelligence and Statistics, AISTATS 2005

作者： Smola, Alex J. Vishwanathan, S.V.N. Hofmann, Thomas Statistical Machine Learning Program NICTA ANU Canberra ACT 0200 Australia Department of Computer Science Brown University Providence RI United States

ISBN: (纸本)097273581X

We present methods for dealing with missing variables in the context of Gaussian Processes and Support Vector machines. This solves an important problem which has largely been ignored by kernel methods: How to systematically deal with incomplete data? Our method can also be applied to problems with partially observed labels as well as to the transductive setting where we view the labels as missing data. Our approach relies on casting kernel methods as an estimation problem in exponential families. Hence, estimation with missing variables becomes a problem of computing marginal distributions, and finding efficient optimization methods. To that extent we propose an optimization scheme which extends the Concave Convex Procedure (CCP) of Yuille and Rangarajan, and present a simplified and intuitive proof of its convergence. We show how our algorithm can be specialized to various cases in order to efficiently solve the optimization problems that arise. Encouraging preliminary experimental results on the USPS dataset are also presented.

关键词： Problem solving

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：