ISBN (print): 3540265562
We discuss a simple sparse linear problem that is hard to learn with any algorithm that uses a linear combination of the training instances as its weight vector. The hardness holds even if we allow the learner to embed the instances into any higher dimensional feature space (and use a kernel function to define the dot product between the embedded instances). These algorithms are inherently limited by the fact that after seeing k instances only a weight space of dimension k can be spanned. Our hardness result is surprising because the same problem can be efficiently learned using the exponentiated gradient (EG) algorithm: now the component-wise logarithms of the weights are essentially a linear combination of the training instances. This algorithm enforces additional constraints on the weights (all must be non-negative and sum to one), and in some cases these constraints alone force the rank of the weight space after k instances to grow as fast as 2^k.
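For contrast, here is a minimal sketch (ours, not the paper's code; squared loss and a fixed learning rate are illustrative assumptions) of the two update rules the abstract compares:

```python
import numpy as np

def gd_update(w, x, y, eta=0.1):
    """Plain gradient descent on squared loss: after k updates the weight
    vector is w0 plus a linear combination of the k instances, i.e. it
    never leaves their span -- the limitation the hardness result exploits."""
    grad = 2.0 * (w @ x - y) * x
    return w - eta * grad

def eg_update(w, x, y, eta=0.1):
    """Exponentiated gradient: the update is multiplicative, so it is the
    component-wise *logarithms* of the weights that accumulate a linear
    combination of the instances; renormalizing keeps w on the simplex
    (non-negative, summing to one)."""
    grad = 2.0 * (w @ x - y) * x
    w = w * np.exp(-eta * grad)
    return w / w.sum()
```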
We present an online Support Vector Machine (SVM) that uses Stochastic Meta-Descent (SMD) to adapt its step size automatically. We formulate the online learning problem as a stochastic gradient descent in Reproducing ...
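For orientation, a simplified sketch of SMD-style per-parameter gain adaptation (ours; full SMD also propagates a Hessian-vector product through the trace v, which we omit here):

```python
import numpy as np

def smd_step(w, grad, p, v, mu=0.05, rho=0.5, lam=0.99):
    """One descent step with SMD-style multiplicative gain adaptation.
    `p` holds per-parameter step sizes, `v` a decayed trace of parameter
    changes with respect to the log-gains. Gains grow while successive
    gradients point the same way and shrink when they oscillate."""
    p = p * np.maximum(rho, 1.0 - mu * grad * v)  # multiplicative gain update
    w = w - p * grad                              # ordinary descent step
    v = lam * v - p * grad                        # update the gain trace
    return w, p, v
```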
ISBN (print): 1595931805
This paper presents an algorithm to estimate simultaneously both the mean and variance of a nonparametric regression problem. The key point is that we are able to estimate variance locally, unlike standard Gaussian Process regression or SVMs. This means that our estimator adapts to the local noise. The problem is cast in the setting of maximum a posteriori estimation in exponential families. Unlike previous work, we obtain a convex optimization problem, which can be solved via Newton's method.
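The exponential-family view can be illustrated on a single Gaussian: the negative log-likelihood is convex in the natural parameters, so Newton's method applies directly. A minimal sketch under that reading (the paper itself estimates such parameters locally, per input, via kernel expansions; all names below are ours):

```python
import numpy as np

def fit_gaussian_natural(y, iters=50):
    """Fit a Gaussian's natural parameters (t1, t2) = (mu/s2, -1/(2*s2))
    by guarded Newton on the negative log-likelihood, which is convex in
    this parameterization."""
    n = len(y)
    stats = np.array([y.sum(), (y ** 2).sum()])    # sufficient statistics
    theta = np.array([0.0, -0.5])                  # start at mu=0, s2=1
    for _ in range(iters):
        mu = -theta[0] / (2.0 * theta[1])          # implied mean
        s2 = -1.0 / (2.0 * theta[1])               # implied variance
        grad = n * np.array([mu, mu ** 2 + s2]) - stats
        # Hessian = n * covariance of the sufficient statistics (y, y^2)
        H = n * np.array([[s2, 2 * mu * s2],
                          [2 * mu * s2, 4 * mu ** 2 * s2 + 2 * s2 ** 2]])
        step = np.linalg.solve(H, grad)
        t = 1.0
        while theta[1] - t * step[1] >= 0:         # damp to keep t2 negative
            t *= 0.5                               # (i.e. variance positive)
        theta = theta - t * step
    return -theta[0] / (2 * theta[1]), -1.0 / (2 * theta[1])

# Example: recovers roughly mean 5 and variance 4.
rng = np.random.default_rng(0)
print(fit_gaussian_natural(5 + 2 * rng.standard_normal(1000)))
```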
We present a method for performing transductive inference on very large datasets. Our algorithm is based on multiclass Gaussian processes and is effective whenever the multiplication of the kernel matrix or its inverse with a vector can be computed sufficiently fast. This holds, for instance, for certain graph and string kernels. Transduction is achieved by variational inference over the unlabeled data subject to a balancing constraint.
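The computational requirement can be read as: only matrix-vector products with the kernel matrix are needed, so iterative solvers like conjugate gradient work without ever forming the matrix. A sketch of that pattern (ours, not the paper's code), with a toy O(n) matvec standing in for a fast graph-kernel product:

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, cg

def solve_with_kernel(kernel_matvec, b):
    """Solve K x = b for a symmetric positive definite kernel matrix K,
    given only a routine computing v -> K @ v."""
    n = len(b)
    K = LinearOperator((n, n), matvec=kernel_matvec)
    x, info = cg(K, b)
    assert info == 0, "CG did not converge"
    return x

# Toy example: K = I + L for a chain-graph Laplacian L, whose matvec is
# O(n) -- the kind of fast product the method relies on.
def matvec(v):
    Lv = np.zeros_like(v)
    Lv[:-1] += v[:-1] - v[1:]
    Lv[1:] += v[1:] - v[:-1]
    return v + Lv

print(solve_with_kernel(matvec, np.ones(100))[:5])
```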
ISBN (print): 0262195348
We propose a convex optimization based strategy to deal with uncertainty in the observations of a classification problem. We assume that instead of a sample (x_i, y_i) a distribution over (x_i, y_i) is specified. In particular, we derive a robust formulation for the case where the distribution is a normal distribution. This leads to a Second Order Cone Programming formulation. Our method is applied to the problem of missing data, where it outperforms direct imputation.
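Under the normality assumption the robust requirement becomes a second-order cone constraint: each example's mean must be classified with margin 1 plus a kappa-scaled norm of Sigma_i^{1/2} w. A sketch of such a formulation in cvxpy (variable names and the soft-margin objective are our assumptions, not necessarily the paper's exact program):

```python
import cvxpy as cp
import numpy as np

def robust_svm(mu, sigma_sqrt, y, kappa=1.0, C=1.0):
    """Each example i is a distribution N(mu[i], Sigma_i), with sigma_sqrt[i]
    a square root of Sigma_i; kappa controls the required confidence."""
    n, d = mu.shape
    w, b = cp.Variable(d), cp.Variable()
    xi = cp.Variable(n, nonneg=True)               # slack variables
    cons = [y[i] * (mu[i] @ w + b)
            >= 1 - xi[i] + kappa * cp.norm(sigma_sqrt[i] @ w, 2)
            for i in range(n)]                      # second-order cone constraints
    obj = cp.Minimize(0.5 * cp.sum_squares(w) + C * cp.sum(xi))
    cp.Problem(obj, cons).solve()
    return w.value, b.value
```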
We propose a family of kernels based on the Binet-Cauchy theorem and its extension to Fredholm operators. This includes as special cases all currently known kernels derived from the behavioral framework, diffusion processes, marginalized kernels, kernels on graphs, and the kernels on sets arising from the subspace angle approach. Many of these kernels can be seen as the extrema of a new continuum of kernel functions, which leads to numerous new special cases. As an application, we apply the new class of kernels to the problem of clustering video sequences, with encouraging results.
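The simplest member of this family is the determinant kernel k(A, B) = det(AᵀB); for orthonormal bases it equals, up to sign, the product of the cosines of the principal angles between the two subspaces, which is the subspace-angle special case the abstract mentions. A minimal sketch (ours, not the full Fredholm-operator construction):

```python
import numpy as np

def binet_cauchy_det_kernel(A, B):
    """k(A, B) = det(A^T B) for two n-by-k matrices. For orthonormal
    columns, the singular values of A^T B are the cosines of the
    principal angles, so the determinant is their product up to sign."""
    return np.linalg.det(A.T @ B)

# Example: two random 5-dimensional subspaces of R^20.
rng = np.random.default_rng(0)
A, _ = np.linalg.qr(rng.standard_normal((20, 5)))
B, _ = np.linalg.qr(rng.standard_normal((20, 5)))
print(binet_cauchy_det_kernel(A, B))
```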
ISBN (print): 1577351894
We present a fast iterative support vector training algorithm for a large variety of different formulations. It works by incrementally changing a candidate support vector set using a greedy approach, until the supporting hyperplane is found within a finite number of iterations. It is derived from a simple active set method which sweeps through the set of Lagrange multipliers and keeps optimality in the unconstrained variables, while discarding large numbers of bound-constrained variables. The hard-margin version can be viewed as a simple (yet computationally crucial) modification of the incremental SVM training algorithms of Cauwenberghs and Poggio. Experimental results for various settings are reported. In all cases our algorithm is considerably faster than competing methods such as Sequential Minimal Optimization or the Nearest Point Algorithm.
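Schematically, the greedy candidate-set loop looks as follows (a sketch under our reading of the abstract; `solve_subproblem` stands in for the small optimal solve on the current candidate set and is not the paper's exact routine):

```python
import numpy as np

def greedy_active_set_svm(K, y, solve_subproblem, max_iter=1000, tol=1e-6):
    """Grow a candidate support vector set greedily: solve optimally on the
    current set, find the worst margin violator, add it, repeat until no
    point violates the (hard) margin conditions."""
    n = len(y)
    S = [int(np.argmax(y == 1)), int(np.argmax(y == -1))]  # seed: one per class
    alpha, b = np.zeros(n), 0.0
    for _ in range(max_iter):
        alpha_S, b = solve_subproblem(K[np.ix_(S, S)], y[S])  # optimal on S
        alpha[:] = 0.0
        alpha[S] = alpha_S
        margins = y * (K @ (alpha * y) + b)       # functional margins
        viol = int(np.argmin(margins))            # worst KKT violator
        if margins[viol] >= 1 - tol:              # none left: supporting
            break                                 # hyperplane found
        S.append(viol)                            # grow the candidate set
    return alpha, b
```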
The paper provides the results of a performance comparison study of two symbolic learning programs, both based on the AQ15c learning algorithm. The first program uses a single representation space, while the second one utilizes constructive induction, which changes the representation space. The performance of the compared systems was analyzed using three empirical error rates: the overall, commission, and omission error rates. These were determined by applying the hold-out, 10-fold, and leave-one-out sampling methods. Both systems' performance was calculated for individual stages in a multi-stage knowledge-acquisition process, and learning curves and their envelopes were prepared. The study was conducted using a set of 384 optimal designs of wind bracing in steel skeleton structures of tall buildings. The research methodology and the two learning systems used in the experiments are described, all numerical results are provided, and the conclusions of the research are given. Copyright (C) 1996 IJCAI Inc.
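For reference, the three error rates can be computed as below, under our reading of the AQ terminology (an example the rules leave unclassified is an omission error; one they classify wrongly is a commission error):

```python
def error_rates(y_true, y_pred):
    """Return (overall, commission, omission) error rates for a rule
    learner. Entries of None in y_pred mean 'no rule fired'."""
    n = len(y_true)
    omission = sum(p is None for p in y_pred) / n
    commission = sum(p is not None and p != t
                     for p, t in zip(y_pred, y_true)) / n
    return omission + commission, commission, omission

# Example: one wrong label and one unclassified example out of four.
print(error_rates(["a", "b", "a", "b"], ["a", "a", None, "b"]))
```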