检索结果-内蒙古大学图书馆

25th International Conference on machine learning

作者： Quadrianto, Novi Smola, Alex J. Caetano, Tiberio S. Le, Quoc V. Statistical Machine Learning NICTA and RSISE Australian National University Computer Science Department Stanford University

ISBN: (纸本)9781605582054

Consider the following problem: given sets of unlabeled observations, each set with known label proportions, predict the labels of another set of observations, also with known label proportions. This problem appears in areas like e-commerce, spam filtering and improper content detection. We present consistent estimators which can reconstruct the correct labels with high probability in a uniform convergence sense. Experiments show that our method works well in practice. Copyright 2008 by the author(s)/owner(s).

关键词： learning systems

来源：评论

学校读者我要写书评

暂无评论

The true sample complexity of active learning

The true sample complexity of active learning

引用

21st Annual Conference on learning Theory, COLT 2008

作者： Balcan, Maria-Florina Hanneke, Steve Wortman, Jennifer Computer Science Department Carnegie Mellon University United States Machine Learning Department Carnegie Mellon University United States Computer and Information Science University of Pennsylvania United States

We describe and explore a new perspective on the sample complexity of active learning. In many situations where it was generally believed that active learning does not help, we show that active learning does help in the limit, often with exponential improvements in sample complexity. This contrasts with the traditional analysis of active learning problems such as non-homogeneous linear separators or depth-limited decision trees, in which Ω(1/∈) lower bounds are common. Such lower bounds should be interpreted carefully;indeed, we prove that it is always possible to learn an ∈-good classifier with a number of samples asymptotically smaller than this. These new insights arise from a subtle variation on the traditional definition of sample complexity, not previously recognized in the active learning literature.

关键词： Decision trees

来源：评论

学校读者我要写书评

暂无评论

A detecting peak's number technique for multimodal function optimization

引用

WSEAS Transactions on Information science and Applications 2008年第2期5卷 37-43页

作者： Hua, Qiang Wu, Bin Tian, Hao Machine Learning Center Faculty of Mathematics and Computer Science Hebei University Baoding 071002 China Department of Planning and Development Handan Iron and Steel Co. Ltd. Handan 056015 China

A Detecting Peak's Number (DPN) technique is proposed for multimodal optimization. In DPN technique, we want to know the peak's number of locally multimodal domain of every individual, firstly we use the idea of orthogonal intersection for getting the exploration direction in every locally multimodal domain, and then we attempt to detect peak's number in every one-dimension direction as the result of detecting of locally multimodal domain. At last we design an evolution algorithm (DPNA) based on the characters of DNP technique, which contain four characters: niching, variable population, variable radius and life time, and then give a series of experiment results which show the effectiveness of algorithm, as the DPNA is not only adapting to obtaining multiple optima or suboptima, but also effective for problem of ill-scaled and locally multimodal domain described in [11].

关键词： Optimization

来源：评论

学校读者我要写书评

暂无评论

Online Gaussian Mixture Model for concept modeling and discovery

Online Gaussian Mixture Model for concept modeling and disco...

引用

9th International Conference on Intelligent Technologies, Intech'08

作者： Fang, Chungheng Ralescu, Anca L. University of Cincinnati Department of Computer Science Machine Learning and Computational Intelligence Laboratory Cincinnati OH 45237-0030 United States

ISBN: (纸本)9789746152969

Concept discovery and modeling are fundamental problems in machine learning research. Real world concepts are usually high-dimensional and have complicated distributions along their dimensions. Gaussian Mixture Models(GMM) have proved useful in modeling such complicated distributions. We propose a data-driven concept modeling and discovery framework using GMM, with on-line updating mechanism for fast computation suitable for real world applications. Experiments show the efficacy and efficiency of the proposed algorithm.

关键词： Gaussian distribution

来源：评论

学校读者我要写书评

暂无评论

Temozolomide displays antimigratory effects in human glioblastoma cells mediated through neuregulin-1 down-regulation

引用

Surgical Neurology 2009年第1期71卷 134-134页

作者： F. Lefranc S. Spiegl-Kreinecker B. Haibe-Kains G. Bontempi C. Decaestecker W. Berger R. Kiss Department of Neurosurgery Erasme Academic Hospital Lab. Toxicology Inst. Pharmacy ULB Brussels Belgium Department of Neurosurgery Wagner Jauregg Hospital Linz MicroArray Unit Jules Bordet Institute Machine Learning Group Department of Computer Science ULB

来源：评论

学校读者我要写书评

暂无评论

FilterBoost: Regression and classification on large datasets

FilterBoost: Regression and classification on large datasets

引用

21st Annual Conference on Neural Information Processing Systems, NIPS 2007

作者： Bradley, Joseph K. Schapire, Robert E. Machine Learning Department Carnegie Mellon University Pittsburgh PA 15213 United States Department of Computer Science Princeton University Princeton NJ 08540 United States

ISBN: (纸本)160560352X

We study boosting in the filtering setting, where the booster draws examples from an oracle instead of using a fixed training set and so may train efficiently on very large datasets. Our algorithm, which is based on a logistic regression technique proposed by Collins, Schapire, & Singer, requires fewer assumptions to achieve bounds equivalent to or better than previous work. Moreover, we give the first proof that the algorithm of Collins et al. is a strong PAC learner, albeit within the filtering setting. Our proofs demonstrate the algorithm's strong theoretical properties for both classification and conditional probability estimation, and we validate these results through extensive experiments. Empirically, our algorithm proves more robust to noise and overfitting than batch boosters in conditional probability estimation and proves competitive in classification.

关键词： Classification (of information)

来源：评论

学校读者我要写书评

暂无评论

No-regret learning in convex games 08

No-regret learning in convex games

引用

25th International Conference on machine learning

作者： Gordon, Geoffrey J. Greenwald, Amy Marks, Casey Machine Learning Department Carnegie Mellon University Pittsburgh PA 15213 United States Department of Computer Science Brown University Providence RI 02912 United States

ISBN: (纸本)9781605582054

Quite a bit is known about minimizing different kinds of regret in experts problems, and how these regret types relate to types of equilibria in the multiagent setting of repeated matrix games. Much less is known about the possible kinds of regret in online convex programming problems (OCPs), or about equilibria in the analogous multiagent setting of repeated convex games. This gap is unfortunate, since convex games are much more expressive than matrix games, and since many important machine learning problems can be expressed as OCPs. In this paper, we work to close this gap: we analyze a spectrum of regret types which lie between external and swap regret, along with their corresponding equilibria, which lie between coarse correlated and correlated equilibrium. We also analyze algorithms for minimizing these regret types. As examples of our framework, we derive algorithms for learning correlated equilibria in polyhedral convex games and extensive-form correlated equilibria in extensive-form games. The former is exponentially more efficient than previous algorithms, and the latter is the first of its type. Copyright 2008 by the author(s)/owner(s).

关键词： learning systems

来源：评论

学校读者我要写书评

暂无评论

Discovering cyclic causal models by independent components analysis

Discovering cyclic causal models by independent components a...

引用

作者： Lacerda, Gustavo Spirtes, Peter Ramsey, Joseph Hoyer, Patrik O. Machine Learning Department School of Computer Science Carnegie Mellon University Pittsburgh PA 15213 United States Department of Philosophy Carnegie Mellon University Pittsburgh PA 15213 United States Dept. of Computer Science University of Helsinki Helsinki Finland

ISBN: (纸本)0974903949

We generalize Shimizu et al's (2006) ICA-based approach for discovering linear non-Gaussian acyclic (LiNGAM) Structural Equation Models (SEMs) from causally sufficient, continuous-valued observational data. By relaxing the assumption that the generating SEM's graph is acyclic, we solve the more general problem of linear non-Gaussian (LiNG) SEM discovery. LiNG discovery algorithms output the distribution equivalence class of SEMs which, in the large sample limit, represents the population distribution. We apply a LiNG discovery algorithm to simulated data. Finally, we give sufficient conditions under which only one of the SEMs in the output class is "stable".

关键词： Independent component analysis

来源：评论

学校读者我要写书评

暂无评论

Adaptive Feature Thresholding for off-line signature verification

Adaptive Feature Thresholding for off-line signature verific...

引用

International Conference on Image and Vision Computing New Zealand, IVCNZ

作者： Robert Larkins Michael Mayo Machine Learning Group Department of Computer Science University of Waikato New Zealand

This paper introduces Adaptive Feature Thresholding (AFT) which is a novel method of person-dependent off-line signature verification. AFT enhances how a simple image feature of a signature is converted to a binary feature vector by significantly improving its representation in relation to the training signatures. The similarity between signatures is then easily computed from their corresponding binary feature vectors. AFT was tested on the CEDAR and GPDS benchmark datasets, with classification using either a manual or an automatic variant. On the CEDAR dataset we achieved a classification accuracy of 92% for manual and 90% for automatic, while on the GPDS dataset we achieved over 87% and 85% respectively. For both datasets AFT is less complex and requires fewer images features than the existing state of the art methods, while achieving competitive results.

关键词： Handwriting recognition Image converters Discrete wavelet transforms Digital images Forgery machine learning computer science Automatic testing Benchmark testing Government

来源：评论

学校读者我要写书评

暂无评论

Joint latent topic models for text and citations

Joint latent topic models for text and citations

引用

14th ACM SIGKDD International Conference on Knowledge Discovery and data Mining, KDD 2008

作者： Nallapati, Ramesh M. Ahmed, Amr Xing, Eric P. Cohen, William W. Computer Science Department Stanford University 353 Serra Mall Stanford CA 94305 United States Machine Learning Department Carnegie Mellon University 5000 Forbes Avenue Pittsburgh PA 15213 United States

ISBN: (纸本)9781605581934

In this work, we address the problem of joint modeling of text and citations in the topic modeling framework. We present two different models called the Pairwise-Link-LDA and the Link-PLSA-LDA models. The Pairwise-Link-LDA model combines the ideas of LDA [4] and Mixed Membership Block Stochastic Models [1] and allows modeling arbitrary link structure. However, the model is computationally expensive, since it involves modeling the presence or absence of a citation (link) between every pair of documents. The second model solves this problem by assuming that the link structure is a bipartite graph. As the name indicates, Link-PLSA-LDA model combines the LDA and PLSA models into a single graphical model. Our experiments on a subset of Citeseer data show that both these models are able to predict unseen data better than the baseline model of Erosheva and Lafferty [8], by capturing the notion of topical similarity between the contents of the cited and citing documents. Our experiments on two different data sets on the link prediction task show that the Link-PLSA-LDA model performs the best on the citation prediction task, while also remaining highly scalable. In addition, we also present some interesting visualizations generated by each of the models. Copyright 2008 ACM.

关键词： Stochastic models

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：