检索结果-内蒙古大学图书馆

The Journal of machine learning Research 2017年第1期18卷

作者： Gregory Darnell Stoyan Georgiev Sayan Mukherjee Barbara E. Engelhardt Lewis-Sigler Institute Princeton University Princeton NJ Google Palo Alto CA Departments of Statistical Science Mathematics and Computer Science Duke University Durham NC Department of Computer Science Center for Statistics and Machine Learning Princeton University Princeton NJ

The scalability of statistical estimators is of increasing importance in modern applications. One approach to implementing scalable algorithms is to compress data into a low dimensional latent space using dimension reduction methods. In this paper, we develop an approach for dimension reduction that exploits the assumption of low rank structure in high dimensional data to gain both computational and statistical advantages. We adapt recent randomized low-rank approximation algorithms to provide an efficient solution to principal component analysis (PCA), and we use this efficient solver to improve estimation in large-scale linear mixed models (LMM) for association mapping in statistical genomics. A key observation in this paper is that randomization serves a dual role, improving both computational and statistical performance by implicitly regularizing the covariance matrix estimate of the random effect in an LMM. These statistical and computational advantages are highlighted in our experiments on simulated data and large-scale genomic studies.

关键词： Krylov subspace methods dimension reduction generalized eigendecompositon genomics linear mixed models low-rank random projections randomized algorithms supervised

来源：评论

学校读者我要写书评

暂无评论

Author Correction: π-HuB: the proteomic navigator of the human body

引用

Nature 2025年第8046期637卷 E22页

作者： Fuchu He Ruedi Aebersold Mark S Baker Xiuwu Bian Xiaochen Bo Daniel W Chan Cheng Chang Luonan Chen Xiangmei Chen Yu-Ju Chen Heping Cheng Ben C Collins Fernando Corrales Jürgen Cox Weinan E Jennifer E Van Eyk Jia Fan Pouya Faridi Daniel Figeys George Fu Gao Wen Gao Zu-Hua Gao Keisuke Goda Wilson Wen Bin Goh Dongfeng Gu Changjiang Guo Tiannan Guo Yuezhong He Albert J R Heck Henning Hermjakob Tony Hunter Narayanan Gopalakrishna Iyer Ying Jiang Connie R Jimenez Lokesh Joshi Neil L Kelleher Ming Li Yang Li Qingsong Lin Cui Hua Liu Fan Liu Guang-Hui Liu Yansheng Liu Zhihua Liu Teck Yew Low Ben Lu Matthias Mann Anming Meng Robert L Moritz Edouard Nice Guang Ning Gilbert S Omenn Christopher M Overall Giuseppe Palmisano Yaojin Peng Charles Pineau Terence Chuen Wai Poon Anthony W Purcell Jie Qiao Roger R Reddel Phillip J Robinson Paola Roncada Chris Sander Jiahao Sha Erwei Song Sanjeeva Srivastava Aihua Sun Siu Kwan Sze Chao Tang Liujun Tang Ruijun Tian Juan Antonio Vizcaíno Chanjuan Wang Chen Wang Xiaowen Wang Xinxing Wang Yan Wang Tobias Weiss Mathias Wilhelm Robert Winkler Bernd Wollscheid Limsoon Wong Linhai Xie Wei Xie Tao Xu Tianhao Xu Liying Yan Jing Yang Xiao Yang John Yates Tao Yun Qiwei Zhai Bing Zhang Hui Zhang Lihua Zhang Lingqiang Zhang Pingwen Zhang Yukui Zhang Yu Zi Zheng Qing Zhong Yunping Zhu State Key Laboratory of Medical Proteomics Beijing Proteome Research Center National Center for Protein Sciences (Beijing) Beijing Institute of Lifeomics Beijing China. hefc@***. International Academy of Phronesis Medicine (Guangdong) Guangdong China. hefc@***. Department of Biology Institute of Molecular Systems Biology ETH Zurich Zurich Switzerland. aebersold@imsb.biol.ethz.ch. Macquarie Medical School Macquarie University Sydney New South Wales Australia. Institute of Pathology and Southwest Cancer Center Southwest Hospital Third Military Medical University (Army Medical University) and Key Laboratory of Tumor Immunopathology Ministry of Education of China Chongqing China. Institute of Health Service and Transfusion Medicine Beijing China. Department of Pathology and The Sidney Kimmel Comprehensive Cancer Center Johns Hopkins University Baltimore MD USA. State Key Laboratory of Medical Proteomics Beijing Proteome Research Center National Center for Protein Sciences (Beijing) Beijing Institute of Lifeomics Beijing China. Key Laboratory of Systems Biology Center for Excellence in Molecular Cell Science Shanghai Institute of Biochemistry and Cell Biology Chinese Academy of Sciences Shanghai China. Department of Nephrology First Medical Center of Chinese PLA General Hospital Nephrology Institute of the Chinese People's Liberation Army State Key Laboratory of Kidney Diseases National Clinical Research Center for Kidney Diseases Beijing Key Laboratory of Kidney Disease Research Beijing China. Institute of Chemistry Academia Sinica Taipei China. National Biomedical Imaging Center State Key Laboratory of Membrane Biology Institute of Molecular Medicine Peking-Tsinghua Center for Life Sciences College of Future Technology Peking University Beijing China. School of Biological Sciences Queen's University of Belfast Belfast UK. Functional Proteomics Laboratory Centro Nacional de Biotecnología-CSIC Madrid Spain. Computational Systems Biochemistry Research Group Ma

来源：评论

学校读者我要写书评

暂无评论

Bayesian group factor analysis with structured sparsity

The Journal of Machine Learning Research

引用

The Journal of machine learning Research 2016年第1期17卷

作者： Kevin Murphy Bernhard Schölkopf Shiwen Zhao Chuan Gao Sayan Mukherjee Barbara E. Engelhardt Google MPI for Intelligent Systems Computational Biology and Bioinformatics Program Department of Statistical Science Duke University Durham NC Department of Statistical Science Duke University Durham NC Departments of Statistical Science Computer Science Mathematics Duke University Durham NC Department of Computer Science Center for Statistics and Machine Learning Princeton University Princeton NJ

Latent factor models are the canonical statistical tool for exploratory analyses of low-dimensional linear structure for a matrix of p features across n samples. We develop a structured Bayesian group factor analysis model that extends the factor model to multiple coupled observation matrices; in the case of two observations, this reduces to a Bayesian model of canonical correlation analysis. Here, we carefully define a structured Bayesian prior that encourages both element-wise and column-wise shrinkage and leads to desirable behavior on high-dimensional data. In particular, our model puts a structured prior on the joint factor loading matrix, regularizing at three levels, which enables element-wise sparsity and unsupervised recovery of latent factors corresponding to structured variance across arbitrary subsets of the observations. In addition, our structured prior allows for both dense and sparse latent factors so that covariation among either all features or only a subset of features can be recovered. We use fast parameter-expanded expectation-maximization for parameter estimation in this model. We validate our method on simulated data with substantial structure. We show results of our method applied to three high-dimensional data sets, comparing results against a number of state-of-the-art approaches. These results illustrate useful properties of our model, including i) recovering sparse signal in the presence of dense effects; ii) the ability to scale naturally to large numbers of observations; iii) flexible observation- and factor-specific regularization to recover factors with a wide variety of sparsity levels and percentage of variance explained; and iv) tractable inference that scales to modern genomic and text data sizes.

关键词： bayesian structured sparsity canonical correlation analysis mixture models parameter expansion sparse and low-rank matrix decomposition sparse priors

来源：评论

学校读者我要写书评

暂无评论

Smoothing multivariate performance measures

The Journal of Machine Learning Research

引用

The Journal of machine learning Research 2012年第1期13卷

作者： Xinhua Zhang Ankan Saha S. V. N. Vishwanathan Machine Learning Group NICTA Canberra Australia and Department of Computing Science University of Alberta Alberta Innovates Center for Machine Learning Edmonton Alberta Canada Department of Computer Science University of Chicago Chicago IL Departments of Statistics and Computer Science Purdue University West Lafayette IN

Optimizing multivariate performance measure is an important task in machine learning. Joachims (2005) introduced a Support Vector Method whose underlying optimization problem is commonly solved by cutting plane methods (CPMs) such as SVM-Perf and BMRM. It can be shown that CPMs converge to an ε accurate solution in O(1/λε) iterations, where λ is the trade-off parameter between the regularizer and the loss function. Motivated by the impressive convergence rate of CPM on a number of practical problems, it was conjectured that these rates can be further improved. We disprove this conjecture in this paper by constructing counter examples. However, surprisingly, we further discover that these problems are not inherently hard, and we develop a novel smoothing strategy, which in conjunction with Nesterov's accelerated gradient method, can find an ε accurate solution in O* (min{1/ε, 1/√λε}) iterations. Computationally, our smoothing technique is also particularly advantageous for optimizing multivariate performance scores such as precision/recall break-even point and ROCArea; the cost per iteration remains the same as that of CPMs. Empirical evaluation on some of the largest publicly available data sets shows that our method converges significantly faster than CPMs without sacrificing generalization ability.

关键词： max-margin methods multivariate performance measures non-smooth optimization smoothing support vector machines

来源：评论

学校读者我要写书评

暂无评论

AI Theory and Practice: A Discussion on Hard Challenges and Opportunities Ahead

引用

AI MAGAZINE 2010年第3期31卷 103-114页

作者： Horvitz, Eric MICROSOFT RESEARCH INSTITUTE FOR ADVANCED COMPUTER STUDIES THE UNIVERSITY OF MARYLAND COLLEGE PARK THE MACHINE LEARNING AND COMPUTER SCIENCE DEPARTMENTS CARNEGIE MELLON UNIVERSITY

A special track on directions in artificial intelligence at a Microsoft Research Faculty Summit included a panel discussion on key challenges and opportunities ahead in AI theory and practice. This article captures th... 详细信息

关键词： Artificial intelligence Hate Research & development--R&D Man machine interaction Studies

来源：评论

学校读者我要写书评

暂无评论

A Method for Reasoning with Structured and Continuous Attributes in the INLEN-2 Multistrategy Knowledge Discovery System 2

A Method for Reasoning with Structured and Continuous Attrib...

引用

2nd International Conference on Knowledge Discovery and Data Mining, KDD 1996

作者： Kaufman, Kenneth A. Michalski, Ryszard S. Machine Learning and Inference Laboratory George Mason University FairfaxVA22030 United States Gmu Departments of Computer Science and Systems Engineering Institute of Computer Science Polish Academy of Sciences Poland

ISBN: (纸本)1577350049

Structured attributes have domains (value sets) that are partially ordered sets, typically *** attributes allow knowledge discovery programs to incorporate background knowledge about hierarchical relationships among attribute *** generalization rules for structured attributes have been developed that take into consideration the type of nodes in the domain hierarchy (anchor or non-anchor) and the type of decision niles to be generated (characteristic, discriminant or minimum complexity).These generalization rules enhance the ability of knowledge discovery system INLEN-2 to exploit the semantic content of the domain knowledge in the process of generating *** the dependent attribute (e.g., a decision attribute) is structured, the system generates a system of hierarchically organized rules representing relationships between the values of this attribute and independent *** a situation often occurs in practice when the decision to be assigned to a situation can be at different levels of abstraction (e.g., this is a liver disease, or this is a liver cancer).Continuous attributes (e.g., physical measurements) are quantized into a hierarchy of values (ranges of values arranged into different levels).These methods are illustrated by an example concerning the discovery of patterns in world economics and demographics. © 1996 AAAI (***). All Rights Reserved.

关键词： Set theory

来源：评论

学校读者我要写书评

暂无评论

The AQ17-DCI system for data-driven constructive induction and its application to the analysis of world economics 9th

引用

9th International Symposium on Methodologies for Intelligent Systems, ISMIS 1996

作者： Bloedoru, Eric Michalski, Ryszard S. Machine Learning and Inference Laboratory George Mason University FairfaxVA United States GMU Departments of Computex Science and Systems Engineering the Institute of Computer Science at the Polish Academy of Sciences Warsaw Poland

ISBN: (纸本)9783540612865

Constructive induction divides the problem of learning an inductive hypothesis into two intertwined searches: one-for the "best" representation space, and two-for the "best" hypothesis in that space. In datadriven constructive induction (DCI), a learning system searches for a better representation space by analyzing the input examples (data). The presented datadriven constructive induction method combines an AQ-type learning algorithm with two classes of representation space improvement operators: constructors, and destructors. The implemented system, AQ17-DCI, has been experimentally applied to a GNP prediction problem using a World Bank database. The results show that decision rules learned by AQ17-DCI outperformed the rules learned in the original representation space both in predictive accuracy and rule simplicity. © Springer-Verlag Berlin Heidelberg 1996.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：