版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Univ Minnesota Div Biostat MMC 303420 Delaware St SE Minneapolis MN 55455 USA Univ Washington Dept Biostat Seattle WA 98195 USA Univ Washington Dept Stat Seattle WA 98195 USA
出 版 物:《STATISTICS IN MEDICINE》 (医学统计学)
年 卷 期:2019年第38卷第4期
页 面:583-600页
核心收录:
学科分类:0710[理学-生物学] 1004[医学-公共卫生与预防医学(可授医学、理学学位)] 1001[医学-基础医学(可授医学、理学学位)] 0714[理学-统计学(可授理学、经济学学位)] 10[医学]
基 金:National Institutes of Health [DP5OD009145] National Science Foundation [DMS-1252624]
主 题:additive model data-adaptive feature selection high-dimensional nonparametric regression sparse model
摘 要:In this paper, we consider fitting a flexible and interpretable additive regression model in a data-rich setting. We wish to avoid pre-specifying the functional form of the conditional association between each covariate and the response, while still retaining interpretability of the fitted functions. A number of recent proposals in the literature for nonparametric additive modeling are data adaptive, in the sense that they can adjust the level of flexibility in the functional fits to the data at hand. For instance, the sparse additive model makes it possible to adaptively determine which features should be included in the fitted model, the sparse partially linear additive model allows each feature in the fitted model to take either a linear or a nonlinear functional form, and the recent fused lasso additive model and additive trend filtering proposals allow the knots in each nonlinear function fit to be selected from the data. In this paper, we combine the strengths of each of these recent proposals into a single proposal that uses the data to determine which features to include in the model, whether to model each feature linearly or nonlinearly, and what form to use for the nonlinear functions. We establish connections between our approach and recent proposals from the literature, and we demonstrate its strengths in a simulation study.