版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Georgia Inst Technol Sch Ind & Syst Engn 755 Ferst Dr Atlanta GA 30332 USA
出 版 物:《WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS》 (威立跨学科评论:统计计算)
年 卷 期:2019年第11卷第1期
页 面:e1451-e1451页
核心收录:
学科分类:0202[经济学-应用经济学] 02[经济学] 020208[经济学-统计学] 07[理学] 0714[理学-统计学(可授理学、经济学学位)]
基 金:National Science Foundation [DMS-1613152, CCF-1740776] Transdisciplinary Research Institute for Advancing Data Science (TRIAD)
主 题:aggregated inference averaging estimator distributed statistical inference M-estimation one-step estimator
摘 要:Aggregated inference on distributed data becomes more and more important due to the larger size of data collected in different industries. Modeling and inference are needed in the case where data cannot be obtained at a central location;aggregated statistical inference is a major tool to solve the aforementioned problems. In the literature, problems under the setting of regression model (more generally, M-estimator) are extensively studied. There are at least two popular techniques for distributed estimation: (a) averaging estimators from local locations and (b) the one-step approach, which combines the simple averaging estimator with a classical Newton s method (using the local Hessian matrices) to generate a one-step estimator. It is proved that under certain assumptions, the above constructed estimators enjoy the same asymptotic properties as the centralized estimator, which is obtained as if all data were available at a central location. We review the aforementioned two major estimations. It can be seen that, in Big-Data problems, dividing the data to multiple machines and then using the aggregation technique to solve the estimation problem in parallel can speed up the computation with little compromise of the quality of the estimators. We discuss potential extensions to other models, such as support vector machine, principle component analysis, and so on. Numerical examples are omitted due to the space limitation;they can be easily found in the literature. This article is categorized under: Statistical Learning and Exploratory Methods of the Data Sciences Knowledge Discovery Statistical Learning and Exploratory Methods of the Data Sciences Modeling Methods Statistical Models Fitting Models Statistical and Graphical Methods of Data Analysis Modeling Methods and Algorithms