Search Results - Inner Mongolia University Library

A hierarchical prior for generalized linear models based on predictions for the mean response

BIOSTATISTICS, 2022, Vol. 23, No. 4, pp. 1165-1181

Authors: Alt, Ethan M.; Psioda, Matthew A.; Ibrahim, Joseph G. Brigham & Womens Hosp, Div Pharmacoepidemiol & Pharmacoecon, 1620 Tremt St, Suite 3030, Boston, MA 02120, USA; Harvard Med Sch, 1620 Tremt St, Suite 3030, Boston, MA 02120, USA; Univ N Carolina, Dept Biostat, 135 Dauer Dr, Chapel Hill, NC 27599, USA

There has been increased interest in using prior information in statistical analyses. For example, in rare diseases, it can be difficult to establish treatment efficacy based solely on data from a prospective study due to low sample sizes. To overcome this issue, an informative prior for the treatment effect may be elicited. We develop a novel extension of the conjugate prior that enables practitioners to elicit a prior prediction for the mean response for generalized linear models, treating the prediction as random. We refer to the hierarchical prior as the hierarchical prediction prior (HPP). For independent and identically distributed settings and the normal linear model, we derive cases for which the hyperprior is a conjugate prior. We also develop an extension of the HPP in situations where summary statistics from a previous study are available. The HPP allows for discounting based on the quality of individual level predictions, and simulation results suggest that, compared to the conjugate prior and the power prior, the HPP yields efficiency gains (e.g., lower mean squared error) where predictions are incompatible with the data. An efficient Markov chain Monte Carlo algorithm is developed. Applications illustrate that inferences under the HPP are more robust to prior-data conflict compared to selected nonhierarchical priors.

Keywords: Bayesian inference; generalized linear models; hierarchical prior; hyperprior

Regularized Generalized Linear Models to Disclose Host-Microbiome Associations in Colorectal Cancer

6th International Conference on Mathematics and Statistics, ICoMS 2023

Authors: Ibrahimi, Eliana; Norouzirad, Mina; Meto, Melisa; Lopes, Marta B. Department of Biology, University of Tirana, Albania; Portugal; Portugal

ISBN: (print) 9798400700187

Recent studies have shown that the gut microbiome is associated with colorectal cancer (CRC) progression and anti-cancer therapy efficacy. This study aims to optimize the ridge, elastic net, and lasso regularized generalized linear models (GLM), widely used for supervised machine learning, for multiclass classification tasks (healthy/adenoma/carcinoma). The models are applied to a benchmark gut microbiome dataset using raw and transformed data. A cross-validation procedure is used to select an optimal value for the shrinkage parameter, λ. The results show a higher accuracy of the ridge and elastic net models compared to the lasso model. We confirm known associations of several microbiome genera with CRC and adenoma. These findings are expected to contribute to the definition of CRC-microbiome signatures to be further validated in microbiome-related therapy studies. © 2023 ACM.

Keywords: Colorectal Cancer; Elastic net; generalized linear models; Gut Microbiome; Lasso; Ridge
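
For readers unfamiliar with how ridge, lasso, and elastic net relate, the following is a minimal Python sketch of the penalty term controlled by the shrinkage parameter λ. It is not the authors' code. The mixing weight `alpha` and the factor of one half on the squared term follow a common glmnet-style parameterization, which is an assumption here, and the function names are hypothetical.

```python
def elastic_net_penalty(coefs, lam, alpha):
    """Penalty lam * (alpha * ||b||_1 + (1 - alpha) / 2 * ||b||_2^2).

    alpha = 1 reduces to the lasso penalty, alpha = 0 to the ridge penalty.
    The parameterization is an assumption, not taken from the abstract.
    """
    l1 = sum(abs(b) for b in coefs)
    l2_squared = sum(b * b for b in coefs)
    return lam * (alpha * l1 + 0.5 * (1.0 - alpha) * l2_squared)


def soft_threshold(z, t):
    """Soft-thresholding operator, the building block of lasso-type updates.

    Values with |z| <= t are set exactly to zero, which is why the lasso
    and elastic net can drop predictors while ridge only shrinks them.
    """
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


if __name__ == "__main__":
    beta = [1.0, -2.0]
    for name, alpha in [("ridge", 0.0), ("elastic net", 0.5), ("lasso", 1.0)]:
        print(name, elastic_net_penalty(beta, lam=0.5, alpha=alpha))
```

In practice λ would be chosen by cross-validation, as the abstract describes; that step is omitted from this sketch.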

Empirical Likelihood in Generalized Linear Models with Working Covariance Matrix

Acta Mathematicae Applicatae Sinica, 2022, Vol. 38, No. 1, pp. 87-97

Authors: Xiu-qing ZHOU; Qi-bing GAO; Chun-hua ZHU; Xiu-li DU; Liu-liu MAO. School of Mathematical Science, Nanjing Normal University, Nanjing 210097, China; School of Statistics and Mathematics, Nanjing Audit University, Nanjing 211815, China

Empirical likelihood in generalized linear models with multivariate responses and working covariance matrix is *** the weakest assumption on eigenvalues of Fisher's information matrix and some other regular conditions, we prove that the non-parametric Wilks' property still holds, that is, the empirical log-likelihood ratio at the true parameter values converges to the standard chi-square *** simulations are given to verify our theoretical result.

Keywords: generalized linear models; empirical likelihood; multivariate response; working covariance matrix

Incorporating spatial structure into inclusion probabilities for Bayesian variable selection in generalized linear models with the spike-and-slab elastic net

JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2022, Vol. 217, pp. 141-152

Authors: Leach, Justin M.; Aban, Inmaculada; Yi, Nengjun. Univ Alabama Birmingham, Dept Biostat, Sch Publ Hlth, 1665 Univ Blvd, Birmingham, AL 35233, USA

Spike-and-slab priors model predictors as arising from a mixture of distributions: those that should (slab) or should not (spike) remain in the model. The spike-and-slab lasso (SSL) is a mixture of double exponentials, extending the single lasso penalty by imposing different penalties on parameters based on their inclusion probabilities. The SSL was extended to generalized linear models (GLM) for application in genetics/genomics, and can handle many highly correlated predictors of a scalar outcome, but does not incorporate these relationships into variable selection. When images/spatial data are used to model a scalar outcome, relevant parameters tend to cluster spatially, and model performance may benefit from incorporating spatial structure into variable selection. We propose to incorporate spatial information by assigning intrinsic autoregressive priors to the logit prior probabilities of inclusion, which results in more similar shrinkage penalties among spatially adjacent parameters. Using MCMC to fit Bayesian models can be computationally prohibitive for large-scale data, but we fit the model by adapting a computationally efficient coordinate-descent-based EM algorithm. A simulation study and an application to Alzheimer's Disease imaging data show that incorporating spatial information can improve model fitness. (C) 2021 Elsevier B.V. All rights reserved.

Keywords: Spike-and-slab; Bayesian variable selection; Penalized likelihood; generalized linear models; Elastic net; Alzheimer's disease
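
The phrase "different penalties on parameters based on their inclusion probabilities" can be made concrete with a small Python sketch. It assumes the usual spike-and-slab lasso setup: a narrow double-exponential spike with scale `s0`, a wide slab with scale `s1`, and a prior inclusion probability `theta`. These names and the numeric values are illustrative assumptions, and the spatial (intrinsic autoregressive) prior on the logit of `theta` proposed in the paper is not implemented here.

```python
import math


def double_exponential_pdf(beta, scale):
    """Density of a zero-centred double-exponential (Laplace) distribution."""
    return math.exp(-abs(beta) / scale) / (2.0 * scale)


def inclusion_probability(beta, theta, s0, s1):
    """Conditional probability that beta came from the slab, not the spike."""
    slab = theta * double_exponential_pdf(beta, s1)
    spike = (1.0 - theta) * double_exponential_pdf(beta, s0)
    return slab / (slab + spike)


def expected_penalty(beta, theta, s0, s1):
    """Expected inverse scale, used as a coefficient-specific L1 penalty.

    Coefficients that look like spike members get a penalty near 1/s0
    (strong shrinkage); slab members get a penalty near 1/s1 (weak).
    """
    p = inclusion_probability(beta, theta, s0, s1)
    return (1.0 - p) / s0 + p / s1


if __name__ == "__main__":
    for beta in (0.0, 0.2, 2.0):
        p = inclusion_probability(beta, theta=0.5, s0=0.1, s1=1.0)
        w = expected_penalty(beta, theta=0.5, s0=0.1, s1=1.0)
        print(f"beta={beta:4.1f}  inclusion={p:.4f}  penalty={w:.4f}")
```

Larger coefficients receive a higher inclusion probability and therefore a lighter penalty, which is the adaptive shrinkage behaviour the abstract refers to.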

Multivariate Generalized Linear Models for Twin and Family Data

BEHAVIOR GENETICS, 2022, Vol. 52, No. 2, pp. 123-140

Authors: Bonat, Wagner Hugo; Hjelmborg, Jacob V. B. Univ Fed Parana, Dept Stat, Curitiba, Parana, Brazil; Univ Southern Denmark, Dept Epidemiol & Biostat, Odense, Denmark

Multivariate twin and family studies are one of the most important tools to assess disease inheritance as well as to study the interrelationship between genetics and environment. The multivariate analysis of twin and family data is in general based on structural equation modelling or linear mixed models that essentially decompose sources of covariation as originally suggested by Fisher. In this paper, we propose a flexible and unified statistical modelling framework for analysing multivariate Gaussian and non-Gaussian twin and family data. The non-normality is taken into account by actually modelling the mean and variance relationship, while the covariance structure is modelled by means of a linear covariance model including the option to model the dispersion components as functions of known covariates in a regression model fashion. The marginal specification of our models allows us to extend classic models and biometric indices such as the bivariate heritability, genetic, environmental and phenotypic correlations to non-Gaussian data. We illustrate the proposed models through simulation studies and six data analyses and provide computational implementation in R through the package mglm4twin.

Keywords: Estimating functions; generalized linear models; Multivariate regression; Twin and family data

The Asymptotic Properties of SCAD Penalized Generalized Linear Models with Adaptive Designs

Journal of Systems Science & Complexity, 2021, Vol. 34, No. 2, pp. 759-773

Authors: GAO Qibing; ZHU Chunhua; DU Xiuli; ZHOU Xingcai; YIN Dingxin. School of Mathematics Science, Nanjing Normal University, Nanjing 210023, China; School of Statistics and Mathematics, Nanjing Audit University, Nanjing 211815, China

This paper discusses the asymptotic properties of the SCAD (smoothly clipped absolute deviation) penalized quasi-likelihood estimator for generalized linear models with adaptive designs, which extend the related results for independent observations to dependent *** certain conditions, the authors proved that the SCAD penalized method correctly selects covariates with nonzero coefficients with probability converging to one, and the penalized quasi-likelihood estimators of non-zero coefficients have the same asymptotic distribution they would have if the zero coefficients were known in *** is, the SCAD estimator has consistency and oracle *** last, the results are illustrated by some simulations.

Keywords: Adaptive designs; generalized linear models; oracle properties; SCAD penalty function
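
As background for the SCAD penalty named in this record, here is a minimal Python sketch of the standard SCAD penalty function in its usual three-piece form. The default `a = 3.7` is the value conventionally used in the literature and is an assumption here, not something stated in the abstract; the function name is hypothetical.

```python
def scad_penalty(theta, lam, a=3.7):
    """Standard SCAD penalty evaluated at a single coefficient.

    Near zero it behaves like the lasso (linear in |theta|), then bends
    quadratically, and finally becomes constant so that large
    coefficients are not shrunk further.
    """
    if lam <= 0 or a <= 2:
        raise ValueError("requires lam > 0 and a > 2")
    t = abs(theta)
    if t <= lam:
        return lam * t
    if t <= a * lam:
        return (2.0 * a * lam * t - t * t - lam * lam) / (2.0 * (a - 1.0))
    return lam * lam * (a + 1.0) / 2.0


if __name__ == "__main__":
    for theta in (0.0, 0.5, 1.0, 2.0, 3.7, 10.0):
        print(theta, round(scad_penalty(theta, lam=1.0), 4))
```

The flat region for large coefficients is what allows SCAD estimators to avoid the bias of the lasso, which is connected to the oracle property discussed in the abstract.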

Penalized empirical likelihood for generalized linear models with longitudinal data

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, Vol. 50, No. 2, pp. 608-623

Authors: Tan, Xiaoyan; Yan, Li. Shaanxi Normal Univ, Sch Math & Informat Sci, Xian 710119, Peoples R China

Penalized empirical likelihood for generalized linear models with longitudinal data is considered. It is shown that the penalized empirical likelihood estimators have the oracle property. We also show that the asymptotic distribution of the penalized empirical likelihood ratio test statistic is a chi-square distribution. The finite sample performance of the proposed method is evaluated by some simulations and a real data example.

Keywords: generalized linear models; Hypothesis testing; Longitudinal data; Penalized empirical likelihood; Variable selection

Detecting Significant Differences Between Information Retrieval Systems via Generalized Linear Models

31st ACM International Conference on Information and Knowledge Management (CIKM)

Authors: Faggioli, Guglielmo; Ferro, Nicola; Fuhr, Norbert. Univ Padua, Padua, Italy; Univ Duisburg Essen, Essen, Germany

ISBN: (print) 9781450392365

Being able to compare Information Retrieval (IR) systems correctly is pivotal to improving their quality. Among the most popular tools for statistical significance testing are the t-test and ANOVA, which belong to the linear models family. Therefore, given the relevance of linear models for IR evaluation, a great effort has been devoted to studying how to improve them to better compare IR systems. Linear models rely on assumptions that IR experimental observations rarely meet, e.g. about the normality of the data or the linearity itself. Even though linear models are, in general, resilient to violations of their assumptions, departing from them might reduce the effectiveness of the tests. Hence, we investigate the use of the Generalized Linear Model (GLM) framework, a generalization of traditional linear modelling that relaxes assumptions about the distribution and the shape of the models. To the best of our knowledge, there has been little or no investigation on the use of GLMs for comparing IR system performance. We discuss how GLMs work and how they can be applied in the context of IR evaluation. In particular, we focus on the link function used to build GLMs, which allows for the model to have non-linear shapes. We conduct a thorough experimentation using two TREC collections and several evaluation measures. Overall, we show how the log and logit links are able to identify more, and more consistent, significant differences (up to 25% more with 50 topics) than the identity link used today, with a comparable or slightly better risk of publication bias.

Keywords: Information Retrieval Evaluation generalized linear models
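
To illustrate what the identity, log, and logit links mentioned in this abstract do, the following minimal Python sketch defines each link and its inverse. The function names and the example linear-predictor values are hypothetical, and no model fitting is shown.

```python
import math


def identity_link(mu):
    """Identity link: the linear predictor equals the mean (classic linear model)."""
    return mu


def log_link(mu):
    """Log link: requires a positive mean."""
    return math.log(mu)


def logit_link(mu):
    """Logit link: requires a mean strictly between 0 and 1."""
    return math.log(mu / (1.0 - mu))


def inverse_log(eta):
    """Maps any real linear predictor to a positive mean."""
    return math.exp(eta)


def inverse_logit(eta):
    """Maps any real linear predictor to a mean in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))


if __name__ == "__main__":
    # An effectiveness score bounded in [0, 1] stays in range under the
    # logit link, whereas the identity link can predict values outside it.
    for eta in (-3.0, 0.0, 1.5):
        print(eta, identity_link(eta), inverse_log(eta), inverse_logit(eta))
```

Since many IR evaluation measures are bounded between 0 and 1, a link whose inverse respects those bounds is one plausible reason a non-identity link might fit such data better; the abstract itself reports only the empirical comparison.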

Model averaging for generalized linear models in fragmentary data prediction

Statistical Theory and Related Fields, 2022, Vol. 6, No. 4, pp. 344-352

Authors: Chaoxia Yuan; Yang Wu; Fang Fang. KLATASDS-MOE, School of Statistics, East China Normal University, Shanghai, People's Republic of China

Fragmentary data is becoming more and more popular in many areas which brings big challenges to researchers and data *** existing methods dealing with fragmentary data consider a continuous response while in many applications the response variable is *** this paper, we propose a model averaging method for generalized linear models in fragmentary data *** candidate models are fitted based on different combinations of covariate availability and sample *** optimal weight is selected by minimizing the Kullback-Leibler loss in the completed cases and its asymptotic optimality is *** evidences from a simulation study and a real data analysis about Alzheimer disease are presented.

Keywords: Asymptotic optimality; fragmentary data; generalized linear models; model averaging

Truthful and privacy-preserving generalized linear models

INFORMATION AND COMPUTATION, 2024, Vol. 301

Authors: Qiu, Yuan; Liu, Jinyan; Wang, Di. Georgia Inst Technol, Coll Comp, Atlanta, GA, USA; Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China; Provable Responsible AI & Data Analyt Lab, Thuwal, Saudi Arabia; SDAIA KAUST Ctr Excellence Data Sci & Artificial, Thuwal, Saudi Arabia; King Abdullah Univ Sci & Technol, Div CEMSE, Thuwal, Saudi Arabia

This paper explores estimating generalized linear models (GLMs) when agents are strategic and privacy-conscious. We aim to design mechanisms that encourage truthful reporting, protect privacy, and ensure outputs are close to the true parameters. Initially, we address models with sub-Gaussian covariates and heavy-tailed responses with finite fourth moments, proposing a novel private, closed-form estimator. Our mechanism features: (1) o(1)-joint differential privacy with high probability; (2) o(1/n)-approximate Bayes Nash equilibrium for a (1 - o(1))-fraction of agents; (3) o(1) error in parameter estimation; (4) individual rationality for a (1 - o(1))-fraction of agents; (5) o(1) payment budget. We then extend our approach to linear regression with heavy-tailed data, using an ℓ4-norm shrinkage operator to propose a similar estimator and payment scheme. (c) 2024 Elsevier Inc. All rights are reserved, including those for text and data mining, AI training, and similar technologies.

Keywords: generalized linear models; Bayesian game; Differential privacy; Truthful mechanism design
