检索结果-内蒙古大学图书馆

generalized linear model based on latent factors and supervised components

COMPUTATIONAL STATISTICS 2025年第3期40卷 1475-1516页

作者： Gibaud, Julien Bry, Xavier Trottier, Catherine Univ Montpellier CNRS IMAG Montpellier France Univ Paul Valery Montpellier 3 AMIS F-34000 Montpellier France

In a context of component-based multivariate modeling we propose to model the residual dependence of the responses. Each response of a response vector is assumed to depend, through a generalized linear model, on a set of explanatory variables. The vast majority of explanatory variables are partitioned into conceptually homogeneous variable groups, viewed as explanatory themes. variables in themes are supposed many and some of them are highly correlated or even collinear. Thus, generalized linear regression demands dimension reduction and regularization with respect to each theme. Besides them, we consider a small set of "additional" covariates not conceptually linked to the themes, and demanding no regularization. Supervised Component generalized linear Regression proposed to both regularize and reduce the dimension of the explanatory space by searching each theme for an appropriate number of orthogonal components, which both contribute to predict the responses and capture relevant structural information in themes. In this paper, we introduce random latent variables (a.k.a. factors) so as to model the covariance matrix of the linear predictors of the responses conditional on the components. To estimate the model, we present an algorithm combining supervised component-based model estimation with factor model estimation. This methodology is tested on simulated data and then applied to an agricultural ecology dataset.

关键词： EM algorithm Factor model generalized linear latent variable model Multivariate generalized linear model Supervised components

来源：评论

学校读者我要写书评

暂无评论

Mixed Deep Gaussian Mixture model: a clustering model for mixed datasets

引用

ADVANCES IN DATA ANALYSIS AND CLASSIFICATION 2022年第1期16卷 31-53页

作者： Fuchs, Robin Pommeret, Denys Viroli, Cinzia Aix Marseille Univ CNRS Cent Marseille I2MMIO Marseille France Univ Lyon UCBL ISFA LSAF EA2429 Lyon France Univ Bologna Dept Stat Sci Bologna Italy

Clustering mixed data presents numerous challenges inherent to the very heterogeneous nature of the variables. A clustering algorithm should be able, despite of this heterogeneity, to extract discriminant pieces of information from the variables in order to design groups. In this work we introduce a multilayer architecture model-based clustering method called Mixed Deep Gaussian Mixture model that can be viewed as an automatic way to merge the clustering performed separately on continuous and non-continuous data. This architecture is flexible and can be adapted to mixed as well as to continuous or non-continuous data. In this sense we generalize generalized linear latent variable models and Deep Gaussian Mixture models. We also design a new initialisation strategy and a data driven method that selects the best specification of the model and the optimal number of clusters for a given dataset. Besides, our model provides continuous low-dimensional representations of the data which can be a useful tool to visualize mixed datasets. Finally, we validate the performance of our approach comparing its results with state-of-the-art mixed data clustering models over several commonly used datasets.

关键词： Binary and count data Deep Gaussian Mixture model generalized linear latent variable model MCEM algorithm Ordinal and categorical data Two-heads architecture

来源：评论

学校读者我要写书评

暂无评论

Environmental controls on butterfly occurrence and species richness in Israel: The importance of temperature over rainfall

引用

ECOLOGY AND EVOLUTION 2021年第17期11卷 12035-12050页

作者： Comay, Orr Ben Yehuda, Oz Schwartz-Tzachor, Racheli Benyamini, Dubi Pe'er, Israel Ktalav, Inbar Pe'er, Guy UFZ Helmholtz Ctr Environm Res Dept Ecosyst Serv Permoserstr 15 D-04318 Leipzig Germany German Ctr Integrat Biodivers Res iDiv Leipzig Germany Tel Aviv Univ Sch Zool Tel Aviv Israel Tel Aviv Univ Steinhardt Museum Nat Hist Tel Aviv Israel Achva Acad Coll Arugot Israel Ramat Hanadiv Zikhron Yaakov Israel Israeli Lepidopterists Soc Bet Arye Israel BMS IL Webportal GlueCAD Biodivers IT Haifa Israel Univ Haifa Lab Archaeozool Dept Archaeol Haifa Israel

Butterflies are considered important indicators representing the state of biodiversity and key ecosystem functions, but their use as bioindicators requires a better understanding of how their observed response is linked to environmental factors. Moreover, better understanding how butterfly faunas vary with climate and land cover may be useful to estimate the potential impacts of various drivers, including climate change, botanical succession, grazing, and afforestation. It is particularly important to establish which species of butterflies are sensitive to each environmental driver. The study took place in Israel, including the West Bank and Golan Heights. To develop a robust and systematic approach for identifying how butterfly faunas vary with the environment, we analyzed the occurrence of 73 species and the abundance of 24 species from Israeli Butterfly Monitoring Scheme (BMS-IL) data. We used regional generalized additive models to quantify butterfly abundance, and generalized linear latent variable models and generalized linear models to quantify the impact of temperature, rainfall, soil type, and habitat on individual species and on the species community. Species richness was higher for cooler transects, and also for hilly and mountainous transects in the Mediterranean region (rendzina and Terra rossa soils) compared with the coastal plain (Hamra soil) and semiarid northern Jordan Vale (loessial sierozem soil). Species occurrence was better explained by temperature (negative correlation) than precipitation, while for abundance the opposite pattern was found. Soil type and habitat were insignificant drivers of occurrence and abundance. Butterfly faunas responded very strongly to temperature, even when accounting for other environmental factors. We expect that some butterfly species will disappear from marginal sites with global warming, and a large proportion will become rarer as the region becomes increasingly arid.

关键词： biogeography bioindicators butterflies citizen science community ecology generalized linear latent variable model

来源：评论

学校读者我要写书评

暂无评论

generalized linear latent models for multivariate longitudinal measurements mixed with hidden Markov models

引用

JOURNAL OF MULTIVARIATE ANALYSIS 2016年 152卷 259-275页

作者： Xia, Ye-Mao Tang, Nian-Sheng Gou, Jian-Wei Nanjing Forestry Univ Dept Appl Math Nanjing 210037 Jiangsu Peoples R China Yunnan Univ Dept Stat Kunming 650091 Peoples R China

This article presents a generalized linear latent variable model for analyzing multivariate longitudinal data within the hidden Markov model framework. The relationships among multiple items are captured by several common latent factors. The linear coregionalization method is adopted to model the temporal processes of latent variables. The merit of this modeling strategy lies in the fact that the processes among latent variables are nonseparate and codependent from each other. To account for possible heterogeneity and interrelationship among the longitudinal data, a hidden Markov model is introduced to model the transition probabilities across different latent states over time. The Monte Carlo expectation conditional maximization (MCECM) algorithm is developed to estimate unknown parameters in the proposed model. The Wald- and score-type statistics are proposed to test the related dependence of processes. A simulation study is conducted to investigate the performance of the proposed methodology. An example from a longitudinal study of cocaine use is taken to illustrate the proposed methodology. (C) 2016 Elsevier Inc. All rights reserved.

关键词： generalized linear latent variable model Hidden Markov model linear coregionalization mold MCECM algorithm

来源：评论

学校读者我要写书评

暂无评论

latent variable models for ordinal data by using the adaptive quadrature approximation

引用

COMPUTATIONAL STATISTICS 2013年第2期28卷 597-619页

作者： Cagnone, Silvia Monari, Paola Univ Bologna Dept Stat Bologna Italy

latent variable models for ordinal data represent a useful tool in different fields of research in which the constructs of interest are not directly observable so that one or more latent variables are required to reduce the complexity of the data. In these cases problems related to the integration of the likelihood function of the model can arise. Indeed analytical solutions do not exist and in presence of several latent variables the most used classical numerical approximation, the Gauss Hermite quadrature, cannot be applied since it requires several quadrature points per dimension in order to obtain quite accurate estimates and hence the computational effort becomes not feasible. Alternative solutions have been proposed in the literature, like the Laplace approximation and the adaptive quadrature. Different studies demonstrated the superiority of the latter method particularly in presence of categorical data. In this work we present a simulation study for evaluating the performance of the adaptive quadrature approximation for a general class of latent variable models for ordinal data under different conditions of study. A real data example is also illustrated.

关键词： generalized linear latent variable model Ordinal data Adaptive Gauss Hermite quadrature EM algorithm

来源：评论

学校读者我要写书评

暂无评论

Bacterial composition in Swedish raw drinking water reveals three major interacting ubiquitous metacommunities

引用

MICROBIOLOGYOPEN 2022年第5期11卷 e1320页

作者： Brindefalk, Bjorn Brolin, Harald Save-Soderbergh, Melle Karlsson, Edvin Sundell, David Wikstrom, Per Jacobsson, Karin Toljander, Jonas Stenberg, Per Sjodin, Andreas Dryselius, Rikard Forsman, Mats Ahlinder, Jon Swedish Def Res Agcy FOI CBRN Secur & Def Umea Sweden Univ Gothenburg Sahlgrenska Acad Inst Med Dept Mol & Clin Med Gothenburg Sweden Swedish Food Agcy Sci Div Uppsala Sweden Karolinska Inst Inst Environm Med Stockholm Sweden Umea Univ Dept Ecol & Environm Sci EMG Umea Sweden Swedish Univ Agr Sci Dept Biomed Sci & Vet Publ Hlth Uppsala Sweden

Background Surface raw water used as a source for drinking water production is a critical resource, sensitive to contamination. We conducted a study on Swedish raw water sources, aiming to identify mutually co-occurring metacommunities of bacteria, and environmental factors driving such patterns. Methods The water sources were different regarding nutrient composition, water quality, and climate characteristics, and displayed various degrees of anthropogenic impact. Water inlet samples were collected at six drinking water treatment plants over 3 years, totaling 230 samples. The bacterial communities of DNA sequenced samples (n = 175), obtained by 16S metabarcoding, were analyzed using a joint model for taxa abundance. Results Two major groups of well-defined metacommunities of microorganisms were identified, in addition to a third, less distinct, and taxonomically more diverse group. These three metacommunities showed various associations to the measured environmental data. Predictions for the well-defined metacommunities revealed differing sets of favored metabolic pathways and life strategies. In one community, taxa with methanogenic metabolism were common, while a second community was dominated by taxa with carbohydrate and lipid-focused metabolism. Conclusion The identification of ubiquitous persistent co-occurring bacterial metacommunities in freshwater habitats could potentially facilitate microbial source tracking analysis of contamination issues in freshwater sources.

关键词： 16S rRNA anthropogenic effects bacterial community analysis biotic interactions generalized linear latent variable model

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：