咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Causal discoveries for high di... 收藏

Causal discoveries for high dimensional mixed data

作     者:Cai, Zhanrui Xi, Dong Zhu, Xuan Li, Runze 

作者机构:Iowa State Univ Dept Stat Ames IA USA Gilead Sci Foster City CA 94404 USA Novartis Pharmaceut E Hanover NJ USA Penn State Univ Dept Stat University Pk PA 16802 USA 

出 版 物:《STATISTICS IN MEDICINE》 (医学统计学)

年 卷 期:2022年第41卷第24期

页      面:4924-4940页

核心收录:

学科分类:0710[理学-生物学] 1004[医学-公共卫生与预防医学(可授医学、理学学位)] 1001[医学-基础医学(可授医学、理学学位)] 0714[理学-统计学(可授理学、经济学学位)] 10[医学] 

主  题:causal discoveries latent Gaussian model mixed data PC algorithm rank correlation 

摘      要:Causal relationships are of crucial importance for biological and medical research. Algorithms have been proposed for causal structure learning with graphical visualizations. While much of the literature focuses on biological studies where data often follow the same distribution, for example, the normal distribution for all variables, challenges emerge from epidemiological and clinical studies where data are often mixed with continuous, binary, and ordinal variables. We propose to use a mixed latent Gaussian copula model to estimate the underlying correlation structure via the rank correlation for mixed data. This correlation structure is then incorporated into a popular causal discovery algorithm, the PC algorithm, to identify causal structures. The proposed algorithm, called the latent-PC algorithm, is able to discover the true causal structure consistently under mild conditions in high dimensional settings. From simulation studies, the latent-PC algorithm delivers a competitive performance in terms of a similar or higher true positive rate and a similar or lower false positive rate, compared with other variants of the PC algorithm. In the high dimensional settings where the number of variables is more than the number of observations, the causal graphs identified by the latent-PC algorithm are closer to the true causal structures, compared to other competing algorithms. Further, we demonstrate the utility of the latent-PC algorithm in a real dataset for hepatocellular carcinoma. Causal structures for patient survival are visualized and connected with clinical interpretations in the literature.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分