Background and Objective: Recent progress towards precision medicine has encouraged the use of electronic health records (EHRs) as a source of the large amounts of data required for studying the effect of treatments or risk factors in more specific subpopulations. Phenotyping algorithms allow patients to be automatically classified according to their particular electronic phenotype, thus facilitating the setup of retrospective cohorts. Our objective is to compare the performance of different classification strategies (use of standardized problem codes only, rule-based algorithms, statistical learning algorithms (six learners), and stacked generalization (five versions)) for categorizing patients according to their diabetic status (diabetic, non-diabetic, and inconclusive; diabetes of any type) using information extracted from EHRs. Methods: Patient information was extracted from the EHR at Hospital Italiano de Buenos Aires, Buenos Aires, Argentina. For the derivation and validation datasets, two probabilistic samples of patients from different years (2005: n = 1663; 2015: n = 800) were extracted. The only inclusion criterion was age (>= 40 and < 80 years). Four researchers manually reviewed all records and classified patients according to their diabetic status (diabetic: diabetes registered as a health problem or fulfilling the ADA criteria; non-diabetic: not fulfilling the ADA criteria and having at least one fasting glycemia below 126 mg/dL; inconclusive: no data regarding their diabetic status or only one abnormal value). The best performing algorithms within each strategy were tested on the validation set. Results: The standardized codes algorithm achieved a Kappa coefficient of 0.59 (95% CI 0.49, 0.59) in the validation set. The Boolean logic algorithm reached 0.82 (95% CI 0.76, 0.88). A slightly higher value was achieved by the feedforward neural network (0.90, 95% CI 0.85, 0.94). The best performing learner was the stacked generalization meta-learner...
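To make the stacked generalization strategy concrete, below is a minimal sketch of a stacked three-class classifier evaluated with Cohen's Kappa, the agreement metric reported above. The synthetic features, the choice of base learners, and the logistic-regression meta-learner are illustrative assumptions, not the paper's exact configuration.

```python
# A minimal sketch of stacked generalization for three-class diabetic-status
# classification. Features are synthetic stand-ins for EHR-derived variables
# (e.g. glycemia values, counts of diabetes-related codes); learners are
# illustrative assumptions, not the paper's exact configuration.
import numpy as np
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import cohen_kappa_score

rng = np.random.default_rng(0)
X = rng.normal(size=(1663, 10))          # 1663 patients, 10 synthetic features
y = rng.integers(0, 3, size=1663)        # 0 = non-diabetic, 1 = diabetic, 2 = inconclusive

base_learners = [
    ("rf", RandomForestClassifier(n_estimators=200, random_state=0)),
    ("nn", MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0)),
]
# The meta-learner combines base-learner predictions obtained via internal
# cross-validation, which is the core idea of stacked generalization.
stack = StackingClassifier(estimators=base_learners,
                           final_estimator=LogisticRegression(max_iter=1000),
                           cv=5)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
stack.fit(X_tr, y_tr)
print("Kappa:", cohen_kappa_score(y_te, stack.predict(X_te)))
```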
ISBN: (Print) 9789813279827; 9789813279810
Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record, and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health record (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks, and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Lastly, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex.
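As an illustration of the multitask setup described above, the following sketch trains a neural net with a shared encoder and one binary head per phenotype, so the target phenotype and the auxiliary phenotypes are learned jointly. The architecture, layer sizes, and equal loss weighting across tasks are assumptions for demonstration, not the authors' configuration.

```python
# A minimal sketch of multitask phenotyping: a shared encoder feeds one
# sigmoid head per phenotype, and all heads are trained jointly.
import torch
import torch.nn as nn

class MultitaskPhenotyper(nn.Module):
    def __init__(self, n_features: int, n_phenotypes: int, hidden: int = 64):
        super().__init__()
        # Shared representation learned across all phenotype tasks.
        self.encoder = nn.Sequential(
            nn.Linear(n_features, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # One binary classification head per phenotype (target + auxiliaries).
        self.heads = nn.ModuleList([nn.Linear(hidden, 1) for _ in range(n_phenotypes)])

    def forward(self, x):
        z = self.encoder(x)
        return torch.cat([head(z) for head in self.heads], dim=1)  # per-task logits

# Toy training loop on random data: 500 patients, 100 features, 5 phenotypes.
X = torch.randn(500, 100)
Y = torch.randint(0, 2, (500, 5)).float()
model = MultitaskPhenotyper(100, 5)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()  # equal weighting across tasks (an assumption)

for _ in range(10):
    opt.zero_grad()
    loss = loss_fn(model(X), Y)
    loss.backward()
    opt.step()
```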
ISBN: (Print) 9789813235533; 9789813235526
Accurate and robust cohort definition is critical to biomedical discovery using electronic health records (EHR). As with prospective study designs, high quality EHR-based research requires rigorous selection criteria to designate case/control status particular to each disease. Electronic phenotyping algorithms, which are manually built and validated per disease, have been successful in filling this need. However, these approaches are time-consuming, so algorithms have been developed for only a relatively small number of diseases. Methodologies that automatically learn features from EHRs have been used for cohort selection as well. To date, however, there has been no systematic analysis of how these methods perform against current gold standards. Accordingly, this paper compares the performance of a state-of-the-art automated feature learning method for extracting research-grade cohorts for five diseases against their established electronic phenotyping algorithms. In particular, we use word2vec to create unsupervised embeddings of the phenotype space within an EHR system. Using medical concepts as a query, we then rank patients by their proximity in the embedding space and automatically extract putative disease cohorts via a distance threshold. Experimental evaluation shows promising results, with an average F-score of 0.57 and AUC-ROC of 0.98. However, results varied considerably between diseases, necessitating further investigation and/or phenotype-specific refinement of the approach before it can be readily deployed across all diseases.
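The following sketch illustrates the embedding-and-threshold idea described above: train word2vec on sequences of medical codes, represent each patient as the mean of their code vectors, rank patients by cosine similarity to a query concept, and keep those above a cutoff. The toy codes, the tiny corpus, and the 0.5 threshold are assumptions for illustration, not values from the paper.

```python
# A minimal sketch of embedding-based cohort selection with word2vec.
import numpy as np
from gensim.models import Word2Vec

# Each "sentence" is one patient's ordered list of medical codes (toy data).
patient_codes = [
    ["E11", "I10", "Z79.4"],   # diabetes-looking record
    ["J45", "R05"],            # asthma-looking record
    ["E11", "E66", "I10"],
]
model = Word2Vec(patient_codes, vector_size=32, window=5, min_count=1, seed=0)

def patient_vector(codes):
    # Mean of code embeddings as a simple patient representation (an assumption).
    return np.mean([model.wv[c] for c in codes], axis=0)

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

query = model.wv["E11"]  # query concept, e.g. a type 2 diabetes code
scores = [(i, cosine(patient_vector(p), query)) for i, p in enumerate(patient_codes)]
# Rank by similarity and extract the putative cohort via a distance threshold.
cohort = [i for i, s in sorted(scores, key=lambda t: -t[1]) if s > 0.5]
print(cohort)
```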
Background: The implementation of electronic medical records (EMR) is becoming increasingly common. Reducing errors and data loss, increasing patient-care efficiency, assisting decision-making, and facilitating event surveillance are some of the many processes that EMRs help improve. In addition, they show great promise as a source of data for observational epidemiological studies, and their use for this purpose has increased significantly in recent years. Although EMRs have clearly improved the quantity and availability of data, data quality remains a problem. This is especially important when attempting to determine whether an event has actually occurred. We sought to assess the sensitivity, specificity, and agreement level of a codes-based algorithm for the detection of clinically relevant cardiovascular (CaVD) and cerebrovascular (CeVD) disease cases, using data from EMRs. Methods: Three family physicians from the research group selected clinically relevant CaVD and CeVD terms from the International Classification of Primary Care, Second Edition (ICPC-2), ICD-10 version 2015, and the SNOMED CT 2015 edition. These terms included signs, symptoms, diagnoses, and procedures associated with CaVD and CeVD. Terms not related to symptoms, signs, diagnoses, or procedures of CaVD or CeVD, as well as those describing incidental findings without clinical relevance, were excluded. The algorithm yielded a positive result if the patient had at least one of the selected terms in their medical record, as long as it was not recorded in error; otherwise, if no such terms were found, the patient was classified as negative. This algorithm was applied to a randomly selected sample of the active patients in the hospital's HMO as of 1/1/2005 who were 40-79 years old, had at least one year of membership in the HMO, and had at least one clinical encounter. Thus, patients were classified into four groups: (1) Negative patients (2) P...
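A minimal sketch of the detection rule described above: a patient is flagged positive if at least one selected CaVD/CeVD term appears in their record and that entry is not marked as recorded in error. The record schema and the term set below are hypothetical placeholders, not the study's actual term list.

```python
# A minimal sketch of the codes-based detection rule. The codes shown are
# hypothetical examples standing in for the selected ICPC-2/ICD-10/SNOMED terms.
SELECTED_TERMS = {"K74", "K75", "K90", "I63", "I64"}

def detect_cavd_cevd(problem_list):
    """problem_list: iterable of (code, recorded_in_error) tuples from the EMR.

    Positive if any selected term is present and not flagged as an error entry;
    negative otherwise.
    """
    return any(code in SELECTED_TERMS and not in_error
               for code, in_error in problem_list)

# Example: one valid cerebrovascular code -> positive.
print(detect_cavd_cevd([("I63", False), ("R05", False)]))  # True
print(detect_cavd_cevd([("I63", True)]))                   # False (error entry)
```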