Search Results - Inner Mongolia University Library

Utility of gene-specific algorithms for predicting pathogenicity of uncertain gene variants

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, Vol. 19, No. 2, pp. 207-211

Authors: Crockett, David K.; Lyon, Elaine; Williams, Marc S.; Narus, Scott P.; Facelli, Julio C.; Mitchell, Joyce A.

Affiliations: Univ Utah Sch Med Dept Biomed Informat Salt Lake City UT USA; Univ Utah Sch Med Dept Pathol Salt Lake City UT USA; Intermt Healthcare Clin Genet Inst Salt Lake City UT USA; Univ Utah Ctr High Performance Comp Salt Lake City UT USA

The rapid advance of gene sequencing technologies has produced an unprecedented rate of discovery of genome variation in humans. A growing number of authoritative clinical repositories archive gene variants and disease phenotypes, yet there are currently many more gene variants that lack clear annotation or disease association. To date, there has been very limited coverage of gene-specific predictors in the literature. Here the evaluation is presented of "gene-specific" predictor models based on a naive Bayesian classifier for 20 gene-disease datasets, containing 3986 variants with clinically characterized patient conditions. The utility of gene-specific prediction is then compared with "all-gene" generalized prediction and also with existing popular predictors. Gene-specific computational prediction models derived from clinically curated gene variant disease datasets often outperform established generalized algorithms for novel and uncertain gene variants.

Keywords: Amino acid properties gene variant classification machine learning phenotype prediction bioinformatics gene variants classification gene disease database developing/using computerized provider order entry designing usable (responsive) resources and systems methods for integration of information from disparate sources high-performance and large-scale computing distributed systems agents software engineering: architecture data exchange communication integration across care settings (inter- and intra-enterprise) system implementation and management issues languages computational methods statistical analysis of large datasets advanced algorithms identifying genome and protein structure and function detecting disease outbreaks and biological threats visualization of data and knowledge

Implementation of a deidentified federated data network for population-based cohort discovery

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, Vol. 19, No. E1, pp. E60-E67

Authors: Anderson, Nicholas; Abend, Aaron; Mandel, Aaron; Geraghty, Estella; Gabriel, Davera; Wynden, Rob; Kamerick, Michael; Anderson, Kent; Rainwater, Julie; Tarczy-Hornoch, Peter

Affiliations: Univ Washington Dept Biomed Hlth Informat Seattle WA 98109 USA; Recombinant Data Corp Newton MA USA; Univ Calif Davis Sacramento CA 95817 USA; Univ Calif San Francisco San Francisco CA 94143 USA

Objective: The Cross-Institutional Clinical Translational Research project explored a federated query tool and looked at how this tool can facilitate clinical trial cohort discovery by managing access to aggregate patient data located within unaffiliated academic medical centers. Methods: The project adapted software from the Informatics for Integrating Biology and the Bedside (i2b2) program to connect three Clinical Translational Research Award sites: University of Washington, Seattle, University of California, Davis, and University of California, San Francisco. The project developed an iterative spiral software development model to support the implementation and coordination of this multisite data resource. Results: By standardizing technical infrastructures, policies, and semantics, the project enabled federated querying of deidentified clinical datasets stored in separate institutional environments and identified barriers to engaging users for measuring utility. Discussion: The authors discuss the iterative development and evaluation phases of the project and highlight the challenges identified and the lessons learned. Conclusion: The common system architecture and translational processes provide high-level (aggregate) deidentified access to a large patient population (>5 million patients), and represent a novel and extensible resource. Enhancing the network for more focused disease areas will require research-driven partnerships represented across all partner sites.

Keywords: Information dissemination information management clinical trials as topic cohort discovery federated network ethical study methods knowledge representations information storage and retrieval (text and images) knowledge bases surveys and needs analysis shrine i2b2 transmart data warehousing informatics evaluation machine learning predictive modeling statistical learning privacy technology modeling physiologic and disease processes linking the genotype and phenotype identifying genome and protein structure and function visualization of data and knowledge

Clinical utility of sequence-based genotype compared with that derivable from genotyping arrays

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, Vol. 19, No. E1, pp. E21-E27

Authors: Morgan, Alexander A.; Chen, Rong; Butte, Atul Janardhan

Affiliations: Stanford Univ Dept Biochem Stanford Genome Technol Ctr Stanford CA 94305 USA; Stanford Univ Dept Pediat Div Syst Med Biomed Informat Grad Training Program Stanford CA 94305 USA; Lucile Packard Childrens Hosp Palo Alto CA USA

Objective: We investigated the common-disease relevant information obtained from sequencing compared with that reported from genotyping arrays. Materials and methods: Using 187 publicly available individual human genomes, we constructed genomic disease risk summaries based on 55 common diseases with reported gene-disease associations in the research literature using two different risk models, one based on the product of likelihood ratios and the other on the allelic variant with the maximum associated disease risk. We also constructed risk profiles based on the single nucleotide polymorphisms (SNPs) of these individuals that could be measured or imputed from two common genotyping array platforms. Results: We show that the model risk predictions derived from sequencing differ substantially from those obtained from the SNPs measured on commercially available genotyping arrays for several different non-monogenic diseases, although high density genotyping arrays give identical results for many diseases. Conclusions: Our approach may be used to compare the ability of different platforms to probe known genetic risks disease by disease.
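The two risk models named in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the example likelihood ratios, and the pre-test probability are all hypothetical, and the conversion to a post-test probability through odds is the standard textbook rule, which the abstract itself does not specify.

```python
from math import prod


def product_of_lrs(likelihood_ratios):
    """Model 1 (sketch): combine per-variant likelihood ratios by multiplication."""
    return prod(likelihood_ratios)


def max_risk_lr(likelihood_ratios):
    """Model 2 (sketch): keep only the variant with the largest likelihood ratio."""
    return max(likelihood_ratios)


def post_test_probability(pre_test_probability, likelihood_ratio):
    """Standard odds update: post-test odds = pre-test odds * likelihood ratio."""
    pre_odds = pre_test_probability / (1.0 - pre_test_probability)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


# Hypothetical likelihood ratios for three variants linked to one disease.
example_lrs = [1.2, 0.9, 1.5]
# Hypothetical pre-test (population) probability of that disease.
example_prior = 0.10

combined = product_of_lrs(example_lrs)
strongest = max_risk_lr(example_lrs)
print(post_test_probability(example_prior, combined))
print(post_test_probability(example_prior, strongest))
```

With these made-up inputs the two models give different post-test probabilities, which is the kind of disagreement a platform comparison would need to account for.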

Keywords: genomic medicine personalized medicine genotyping sequencing modeling physiologic and disease processes linking the genotype and phenotype identifying genome and protein structure and function visualization of data and knowledge

Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, Vol. 18, No. 5, pp. 557-562

Authors: de Bruijn, Berry; Cherry, Colin; Kiritchenko, Svetlana; Martin, Joel; Zhu, Xiaodan

Affiliations: Natl Res Council Canada Inst Informat Technol Ottawa ON K1A 0R6 Canada

Objective: As clinical text mining continues to mature, its potential as an enabling technology for innovations in patient care and clinical research is becoming a reality. A critical part of that process is rigid benchmark testing of natural language processing methods on realistic clinical narrative. In this paper, the authors describe the design and performance of three state-of-the-art text-mining applications from the National Research Council of Canada on evaluations within the 2010 i2b2 challenge. Design: The three systems perform three key steps in clinical information extraction: (1) extraction of medical problems, tests, and treatments from discharge summaries and progress notes; (2) classification of assertions made on the medical problems; (3) classification of relations between medical concepts. Machine learning systems performed these tasks using large-dimensional bags of features, as derived from both the text itself and from external sources: UMLS, cTAKES, and Medline. Measurements: Performance was measured per subtask, using micro-averaged F-scores, as calculated by comparing system annotations with ground-truth annotations on a test set. Results: The systems ranked high among all submitted systems in the competition, with the following F-scores: concept extraction 0.8523 (ranked first); assertion detection 0.9362 (ranked first); relationship detection 0.7313 (ranked second). Conclusion: For all tasks, we found that the introduction of a wide range of features was crucial to success. Importantly, our choice of machine learning algorithms allowed us to be versatile in our feature design, and to introduce a large number of features without overfitting and without encountering computing-resource bottlenecks.
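For readers unfamiliar with the metric named in the abstract above, the following Python sketch shows how a micro-averaged F-score is commonly computed: true positives, false positives, and false negatives are pooled across all classes before precision and recall are taken. The function name and the per-class counts are hypothetical and are not taken from the i2b2 2010 evaluation; the official scorer may differ in its matching rules.

```python
def micro_f_score(counts):
    """Micro-averaged F1 from per-class (tp, fp, fn) counts.

    `counts` maps a class label to a tuple (true positives,
    false positives, false negatives). Counts are summed over
    classes first, so frequent classes weigh more than rare ones.
    """
    tp = sum(c[0] for c in counts.values())
    fp = sum(c[1] for c in counts.values())
    fn = sum(c[2] for c in counts.values())
    if tp == 0:
        # No correct predictions: precision and recall are both zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for the three concept types mentioned in the abstract.
example_counts = {
    "problem": (80, 10, 20),
    "test": (40, 10, 5),
    "treatment": (30, 5, 10),
}
print(round(micro_f_score(example_counts), 4))
```

Because the counts are pooled, the result equals 2·TP / (2·TP + FP + FN) over the whole test set, which differs from averaging the per-class F-scores (the macro average).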

Keywords: natural language processing semantics classification/*methods computerized medical records systems patient discharge/*statistics & numerical data text mining concept detection relation extraction document coding machine learning modeling physiologic and disease processes linking the genotype and phenotype identifying genome and protein structure and function visualization of data and knowledge

Computationally translating molecular discoveries into tools for medicine: translational bioinformatics articles now featured in JAMIA

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, Vol. 18, No. 4, pp. 352-353

Authors: Butte, Atul J.; Shah, Nigam H.

Affiliations: Stanford Univ Sch Med Dept Pediat Div Syst Med Stanford CA 94305 USA; Lucile Packard Childrens Hosp Palo Alto CA USA; Stanford Univ Sch Med Stanford Ctr Biomed Informat Res Stanford CA 94305 USA

Translational bioinformatics: linking knowledge across biological and clinical realms

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, Vol. 18, No. 4, pp. 354-357

Authors: Sarkar, Indra Neil; Butte, Atul J.; Lussier, Yves A.; Tarczy-Hornoch, Peter; Ohno-Machado, Lucila

Affiliations: Univ Vermont Ctr Clin & Translat Sci Burlington VT 05405 USA; Univ Vermont Coll Med Dept Microbiol & Mol Genet Burlington VT 05405 USA; Univ Vermont Coll Engn & Math Sci Dept Comp Sci Burlington VT 05405 USA; Stanford Univ Sch Med Dept Pediat Div Syst Med Stanford CA 94305 USA; Univ Chicago Dept Med Med Genet Sect Chicago IL 60637 USA; Univ Chicago Ludwig Ctr Metastasis Res UC Comprehens Canc Ctr Chicago IL 60637 USA; Univ Chicago Inst Translat Med Computat Inst Inst Genom & Syst Biol Chicago IL 60637 USA; Univ Washington Div Biomed & Hlth Informat Seattle WA 98195 USA; Univ Washington Inst Translat Hlth Sci Seattle WA 98195 USA; Univ Washington Inst Genom Med Seattle WA 98195 USA; Univ Washington Dept Comp Sci Seattle WA 98195 USA; Univ Calif San Diego Div Biomed Informat La Jolla CA 92093 USA

Nearly a decade since the completion of the first draft of the human genome, the biomedical community is positioned to usher in a new era of scientific inquiry that links fundamental biological insights with clinical knowledge. Accordingly, holistic approaches are needed to develop and assess hypotheses that incorporate genotypic, phenotypic, and environmental knowledge. This perspective presents translational bioinformatics as a discipline that builds on the successes of bioinformatics and health informatics for the study of complex diseases. The early successes of translational bioinformatics are indicative of the potential to achieve the promise of the Human Genome Project for gaining deeper insights into the genetic underpinnings of disease and progress toward the development of a new generation of therapies.

Keywords: Translational bioinformatics systems medicine systems biology bioinformatics biomedical informatics knowledge representation information retrieval phylogenetics modeling physiologic and disease processes linking the genotype and phenotype identifying genome and protein structure and function visualization of data and knowledge simulation of complex systems (at all levels: molecules to work groups to organizations) knowledge representations uncertain reasoning and decision theory languages computational methods statistical analysis of large datasets advanced algorithms discovery text and data mining methods natural-language processing automated learning ontologies
