检索结果-内蒙古大学图书馆

Leveraging medical thesauri and physician feedback for improving medical literature retrieval for case queries

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 2012年第5期19卷 851-858页

作者： Sondhi, Parikshit Sun, Jimeng Zhai, ChengXiang Sorrentino, Robert Kohn, Martin S. Univ Illinois Dept Comp Sci Urbana IL 61801 USA IBM Thomas J Watson Res Ctr Yorktown Hts NY USA

Objective This paper presents a study of methods for medical literature retrieval for case queries, in which the goal is to retrieve literature articles similar to a given patient case. In particular, it focuses on analyzing the performance of state-of-the-art general retrieval methods and improving them by the use of medical thesauri and physician feedback. Materials and Methods The Kullback-Leibler divergence retrieval model with Dirichlet smoothing is used as the state-of-the-art general retrieval method. Pseudorelevance feedback and term weighing methods are proposed by leveraging MeSH and UMLS thesauri. Evaluation is performed on a test collection recently created for the ImageCLEF medical case retrieval challenge. Results Experimental results show that a well-tuned state-of-the-art general retrieval model achieves a mean average precision of 0.2754, but the performance can be improved by over 40% to 0.3980, through the proposed methods. Discussion The results over the ImageCLEF test collection, which is currently the best collection available for the task, are encouraging. There are, however, limitations due to small evaluation set size. The analysis shows that further refinement of the methods is necessary before they can be really useful in a clinical setting. Conclusion Medical case-based literature retrieval is a critical search application that presents a number of unique challenges. This analysis shows that the state-of-the-art general retrieval models are reasonably good for the task, but the performance can be significantly improved by developing new task-specific retrieval models that incorporate medical thesauri and physician feedback.

关键词： Case search clinical (L01.700.508.300.190) computer-assisted (L01.700.508.100) decision making decision support systems decision support techniques (L01.700.508.190) high-performance and large-scale computing information management (L01.399) information retrieval information storage and retrieval (L01.700.508.280) language models machine learning medical case-based retrieval medical case retrieval medical informatics (L01.313.500) natural language processing semantic weighing statistical analysis of large datasets uncertain reasoning and decision theory visualization of data and knowledge

来源：评论

学校读者我要写书评

暂无评论

Utility of gene-specific algorithms for predicting pathogenicity of uncertain gene variants

引用

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 2012年第2期19卷 207-211页

作者： Crockett, David K. Lyon, Elaine Williams, Marc S. Narus, Scott P. Facelli, Julio C. Mitchell, Joyce A. Univ Utah Sch Med Dept Biomed Informat Salt Lake City UT USA Univ Utah Sch Med Dept Pathol Salt Lake City UT USA Intermt Healthcare Clin Genet Inst Salt Lake City UT USA Univ Utah Ctr High Performance Comp Salt Lake City UT USA

The rapid advance of gene sequencing technologies has produced an unprecedented rate of discovery of genome variation in humans. A growing number of authoritative clinical repositories archive gene variants and disease phenotypes, yet there are currently many more gene variants that lack clear annotation or disease association. To date, there has been very limited coverage of gene-specific predictors in the literature. Here the evaluation is presented of "gene-specific" predictor models based on a naive Bayesian classifier for 20 gene-disease datasets, containing 3986 variants with clinically characterized patient conditions. The utility of gene-specific prediction is then compared with "all-gene" generalized prediction and also with existing popular predictors. Gene-specific computational prediction models derived from clinically curated gene variant disease datasets often outperform established generalized algorithms for novel and uncertain gene variants.

关键词： Amino acid properties gene variant classification machine learning phenotype prediction bioinformatics gene variants classification gene disease database developing/using computerized provider order entry designing usable (responsive) resources and systems methods for integration of information from disparate sources high-performance and large-scale computing distributed systems agents software engineering: architecture data exchange communication integration across care settings (inter- and intra-enterprise) system implementation and management issues languages computational methods statistical analysis of large datasets advanced algorithms identifying genome and protein structure and function detecting disease outbreaks and biological threats visualization of data and knowledge

来源：评论

学校读者我要写书评

暂无评论

Exploiting time in electronic health record correlations

引用

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 2011年第Sup1期18卷 I109-I115页

作者： Hripcsak, George Albers, David J. Perotte, Adler Columbia Univ Med Ctr Dept Biomed Informat New York NY 10032 USA

Objective To demonstrate that a large, heterogeneous clinical database can reveal fine temporal patterns in clinical associations;to illustrate several types of associations;and to ascertain the value of exploiting time. Materials and methods Lagged linear correlation was calculated between seven clinical laboratory values and 30 clinical concepts extracted from resident signout notes from a 22-year, 3-million-patient database of electronic health records. Time points were interpolated, and patients were normalized to reduce inter-patient effects. Results The method revealed several types of associations with detailed temporal patterns. Definitional associations included low blood potassium preceding 'hypokalemia.' Low potassium preceding the drug spironolactone with high potassium following spironolactone exemplified intentional and physiologic associations, respectively. Counterintuitive results such as the fact that diseases appeared to follow their effects may be due to the workflow of healthcare, in which clinical findings precede the clinician's diagnosis of a disease even though the disease actually preceded the findings. Fully exploiting time by interpolating time points produced less noisy results. Discussion Electronic health records are not direct reflections of the patient state, but rather reflections of the healthcare process and the recording process. With proper techniques and understanding, and with proper incorporation of time, interpretable associations can be derived from a large clinical database. Conclusion A large, heterogeneous clinical database can reveal clinical associations, time is an important feature, and care must be taken to interpret the results.

关键词： Electronic health records data mining time series associations modeling physiologic and disease processes linking the genotype and phenotype languages and computational methods statistical analysis of large datasets advanced algorithms high-performance and large-scale computing detecting disease outbreaks and biological threats simulation of complex systems (at all levels: molecules to work groups to organizations)

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：