检索结果-内蒙古大学图书馆

IEEE 28th Convention of Electrical and Electronics Engineers in Israel (IEEEI)

作者： Manevitz, Miriam Samson, Moshe Hebrew Univ Jerusalem Sch Comp Sci & Engn IL-91905 Jerusalem Israel

ISBN: (纸本)9781479959877

One of the central problems in bioinformatics is de-novo finding of recurring motifs in the DNA. Since these motifs are preserved throughout evolution they probably have a significant biological role. One of the most widely used existing tools uses Expectation Maximization (EM) algorithm in order to learn the parameters of a statistical model based on partial data. One such method is based on assuming the Motif-data is generated by a Hidden Markov Model (HMM). This method is called the meme algorithm. Despite its success, this method is in its essence a hill-climbing method, and as such, is known to be subject to being caught in local optima. In this work, we tackled the problem by using, instead, a genetic algorithm, and to search for the optimal probabilities of the HMM model. In certain occasions we succeeded in achieving better results using GA.

关键词： DNA biochemistry bioinformatics data mining evolution (biological) expectation-maximisation algorithm genetic algorithms genetics hidden Markov models learning (artificial intelligence) molecular biophysics molecular configurations probability DNA motif preservation EM algorithm HMM model meme algorithm de novo motif finding evolution expectation maximization algorithm genetic algorithm hidden Markov model hill-climbing method local optima motif data generation optimal probability search parameter learning partial data recurring DNA motif finding statistical model Equations Genetic algorithms Hidden Markov models Mathematical model Sociology Statistics expectation-maximisation algorithm Hidden Markov models Genetic algorithm (GA) Bioinformatics molecular biology Parameter Learning statistical models (nuclear) Sociology Molecular Conformation DNA data mining biochemistry Mathematical Model Genetics evolution

来源：评论

学校读者我要写书评

暂无评论

A Sequential Method for Discovering Probabilistic Motifs in Proteins

引用

Methods of Information in Medicine 2018年第1期43卷 9-12页

作者： K. Blekas D. I. Fotiadis A. Likas

Objectives: This paper proposes a greedy algorithm for learning a mixture of motifs model through likelihood maximization, in order to discover common substrings, known as motifs, from a given collection of related biosequences. Methods: The approach sequentially adds a new motif component to a mixture model by performing a combined scheme of global and local search for appropriately initializing the component parameters. A hierarchical clustering scheme is also applied initially which leads to the identification of candidate motif models and speeds up the global searching procedure. Results: The performance of the proposed algorithm has been studied in both artificial and real biological datasets. In comparison with the well-known meme approach, the algorithm is advantageous since it identifies motifs with significant conservation and produces larger protein fingerprints. Conclusion: The proposed greedy algorithm constitutes a promising approach for discovering multiple probabilistic motifs in biological sequences. By using an effective incremental mixture modeling strategy, our technique manages to successfully overcome the limitation of the meme scheme which erases motif occurrences each time a new motif is discovered.

关键词： Motif discovery mixture of motifs EM algorithm protein fingerprints meme algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：