检索结果-内蒙古大学图书馆

2013 ieee 6th International Workshop on computational intelligence and Applications, IWCIA 2013

作者： Watanuki, Yosuke Tamura, Keiichi Kitakami, Hajime Takahashi, Yoshihumi Graduate School of Information Sciences Hiroshima City University Hiroshima Japan

ISBN: (纸本)9781467357265

Suffix trees, which are trie structures that present the suffixes of given sequences (e.g., strings), are widely used for sequence search in different application domains such as, text data mining, web intelligence, bioinformatics and computational biology. In particular, suffix trees are useful in bioinformatics applications, because they can search similar sub-sequences and extract frequent sequence patterns efficiently. In recent years, efficient construction of a suffix tree that allows faster sequence searches has become one of the most important challenges, because the number and size of the data that are stored in sequence databases have been increasing exponentially. This paper proposes a novel parallelization model for approximate sequence matching that uses disk-based suffix trees, which are built on hard disks not on memory, on a multi-core CPU. In the proposed parallelization model, we divide an entire sequence database into two or more sub-databases called partitions. For each partition, we build a suffix tree and define a task as an approximate sequence matching on one suffix tree. Moreover, the proposed parallelization model involves a multiple buffering management system to avoid conflicts among CPU-cores. We evaluated the proposed parallelization model using an actual amino acid sequence database on a PC. The experimental results show a substantial improvement in computation performance. © 2013 ieee.

关键词： bioinformatics

来源：评论

学校读者我要写书评

暂无评论

Designing predictors of halophilic and non-halophilic proteins using support vector machines

Designing predictors of halophilic and non-halophilic protei...

引用

ieee symposium on computational intelligence and bioinformatics and computational biology (CIBCB)

作者： Hui-Ling Huang Yerukala Sathipati Srinivasulu Phasit Charoenkwan Hua-Chin Lee Shinn-Ying Ho Department of Biological Science and Technology Institute of Bioinformatics and Systems Biology National Chiao Tung University Hsinchu Taiwan Institute of Bioinformatics and Systems Biology National Chiao Tung University Hsinchu Taiwan

ISBN: (纸本)9781467358743

Finding the molecular features causes the halophilicity in the halostable organisms is helpful to understand the halophilic adaption. In this study, we proposed a prediction method for halophilic proteins by using a machine learning method. The stages of this study are six-fold. First, we establish a non-redundant dataset of the halophilic proteins, collected from NCBI, Uniprotkb and EMBL-EBI databases. The dataset consists of 245 positive and negative proteins with sequence identity

关键词： Proteins Amino acids Support vector machines Accuracy bioinformatics Organisms Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

An Optimization Rule for In Silico Identification of Targeted Overproduction in Metabolic Pathways

引用

ieee-ACM TRANSACTIONS ON computational biology AND bioinformatics 2013年第4期10卷 914-926页

作者： Das, Mouli Murthy, C. A. De, Rajat K. Indian Stat Inst Machine Intelligence Unit Kolkata 700108 W Bengal India

In an extension of previous work, here we introduce a second-order optimization method for determining optimal paths from the substrate to a target product of a metabolic network, through which the amount of the target is maximum. An objective function for the said purpose, along with certain linear constraints, is considered and minimized. The basis vectors spanning the null space of the stoichiometric matrix, depicting the metabolic network, are computed, and their convex combinations satisfying the constraints are considered as flux vectors. A set of other constraints, incorporating weighting coefficients corresponding to the enzymes in the pathway, are considered. These weighting coefficients appear in the objective function to be minimized. During minimization, the values of these weighting coefficients are estimated and learned. These values, on minimization, represent an optimal pathway, depicting optimal enzyme concentrations, leading to overproduction of the target. The results on various networks demonstrate the usefulness of the methodology in the domain of metabolic engineering. A comparison with the standard gradient descent and the extreme pathway analysis technique is also performed. Unlike the gradient descent method, the present method,

关键词： Local minima Newton-Raphson method underdetermined problem metabolic pathways learning parameter

来源：评论

学校读者我要写书评

暂无评论

Mapping of DNA sequences using Hidden Markov Model Self Organizing Maps

Mapping of DNA sequences using Hidden Markov Model Self Orga...

引用

ieee symposium on computational intelligence in bioinformatics and computational biology

作者： Hiroshi Dozono Gen Niina Department of Advance Technology Fusion Saga Univeristy

ISBN: (纸本)9781467358743

Recently, next generation sequencing techniques have begun to produce huge amounts of sequencing data. To analyze these data, an efficient method that can handle large amounts of information is required. In this paper, we proposed a method for classifying sets of DNA sequences by using a hidden Markov model self-organizing map. For this purpose, a learning algorithm that requires low computational costs was developed. The availability of this method was examined in experiments classifying DNA sequences of various types of genes.

关键词： Hidden Markov models DNA Probes bioinformatics Context Vectors Algorithm design and analysis

来源：评论

学校读者我要写书评

暂无评论

Protein Secondary Structure Prediction Using Support Vector Machines (SVMs)

Protein Secondary Structure Prediction Using Support Vector ...

引用

ieee International Conference on Machine intelligence Research and Advancement (ICMIRA)

作者： Patel, Mayuri Shah, Hitesh GH Patel Coll Engn & Technol Dept Informat Technol Vallabh Vidyanagar 388120 Gujarat India GH Patel Coll Engn & Technol Elect & Commun Engn Dept Vallabh Vidyanagar 388120 Gujarat India

ISBN: (纸本)9780769550138

bioinformatics or computational biology is field of science in which biology, computer science and information technology merges into a single discipline. In modern computation biology, protein secondary structure prediction is a major problem. Secondary structure prediction is depends on its amino acid sequence. Current studies prefer machine learning techniques for classification and regression task. Recently many researchers used various data mining and machine learning tool for protein structure prediction. Our intention is to use model based (i.e., supervised learning) approach for protein secondary structure prediction and our objective is to enhance the prediction of 2D protein structure problem using advance machine learning techniques like, linear and non-linear support vector machine with different kernel functions. The datasets used for this problem are Protein Data Bank (PDB) sets, which is based on structural classification of protein (SCOP), RS126 and CB513.

关键词： bioinformatics CB513 Feature Selection Protein Data Bank (PDB) RS126

来源：评论

学校读者我要写书评

暂无评论

ieee Transactions on Pattern Analysis and Machine intelligence Editorial Board

引用

ieee/ACM Transactions on computational biology and bioinformatics 2014年第6期11卷 C2-C2页

Provides a listing of current staff, committee members and society officers.

关键词：

来源：评论

学校读者我要写书评

暂无评论

ieee Transactions on Pattern Analysis and Machine intelligence Information for Authors

引用

ieee/ACM Transactions on computational biology and bioinformatics 2014年第6期11卷 C3-C3页

Provides instructions and guidelines to prospective authors who wish to submit manuscripts.

关键词：

来源：评论

学校读者我要写书评

暂无评论

RNA Secondary Structure Prediction Using Soft Computing

引用

ieee-ACM TRANSACTIONS ON computational biology AND bioinformatics 2013年第1期10卷 2-17页

作者： Ray, Shubhra Sankar Pal, Sankar K. Indian Stat Inst Machine Intelligence Unit Kolkata 700108 India Indian Stat Inst Ctr Soft Comp Res Kolkata 700108 India

Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned.

关键词： RNA DNA protein combinatorial optimization dynamic programming soft computing genetic algorithms neural networks fuzzy logic metaheuristics machine learning

来源：评论

学校读者我要写书评

暂无评论

Predicting Protein Crystallization Using a Simple Scoring Card Method

Predicting Protein Crystallization Using a Simple Scoring Ca...

引用

ieee symposium on computational intelligence in bioinformatics and computational biology

作者： Watshara Shoombuatong Hui-Ling Huang Jeerayut Chaijaruwanich Phasit Charoenkwan Hua-Chin Lee Shinn-Ying Ho Department of Computer Science Bioinformatics Research Laboratory Chiang Mai University Department of Biological Science and Technology Institute of Bioinformatics and Systems Biology National Chiao Tung University Institute of Bioinformatics and Systems Biology National Chiao Tung University

ISBN: (纸本)9781467358743

Many computational methods have been developed to predict protein crystallization. Most methods use amino acid and dipeptide compositions as part of the informative features. To advance the prediction accuracy, the support vector machine (SVM) based classifiers and ensemble approaches were effective and commonly-used techniques. However, these techniques suffer from the low interpretation ability of insight into crystallization. In this study, we utilize a newly-developed scoring card method (SCM) with a dipeptide composition feature to predict protein crystallization. This SCM classifier obtains prediction results 74%, 0.55 and 0.83 for accuracy, sensitivity and specificity, respectively, which is comparable to the SVM classifier using the same benchmarks. The experimental results show that the SCM classifier has advantages of simplicity, high interpretability, and high accuracy in predicting protein crystallization, compared with existing SVM-based ensemble classifiers.

关键词： Protein crystallization Protein prediction Scoring card method Genetic algorithm

来源：评论

学校读者我要写书评

暂无评论

Early Lung Cancer Detection using Nucleus Segementation based Features

Early Lung Cancer Detection using Nucleus Segementation base...

引用

ieee symposium on computational intelligence in bioinformatics and computational biology

作者： Kesav Kancherla Srinivas Mukkamala Institute for Complex Additive Systems and Analysis (ICASA) Computational Analysis and Network Enterprise Solutions (CAaNES)

ISBN: (纸本)9781467358743

In this study we propose an early lung cancer detection methodology using nucleus based features. First the sputum samples from patients are labeled with Tetrakis Carboxy Phenyl Porphine (TCPP) and fluorescent images of these samples are taken. TCPP is a porphyrin that is able to assist in labeling lung cancer cells by increasing numbers of low density lipoproteins coating on the surface of cancer. We study the performance of well know machine learning techniques in the context of lung cancer detection on Biomoda dataset. We obtained an accuracy of 81% using 71 features related to shape, intensity and color in our previous work. By adding the nucleus segmented features we improved the accuracy to 87%. Nucleus segmentation is performed by using Seeded region growing segmentation method. Our results demonstrate the potential of nucleus segmented features for detecting lung cancer.

关键词： Lung Cancer detection bioinformatics Machine Learning Seeded Region Growing segmentation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：