Summary form only given. Machine Learning has become a very popular approach in addressing problems in the computationalbiology and bioinformatics area. In addition, multi-classifier systems have also gained populari...
详细信息
Summary form only given. Machine Learning has become a very popular approach in addressing problems in the computationalbiology and bioinformatics area. In addition, multi-classifier systems have also gained popularity among researchers working in machine learning and applications for their ability to fuse together multiple models and obtain better overall accuracy and classification results. This talk is concerned with current issues in the design of multi-classifier systems and presents some multi-classifier developments for several bioinformatics problems. The talk will first present an overview and current status of machine learning methods in bioinformatics and computationalbiology. The talk will then bring in some important issues in building ensembles of classifiers, with a focus on the diversity and combination of individual classifiers. Few diversification and combination schemes are presented along with guidelines for the selection of different training paradigms and performance metrics, based on the properties and distribution of the data. Then, the presentation will proceed with introducing our computationalintelligence based multi-classifier developments for solving several bioinformatics problems, such as recognizing sequences in DNA strings, micro-array gene expression data analysis, protein structure prediction. The talk will also present related machine learning issues in developing such systems, such as learning from imbalanced datasets and using appropriate performance metrics for model selection and evaluation. The presented approaches and results will advocate that ensembles of classifiers can be used as effective modelling tools in solving challenging bioinformatics problems.
The proceedings contain 170 papers. The topics discussed include: stimulus-dependent noise facilitates tracking performances of neuronal networks;range parameter induced bifurcation in a single neuron model with delay...
ISBN:
(纸本)3642132774
The proceedings contain 170 papers. The topics discussed include: stimulus-dependent noise facilitates tracking performances of neuronal networks;range parameter induced bifurcation in a single neuron model with delay-dependent parameters;messenger RNA polyadenylation site recognition in green alga;a study to neuron ensemble of cognitive cortex ISI coding represent stimulus;STDP within NDS neurons;support vector regression and ant colony optimization for grid resources prediction;an improved kernel principal component analysis for large-scale data set;optimization of training samples with affinity propagation algorithm for multi-class SVM classification;an effective support vector data description with relevant metric learning;and a support vector machine (SVM) classification approach to heart murmur detection.
The proceedings contain 170 papers. The topics discussed include: stimulus-dependent noise facilitates tracking performances of neuronal networks;range parameter induced bifurcation in a single neuron model with delay...
ISBN:
(纸本)3642133177
The proceedings contain 170 papers. The topics discussed include: stimulus-dependent noise facilitates tracking performances of neuronal networks;range parameter induced bifurcation in a single neuron model with delay-dependent parameters;messenger RNA polyadenylation site recognition in green alga;a study to neuron ensemble of cognitive cortex ISI coding represent stimulus;STDP within NDS neurons;support vector regression and ant colony optimization for grid resources prediction;an improved kernel principal component analysis for large-scale data set;optimization of training samples with affinity propagation algorithm for multi-class SVM classification;an effective support vector data description with relevant metric learning;and a support vector machine (SVM) classification approach to heart murmur detection.
We introduce ACOPHY, a novel framework to apply Ant Colony Optimization (ACO) for phylogenetic reconstruction. ACOPHY overcomes a main drawback of other attempts to reconstruct phylogenies by defining a compact ACO gr...
详细信息
We consider string matching with variable length gaps. Given a string T and a pattern P consisting of strings separated by variable length gaps (arbitrary strings of length in a specified range), the problem is to fin...
详细信息
mRNA-Seq is an emerging massive parallel sequencing based technology for identification and quantification of gene transcripts. Although gene level expression can easily be estimated from mRNA-Seq data, estimating the...
详细信息
mRNA-Seq is an emerging massive parallel sequencing based technology for identification and quantification of gene transcripts. Although gene level expression can easily be estimated from mRNA-Seq data, estimating the isoform level expression poses serious problems, and appropriate methods are required. In this paper, we introduce a mathematical method to estimate transcript isoform concentrations, i.e. transcript isoform expression estimation via nonnegative least squares (TIEE/NLS).
In this paper we present the main steps to fold amino acid interaction networks. This is a graph whose vertices are the proteins amino acids and whose edges are the interactions between them. We begin by summarize rel...
详细信息
In this paper we present the main steps to fold amino acid interaction networks. This is a graph whose vertices are the proteins amino acids and whose edges are the interactions between them. We begin by summarize relative works about this type of graphs to describe their topological properties. Then, we propose a genetic algorithm which reconstructs the secondary structure motifs. We continue our folding process with an ant colony approach. We guide the ant system to the tertiary structure relying on a probability that two amino acids interact as a function of their physico-chemical properties.
We present the application of genetic programming (GP) to the zero-sum, deterministic, full-knowledge board game of Lose Checkers. Our system implements strongly typed GP trees, explicitly defined introns, local mutat...
详细信息
We present the application of genetic programming (GP) to the zero-sum, deterministic, full-knowledge board game of Lose Checkers. Our system implements strongly typed GP trees, explicitly defined introns, local mutations, and multi-tree individuals. Explicitly defined introns in the genome allow for information selected out of the population to be kept as a reservoir for possible future use. Multi-tree individuals are implemented by a method inspired by structural genes in living organisms, whereby we take a single tree describing a state evaluator and split it.
The structural class information about a protein is important to understand its biological properties. NMR is one of the most powerful tools to obtain structural information of proteins in atomic resolution. However, ...
详细信息
The structural class information about a protein is important to understand its biological properties. NMR is one of the most powerful tools to obtain structural information of proteins in atomic resolution. However, an analysis of protein three-dimensional structure from NMR spectra usually requires laborious chemical shift assignment. We developed a new method for predicting the protein structural class directly from the NMR spectra without any chemical shift assignment. The results show that our method outperforms the methods using current secondary structure prediction.
Clustering protein-protein interaction network aims to find functional modules and protein complexes. There are many computational graph clustering methods that are used in this field, but few of them are intelligent ...
详细信息
Clustering protein-protein interaction network aims to find functional modules and protein complexes. There are many computational graph clustering methods that are used in this field, but few of them are intelligent computational methods. In this paper, we present a novel improved immune genetic algorithm to find dense subgraphs based on efficient vaccination method, variable-length antibody schema definition and new local and global mutations.
暂无评论