Search results - Inner Mongolia University Library

A hybrid feature selection method based on Binary Jaya algorithm for micro-array data classification

COMPUTERS & ELECTRICAL ENGINEERING, 2021, vol. 90, pp. 106963-106963

Authors: Chaudhuri, Abhilasha; Sahu, Tirath Prasad (Natl Inst Technol Raipur, Dept Informat Technol, Chhattisgarh, India)

Micro-array technology generates high-dimensional data. The high dimensionality of the data hampers the learning capability of machine learning algorithms. Dimensionality can be reduced using feature selection (FS) techniques, an essential pre-processing step for high-dimensional data. In this work, a hybrid filter-wrapper approach is proposed for feature selection. The multi-attribute decision-making method called Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) is used as a filter for informative feature extraction. Further, a Binary Jaya algorithm with a time-varying transfer function is proposed as a wrapper feature selector to find the optimal subset of features. The proposed approach is tested on 10 benchmark micro-array datasets and compared with state-of-the-art methods. Experimental results suggest that the proposed approach performs better in terms of classification accuracy and is 10 times faster than existing approaches.

Keywords: TOPSIS; Feature selection; micro-array data; Jaya algorithm; Dimensionality reduction; Hybrid feature selection
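The two stages named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the criteria fed to TOPSIS, the sigmoid transfer function, the linear schedule for `tau`, and the names `topsis_scores` and `binary_jaya_step` are all assumptions made for illustration. The TOPSIS part follows the standard procedure (vector normalisation, weighting, distance to the ideal and anti-ideal solutions), and the Jaya part uses the standard Jaya update before mapping each value to a bit.

```python
import math
import random


def topsis_scores(matrix, weights, benefit):
    """Score alternatives (rows) over criteria (columns) with TOPSIS.

    Returns one closeness value in [0, 1] per row; higher is better.
    Here a row would be a gene and a column a filter criterion (assumption).
    """
    n_cols = len(matrix[0])
    # Vector-normalise each criterion column, then apply its weight.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) or 1.0
             for j in range(n_cols)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_cols)]
                for row in matrix]
    columns = list(zip(*weighted))
    # Ideal and anti-ideal points, per criterion direction.
    ideal = [max(c) if benefit[j] else min(c) for j, c in enumerate(columns)]
    anti = [min(c) if benefit[j] else max(c) for j, c in enumerate(columns)]
    scores = []
    for row in weighted:
        d_pos = math.dist(row, ideal)
        d_neg = math.dist(row, anti)
        total = d_pos + d_neg
        scores.append(d_neg / total if total else 0.0)
    return scores


def binary_jaya_step(position, best, worst, t, max_iter, rng,
                     tau_max=4.0, tau_min=0.01):
    """One Jaya move on a 0/1 feature mask.

    The continuous Jaya update is squashed by a sigmoid whose steepness
    parameter tau shrinks linearly with the iteration t (assumed schedule).
    """
    tau = tau_max - (tau_max - tau_min) * t / max_iter
    new_bits = []
    for x, b, w in zip(position, best, worst):
        r1, r2 = rng.random(), rng.random()
        # Standard Jaya update: move toward the best, away from the worst.
        v = x + r1 * (b - abs(x)) - r2 * (w - abs(x))
        prob = 1.0 / (1.0 + math.exp(-v / tau))
        new_bits.append(1 if rng.random() < prob else 0)
    return new_bits
```

In a hybrid pipeline of this kind, the genes with the highest TOPSIS closeness would be kept first, and the binary search would then run only over that reduced set, with classification accuracy as the fitness that decides which mask counts as best or worst.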

AMoDeBic: An adaptive Multi-objective Differential Evolution biclustering algorithm of microarray data using a biclustering binary mutation operator

EXPERT SYSTEMS WITH APPLICATIONS, 2024, vol. 238, Part B

Authors: Charfaoui, Younes; Houari, Amina; Boufera, Fatma (Univ Mustapha STAMBOULI Mascara, Mascara, Algeria)

In bioinformatics, biclustering is a crucial optimization task that can reveal hidden patterns and identify groups of genes that behave similarly under certain conditions. This study aims to efficiently identify high-quality and cohesive biclusters that share common characteristics across two data dimensions. To achieve this, we propose the first biclustering approach that utilizes Multi-objective Differential Evolution (DE), which is a novel technique for gene group discovery. Additionally, we introduce the Biclustering Binary Differential Evolution (BBDE), a new mutation operator that combines node addition and deletion, guided by an adaptive factor F. We thoroughly tested our method's effectiveness, taking into account biological relevance, noise, overlap resistance, and statistics. We compared our results to state-of-the-art algorithms using both synthetic and real datasets such as Yeast Cell Cycle, Saccharomyces cerevisiae, and Human B Cell. Our algorithms outperformed the compared methods and effectively identified significant biclusters.

Keywords: Differential Evolution algorithms; Multi-objective Differential evolution; Biclustering; Evolutionary algorithms; micro-array data; Knowledge discovery
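The abstract does not define the BBDE operator, so the following is only a hypothetical sketch of how a binary differential-evolution mutation could combine node addition and deletion under a factor F. The functions `binary_de_mutation` and `adaptive_f`, the rule of copying differing bits from the first donor, and the linear decay of F are all assumptions, not the published operator.

```python
import random


def binary_de_mutation(base, r1, r2, f, rng):
    """Hypothetical binary DE mutation on 0/1 bicluster membership vectors.

    Where donors r1 and r2 disagree, the base vector takes r1's bit with
    probability f: r1=1, r2=0 adds a node; r1=0, r2=1 deletes a node.
    """
    mutant = list(base)
    for i, (a, b) in enumerate(zip(r1, r2)):
        if a == b:
            continue  # identical donor bits carry no difference signal
        if rng.random() < f:
            mutant[i] = a
    return mutant


def adaptive_f(generation, max_generations, f_max=0.9, f_min=0.1):
    """Assumed schedule: F decays linearly from f_max to f_min."""
    return f_max - (f_max - f_min) * generation / max_generations
```

A larger F early in the run would let more additions and deletions through, and a smaller F later would make the search more conservative. Whether the paper adapts F this way is not stated in the abstract.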

Filter-Based Feature selection for microarray data using Improved Binary Gravitational Search Algorithm

3rd Conference on Swarm Intelligence and Evolutionary Computation (CSIEC)

Authors: Rouhi, Amirreza; Nezamabadi-pour, Hossein (Politecn Milan, Dept Elect & Informat, Milan, Italy; Shahid Bahonar Univ Kerman, Dept Elect Engn, Intelligent Data Proc Lab IDPL, Kerman, Iran)

ISBN: (print) 9781538649787

Today, high-dimensional data have become one of the most important challenges in machine learning. Among the thousands of features in such data, some are redundant or irrelevant, and selecting a few of them improves classifier performance. Micro-array data, which are among the most important high-dimensional data in medicine, have a large number of features and a small number of samples. Thus, old simple methods cannot be used to select features of such data effectively. Among the several methods that have been proposed for selecting features of high-dimensional data, swarm intelligence-based methods have attracted more attention than ever. These methods are suitable for time-consuming and complex problems because they search for a near-optimal solution at a desirable computational cost. In this paper, a filter-based swarm intelligence search method based on the Improved Binary Gravitational Search Algorithm (IBGSA) is proposed to integrate filter approaches with swarm intelligence-based methods and improve the feature selection process in micro-array data. The proposed method is applied to 5 high-dimensional micro-array databases and the obtained results are compared with one of the up-to-date methods used for feature selection in micro-array data. Experimental results verify the efficiency of the proposed algorithm.

Keywords: Feature selection; High-dimensional data; micro-array data; Swarm intelligence-based methods; Filter methods

A hybrid feature selection approach based on ensemble method for high-dimensional data

2nd Conference on Swarm Intelligence and Evolutionary Computation (CSIEC)

Authors: Rouhi, Amirreza; Nezamabadi-pour, Hossein (Shahid Bahonar Univ Kerman, Dept Elect Engn, Kerman, Iran)

ISBN: (print) 9781509043309

Nowadays, with the emergence of high-dimensional data, feature selection plays an important role in machine learning, particularly in classification problems, where it can be regarded as a vital and indispensable component. As the number of data dimensions increases, simple traditional methods perform poorly and cannot be used for effective feature selection. Using embedded methods, this study first discusses data dimension reduction using a filter-based approach. Two state-of-the-art meta-heuristic methods are then applied to the selected features, and the final features are selected from the aggregation of their selected features. The proposed method is evaluated on 5 high-dimensional micro-array datasets and the results are compared with several state-of-the-art feature selection approaches for high-dimensional data. Experimental results confirm the efficiency of the proposed method.

Keywords: feature selection; high dimensional data; ensemble methods; hybrid methods; meta-heuristic methods; micro-array data
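The pipeline described in this abstract (filter first, then two meta-heuristic selectors, then aggregation) can be sketched as follows. This is a minimal illustration under stated assumptions: the abstract names neither the filter score nor the aggregation rule, so `filter_top_k`, `aggregate`, and the union/intersection modes are hypothetical, and the two subsets passed to `aggregate` stand in for the outputs of the two meta-heuristics.

```python
def filter_top_k(scores, k):
    """Return the indices of the k highest-scoring features (filter stage)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


def aggregate(subset_a, subset_b, mode="union"):
    """Combine the feature subsets returned by two wrapper searches.

    'union' keeps a feature chosen by either search; 'intersection' keeps
    only features chosen by both. Which rule the paper uses is not stated.
    """
    a, b = set(subset_a), set(subset_b)
    if mode == "union":
        return sorted(a | b)
    if mode == "intersection":
        return sorted(a & b)
    raise ValueError(f"unknown mode: {mode}")
```

A union favours recall of potentially useful genes, while an intersection yields a smaller, more conservative subset; the choice trades subset size against the risk of discarding an informative feature.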

Use of transcriptomic data for extending a model of the AppA/PpsR system in Rhodobacter sphaeroides

BMC SYSTEMS BIOLOGY, 2017, vol. 11, issue 20171228, pp. 146-146

Authors: Pandey, Rakesh; Armitage, Judith P.; Wadhams, George H. (Univ Oxford, Dept Biochem, South Parks Rd, Oxford, England; Natl Inst Immunol, Aruna Asaf Ali Marg, New Delhi, India)

Background: Photosynthetic (PS) gene expression in Rhodobacter sphaeroides is regulated in response to changes in light and redox conditions mainly by the PrrB/A, FnrL and AppA/PpsR systems. The PrrB/A and FnrL systems activate PS gene expression under anaerobic conditions, while the AppA/PpsR system represses it under aerobic conditions. Recently, two mathematical models have been developed for the AppA/PpsR system and have demonstrated how the interaction between AppA and PpsR could lead to a phenotype in which PS genes are repressed under semi-aerobic conditions. These models have also predicted that the transition from aerobic to anaerobic growth mode could occur via a bistable regime. However, they lack experimentally quantifiable inputs and outputs. Here, we extend one of them to include such quantities, combine all relevant publicly available micro-array data for a PS gene of this bacterium, and use these data to parameterise the model. In addition, we hypothesise that the AppA/PpsR system alone might account for the observed trend of PS gene expression under semi-aerobic conditions. Results: Our extended model of the AppA/PpsR system includes the biological input of atmospheric oxygen concentration and an output of photosynthetic gene expression. Following our hypothesis that the AppA/PpsR system alone is sufficient to describe the overall trend of PS gene expression, we parameterise the model and suggest that the rate of AppA reduction in vivo should be faster than its oxidation. Also, we show that although both the reduced and oxidised forms of PpsR bind to the PS gene promoters in vitro, binding of the oxidised form alone as a repressor is sufficient to reproduce the observed PS gene expression pattern. Finally, the combinations of model parameters that fit the biological data well are broadly consistent with those previously determined to be required for the system to show (i) the repression of PS genes under semi-aerobic conditions, and (ii) bista

Keywords: micro-array data; Signal transduction system; Purple non-sulfur bacteria; Photosynthetic bacteria; Mathematical modelling; Gene regulatory network; Oxygen and light sensing

New feature selection for gene expression classification based on degree of class overlap in principal dimensions

COMPUTERS IN BIOLOGY AND MEDICINE, 2015, vol. 64, pp. 292-298

Authors: Rakkeitwinai, Somsak; Lursinsap, Chidchanok; Aporntewan, Chatchawit; Mutirangura, Apiwat (Chulalongkorn Univ, Fac Sci, Dept Math & Comp Sci, Bangkok 10330, Thailand; Chulalongkorn Univ, Fac Med, Dept Anat, Bangkok 10330, Thailand)

Micro-array data are typically characterized by high-dimensional features with a small number of samples. Several problems in identifying disease-causing genes from micro-array data can be transformed into the problem of classifying features extracted from gene expression in micro-array data. However, too many features can cause low prediction accuracy as well as high computational complexity. Dimensional reduction is a method for eliminating irrelevant features to improve prediction accuracy. Typically, the eigenvalues or the dimensional data variance from principal component analysis are used as criteria to select relevant features. This approach is simple but not efficient, since it does not consider the degree of data overlap in each dimension of the feature space. A new method to select relevant features based on the degree of dimensional data overlap with proper feature selection was introduced. Furthermore, our study concentrated on small-sized data sets, which usually occur in reality. The experimental results indicated that this new approach can achieve substantially higher prediction accuracy than other methods. (C) 2015 Elsevier Ltd. All rights reserved.

Keywords: micro-array data; Dimension reduction; Feature selection; Feature extraction; Principal component analysis; Support Vector Machine

Constructing Multivariate Prognostic Gene Signatures with Censored Survival data

Author: Derick R. Peterson

Modern high-throughput technologies allow us to simultaneously measure the expressions of a huge number of candidate predictors, some of which are likely to be associated with survival. One difficult task is to search among an enormous number of potential predictors and to correctly identify most of the important ones, without mistakenly identifying too many spurious associations. Mere variable selection is insufficient, however, for the information from the multiple predictors must be intelligently combined and calibrated to form the final composite predictor. Many commonly used procedures overfit the training data, miss many important predictors, or both. Although it is impossible to simultaneously adjust for a huge number of predictors in an unconstrained way, we propose a method that offers a middle ground where some partial multivariate adjustments can be made in an adaptive fashion, regardless of the number of candidate predictors. We demonstrate the performance of our proposed procedure in a simulation study within the Cox proportional hazards regression framework, and we apply our new method to a publicly available data set to construct a novel prognostic gene signature for breast cancer survival.

Keywords: micro-array data; Prognostic signature; Gene selection; Shrinkage estimation; Cox proportional hazards regression; Censored survival data
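For readers unfamiliar with the Cox proportional hazards framework mentioned in this abstract, the quantity being maximised is the partial log-likelihood, which handles censored observations by letting them contribute only to risk sets. The sketch below computes it for a given coefficient vector; it is not the adaptive procedure the author proposes, and the function name `cox_partial_log_likelihood` is chosen here for illustration. Ties in event times are handled Breslow-style, that is, every tied subject sees the same risk set.

```python
import math


def cox_partial_log_likelihood(beta, covariates, times, events):
    """Cox partial log-likelihood for coefficients beta.

    covariates: one list of predictor values per subject
    times:      observed follow-up time per subject
    events:     1 if the event was observed, 0 if the subject was censored
    """
    # Linear predictor x_i . beta for every subject.
    scores = [sum(b * x for b, x in zip(beta, row)) for row in covariates]
    total = 0.0
    for i, (t_i, d_i) in enumerate(zip(times, events)):
        if not d_i:
            continue  # censored subjects only appear inside risk sets
        # Risk set: everyone still under observation at time t_i.
        risk = sum(math.exp(scores[j])
                   for j, t_j in enumerate(times) if t_j >= t_i)
        total += scores[i] - math.log(risk)
    return total
```

With thousands of gene-expression predictors and few patients, maximising this function without constraints overfits, which is the motivation the abstract gives for shrinkage and partial multivariate adjustment.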

Mining biologically significant co-regulation patterns from microarray data

1st International Conference on Rough Sets and Knowledge Technology (RSKT 2006)

Authors: Zhao, Yuhai; Yin, Ying; Wang, Guoren (Northeastern Univ, Inst Comp Syst, Shenyang 110004, Peoples R China)

ISBN: (print) 3540362975

In this paper, we propose a novel model, namely g-Cluster, to mine biologically significant co-regulated gene clusters. The proposed model can (1) discover extra co-expressed genes that cannot be found by current pattern/tendency-based methods, and (2) discover inverted relationships overlooked by pattern/tendency-based methods. We also design two tree-based algorithms to mine all qualified g-Clusters. The experimental results show that (1) our approaches are effective and efficient, and (2) our approaches can find a number of co-regulated gene clusters missed by previous models, which are potentially of high biological significance.

Keywords: bioinformatics; clustering; micro-array data

A clustering algorithm for gene expression data using wavelet packet decomposition

36th Asilomar Conference on Signals, Systems and Computers

Authors: Rao, A (Univ Texas, Dept ECE, Austin, TX 78712, USA)

ISBN: (print) 0780375769

Mining large data sets and deriving meaning from the mined data is a computationally intensive and relevant issue in bioinformatics. In this paper we present an efficient algorithm to cluster genes into similar functional groups. This is a technique for extracting and characterizing rhythmic expression profiles from genome-wide DNA micro-array hybridization data. These patterns are clues to discovering rhythmic genes implicated in cell-cycle, circadian, or other biological processes. These functionalities are defined in the paper (anti-correlated, similar time expression, etc.). We present a signal-processing approach to this problem. We also explore an information-theoretic criterion for identifying those genes exhibiting maximum variation in behavior. The genes are clustered and then relationships are derived to propose a temporal cell-cycle model governing regulatory behavior. We are presently considering the Human Fibroblast and Yeast data sets for analysis.

Keywords: wavelets; micro-array data; entropy
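As a rough illustration of the signal-processing idea in this abstract, the sketch below decomposes one expression time series with a wavelet packet transform and summarises how its energy spreads across sub-bands with Shannon entropy. The Haar wavelet, the energy-based entropy, and the names `haar_split`, `wavelet_packet` and `energy_entropy` are assumptions; the abstract does not say which wavelet or which information-theoretic criterion the paper uses. The signal length is assumed to be divisible by 2 raised to the number of levels.

```python
import math


def haar_split(signal):
    """Split a signal into Haar approximation and detail halves."""
    s = math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) / s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) / s for i in pairs]
    return approx, detail


def wavelet_packet(signal, levels):
    """Full wavelet packet tree: every node is split at each level.

    Returns the 2**levels leaf nodes, each a list of coefficients.
    """
    nodes = [list(signal)]
    for _ in range(levels):
        next_nodes = []
        for node in nodes:
            approx, detail = haar_split(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    return nodes


def energy_entropy(nodes):
    """Shannon entropy (bits) of the energy distribution over the nodes."""
    energies = [sum(c * c for c in node) for node in nodes]
    total = sum(energies)
    if total == 0:
        return 0.0
    return -sum((e / total) * math.log2(e / total) for e in energies if e > 0)
```

A profile whose energy sits in a single sub-band gives an entropy near zero, while a profile spread evenly over all sub-bands gives the maximum, so a measure of this kind could serve as a feature when grouping genes by temporal behavior.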
