检索结果-内蒙古大学图书馆

A comprehensive review and comparison of existing computational methods for protein function prediction

BRIEFINGS IN BIOINFORMATICS 2024年第4期25卷 bbae289页

作者： Lin, Baohui Luo, Xiaoling Liu, Yumeng Jin, Xiaopeng Shenzhen Technol Univ Coll Big Data & Internet Shenzhen 518118 Guangdong Peoples R China Guangdong Prov Key Lab Novel Secur Intelligence Te Shenzhen 518000 Guangdong Peoples R China Shenzhen Univ Coll Comp Sci & Software Engn Shenzhen 518061 Guangdong Peoples R China

Protein function prediction is critical for understanding the cellular physiological and biochemical processes, and it opens up new possibilities for advancements in fields such as disease research and drug discovery. During the past decades, with the exponential growth of protein sequence data, many computational methods for predicting protein function have been proposed. Therefore, a systematic review and comparison of these methods are necessary. In this study, we divide these methods into four different categories, including sequence-based methods, 3D structure-based methods, PPI network-based methods and hybrid information-based methods. Furthermore, their advantages and disadvantages are discussed, and then their performance is comprehensively evaluated and compared. Finally, we discuss the challenges and opportunities present in this field.

关键词： protein function prediction sequence-based methods 3D structure-based methods PPI network-based methods and hybrid information-based methods

来源：评论

学校读者我要写书评

暂无评论

A Survey of Computational methods and Databases for lncRNA-MiRNA Interaction Prediction

引用

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023年第5期20卷 2810-2826页

作者： Sheng, Nan Huang, Lan Gao, Ling Cao, Yangkun Xie, Xuping Wang, Yan Jilin Univ Coll Comp Sci & Technol Changchun 130012 Peoples R China Jilin Univ Sch Artificial Intelligence Changchun 130012 Peoples R China

Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) are two prevalent non-coding RNAs in current research. They play critical regulatory roles in the life processes of animals and plants. Studies have shown that lncRNAs can interact with miRNAs to participate in post-transcriptional regulatory processes, mainly involved in regulating cancer development, metastatic progression, and drug resistance. Additionally, these interactions have significant effects on plant growth, development, and responses to biotic and abiotic stresses. Deciphering the potential relationships between lncRNAs and miRNAs may provide new insights into our understanding of the biological functions of lncRNAs and miRNAs, and the pathogenesis of complex diseases. In contrast, gathering information on lncRNA-miRNA interactions (LMIs) through biological experiments is expensive and time-consuming. With the accumulation of multi-omics data, computational models are extremely attractive in systematically exploring potential LMIs. To the best of our knowledge, this is the first comprehensive review of computational methods for identifying LMIs. Specifically, we first summarized the available public databases for predicting animal and plant LMIs. Second, we comprehensively reviewed the computational methods for predicting LMIs and classified them into two categories, including network-based methods and sequence-based methods. Third, we analyzed the standard evaluation methods and metrics used in LMI prediction. Finally, we pointed out some problems in the current study and discuss future research directions. Relevant databases and the latest advances in LMI prediction are summarized in a GitHub repository https://***/sheng-n/lncRNA-miRNA-interaction-methods, and we'll keep it updated.

关键词： Computational methods lncRNA-miRNA interaction prediction network-based methods sequence-based methods

来源：评论

学校读者我要写书评

暂无评论

Protein-peptide binding residue prediction based on protein language models and cross-attention mechanism

引用

ANALYTICAL BIOCHEMISTRY 2024年 694卷 115637页

作者： Hu, Jun Chen, Kai-Xin Rao, Bing Ni, Jing-Yuan Thafar, Maha A. Albaradei, Somayah Arif, Muhammad Zhejiang Univ Technol Coll Informat Engn Hangzhou 310023 Peoples R China Suzhou Inst Syst Med Ctr AI & Computat Biol Suzhou 215123 Peoples R China Hangzhou City Univ Sch Informat & Elect Engn Hangzhou 310015 Peoples R China Nanjing Univ Informat Sci & Technol NUIST Reading Acad Nanjing 210044 Peoples R China Taif Univ Coll Comp & Informat Technol Dept Comp Sci Taif 21944 Saudi Arabia King Abdulaziz Univ Fac Comp & Informat Technol Dept Comp Sci Jeddah Saudi Arabia Hamad Bin Khalifa Univ Coll Sci & Engn Doha 34110 Qatar

Accurate identifications of protein-peptide binding residues are essential for protein-peptide interactions and advancing drug discovery. To address this problem, extensive research efforts have been made to design more discriminative feature representations. However, extracting these explicit features usually depend on third-party tools, resulting in low computational efficacy and suffering from low predictive performance. In this study, we design an end-to-end deep learning-based method, E2EPep, for protein-peptide binding residue prediction using protein sequence only. E2EPep first employs and fine-tunes two state-of-the-art pre-trained protein language models that can extract two different high-latent feature representations from protein sequences relevant for protein structures and functions. A novel feature fusion module is then designed in E2EPep to fuse and optimize the above two feature representations of binding residues. In addition, we have also design E2EPep+, which integrates E2EPep and PepBCL models, to improve the prediction performance. Experimental results on two independent testing data sets demonstrate that E2EPep and E2EPep + could achieve the average AUC values of 0.846 and 0.842 while achieving an average Matthew's correlation coefficient value that is significantly higher than that of existing most of sequence-based methods and comparable to that of the state-of-the-art structurebased predictors. Detailed data analysis shows that the primary strength of E2EPep lies in the effectiveness of feature representation using cross-attention mechanism to fuse the embeddings generated by two fine-tuned protein language models. The standalone package of E2EPep and E2EPep + can be obtained at https://github. com/ckx259/*** for academic use only.

关键词： Peptide-binding residue prediction Protein language model sequence-based methods Cross-attention mechanism

来源：评论

学校读者我要写书评

暂无评论

Predicting protein-ligand binding residues with deep convolutional neural networks

引用

BMC BIOINFORMATICS 2019年第1期20卷 1-12页

作者： Cui, Yifeng Dong, Qiwen Hong, Daocheng Wang, Xikun East China Normal Univ Fac Educ 3663 N Zhongshan Rd Shanghai 200062 Peoples R China East China Normal Univ Sch Data Sci & Engn 3663 N Zhongshan Rd Shanghai 200062 Peoples R China Liaoning Normal Univ High Sch Dalian Peoples R China

BackgroundLigand-binding proteins play key roles in many biological processes. Identification of protein-ligand binding residues is important in understanding the biological functions of proteins. Existing computational methods can be roughly categorized as sequence-based or 3D-structure-based methods. All these methods are based on traditional machine learning. In a series of binding residue prediction tasks, 3D-structure-based methods are widely superior to sequence-based methods. However, due to the great number of proteins with known amino acid sequences, sequence-based methods have considerable room for improvement with the development of deep learning. Therefore, prediction of protein-ligand binding residues with deep learning requires *** this study, we propose a new sequence-based approach called DeepCSeqSite for ab initio protein-ligand binding residue prediction. DeepCSeqSite includes a standard edition and an enhanced edition. The classifier of DeepCSeqSite is based on a deep convolutional neural network. Several convolutional layers are stacked on top of each other to extract hierarchical features. The size of the effective context scope is expanded as the number of convolutional layers increases. The long-distance dependencies between residues can be captured by the large effective context scope, and stacking several layers enables the maximum length of dependencies to be precisely controlled. The extracted features are ultimately combined through one-by-one convolution kernels and softmax to predict whether the residues are binding residues. The state-of-the-art ligand-binding method COACH and some of its submethods are selected as baselines. The methods are tested on a set of 151 nonredundant proteins and three extended test sets. Experiments show that the improvement of the Matthews correlation coefficient (MCC) is no less than 0.05. In addition, a training data augmentation method that slightly improves the performance is discussed in th

关键词： Protein Binding residues sequence-based methods 3D-structure-based methods Deep convolutional networks

来源：评论

学校读者我要写书评

暂无评论

Computational modeling of membrane proteins

引用

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS 2015年第1期83卷 1-24页

作者： Leman, Julia Koehler Ulmschneider, Martin B. Gray, Jeffrey J. Johns Hopkins Univ Dept Chem & Biomol Engn Baltimore MD 21218 USA Johns Hopkins Univ Dept Mat Sci & Engn Baltimore MD 21218 USA

The determination of membrane protein (MP) structures has always trailed that of soluble proteins due to difficulties in their overexpression, reconstitution into membrane mimetics, and subsequent structure determination. The percentage of MP structures in the protein databank (PDB) has been at a constant 1-2% for the last decade. In contrast, over half of all drugs target MPs, only highlighting how little we understand about drug-specific effects in the human body. To reduce this gap, researchers have attempted to predict structural features of MPs even before the first structure was experimentally elucidated. In this review, we present current computational methods to predict MP structure, starting with secondary structure prediction, prediction of trans-membrane spans, and topology. Even though these methods generate reliable predictions, challenges such as predicting kinks or precise beginnings and ends of secondary structure elements are still waiting to be addressed. We describe recent developments in the prediction of 3D structures of both -helical MPs as well as -barrels using comparative modeling techniques, de novo methods, and molecular dynamics (MD) simulations. The increase of MP structures has (1) facilitated comparative modeling due to availability of more and better templates, and (2) improved the statistics for knowledge-based scoring functions. Moreover, de novo methods have benefited from the use of correlated mutations as restraints. Finally, we outline current advances that will likely shape the field in the forthcoming decade. Proteins 2015;83:1-24. (c) 2014 Wiley Periodicals, Inc.

关键词： membrane proteins protein structure protein modeling sequence-based methods structure prediction de novo folding homology modeling molecular dynamics simulations alpha-helical membrane proteins beta-barrel membrane proteins

来源：评论

学校读者我要写书评

暂无评论

Comparison of structure-based and threading-based approaches to protein functional annotation

引用

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS 2010年第1期78卷 118-134页

作者： Brylinski, Michal Skolnick, Jeffrey Georgia Inst Technol Ctr Study Syst Biol Sch Biol Atlanta GA 30318 USA

To exploit the vast amount of sequence information provided by the Genomic revolution, the biological function of these sequences must be identified. As a practical matter, this is often accomplished by functional inference. Purely sequence-based approaches, particularly in the "twilight zone" of low sequence similarity levels, are complicated by many factors. For proteins, structure-based techniques aim to overcome these problems;however, most require high-quality crystal structures and suffer from complex and equivocal relations between protein fold and function. In this study, in extensive benchmarking, we consider a number of aspects of structure-based functional annotation: binding pocket detection, molecular function assignment and ligand-based virtual screening. We demonstrate that protein threading driven by a strong sequence profile component greatly improves the quality of purely structure-based functional annotation in the "twilight zone." By detecting evolutionarily related proteins, it considerably reduces the high false positive rate of function inference derived on the basis of global structure similarity alone. Combined evolution/structure-based function assignment emerges as a powerful technique that can make a significant contribution to comprehensive proteome annotation.

关键词： binding pocket detection gene ontology molecular function protein function annotation protein threading sequence-based methods structure-based methods virtual screening

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：