咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Genomic Language Models: Oppor... 收藏
arXiv

Genomic Language Models: Opportunities and Challenges

作     者:Benegas, Gonzalo Ye, Chengzhong Albors, Carlos Li, Jianan Canal Song, Yun S. 

作者机构:Computer Science Division University of California Berkeley United States Department of Statistics University of California Berkeley United States Center for Computational Biology University of California Berkeley United States 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2024年

核心收录:

主  题:Genome 

摘      要:Large language models (LLMs) are having transformative impacts across a wide range of scientific fields, particularly in the biomedical sciences. Just as the goal of Natural Language Processing is to understand sequences of words, a major objective in biology is to understand biological sequences. Genomic Language Models (gLMs), which are LLMs trained on DNA sequences, have the potential to significantly advance our understanding of genomes and how DNA elements at various scales interact to give rise to complex functions. To showcase this potential, we highlight key applications of gLMs, including functional constraint prediction, sequence design, and transfer learning. Despite notable recent progress, however, developing effective and efficient gLMs presents numerous challenges, especially for species with large, complex genomes. Here, we discuss major considerations for developing and evaluating *** Codes 92-08, 92B20, 68T50, 68T07 © 2024, CC BY-NC-ND.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分