Foundational Large Language Models for Materials Research

Authors: Mishra, Vaibhav; Singh, Somaditya; Ahlawat, Dhruv; Zaki, Mohd; Bihani, Vaibhav; Grover, Hargun Singh; Mishra, Biswajit; Miret, Santiago; Mausam; Krishnan, N.M. Anoop

Affiliations: Department of Computer Science and Engineering, Department of Civil Engineering, and Yardi School of Artificial Intelligence, Indian Institute of Technology Delhi, India; Cerebras Systems Inc., United States; Intel Labs

Published in: arXiv

Year: 2024


Subject: Crystallography

Abstract: Materials discovery and development are critical for addressing global challenges in renewable energy, sustainability, and advanced technology. Yet, the exponential growth in materials science literature comprising vast amounts of textual data has created significant bottlenecks in knowledge extraction, synthesis, and scientific reasoning. Large Language Models (LLMs) offer unprecedented opportunities to accelerate materials research through automated analysis and prediction. Still, their effective deployment for materials discovery requires domain-specific adaptation for language understanding and solving domain-relevant tasks. Here, we present LLaMat, a family of foundational models for materials science, developed through continued pretraining of LLaMA models on an extensive corpus of materials literature and crystallographic data, followed by instruction- and task-finetuning. Through systematic evaluation, we demonstrate that LLaMat excels in materials-specific natural language processing and structured information extraction tasks, outperforming commercial LLMs while maintaining general linguistic capabilities. The specialized LLaMat-CIF variant demonstrates remarkable capabilities in crystal structure generation, predicting stable crystals with high coverage across the periodic table. Intriguingly, despite LLaMA-3's superior performance in comparison to LLaMA-2, we observe that LLaMat-2 demonstrates unexpectedly enhanced domain-specific performance across diverse materials science tasks, including structured information extraction from text and tables and crystal structure generation. These results point to a potential adaptation rigidity in overtrained LLMs such as LLaMA-3. Altogether, the present work demonstrates the effectiveness of domain adaptation towards the development of practically deployable LLM copilots for materials research. Beyond materials science, our findings reveal important considerations for domain adaptation of LLMs: model selection, tr
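The core technique the abstract describes, continued pretraining via next-token prediction on domain text, can be sketched as a toy training loop. Everything here is an illustrative stand-in: a tiny embedding-plus-linear model and random token IDs in place of the tokenized materials corpus, not the actual LLaMat/LLaMA architecture or data.

```python
# Toy sketch of the next-token prediction objective used in continued
# pretraining. Model, vocabulary size, and data are illustrative
# stand-ins, not the LLaMat/LLaMA configuration.
import torch
import torch.nn as nn

torch.manual_seed(0)

VOCAB, DIM = 256, 64
# A minimal "language model": token embedding followed by a linear head
# that scores every vocabulary item as the next token.
model = nn.Sequential(nn.Embedding(VOCAB, DIM), nn.Linear(DIM, VOCAB))
opt = torch.optim.AdamW(model.parameters(), lr=1e-2)

# Stand-in for a tokenized domain corpus: 8 sequences of 32 token IDs.
tokens = torch.randint(0, VOCAB, (8, 32))

losses = []
for _ in range(20):
    logits = model(tokens[:, :-1])  # predict token t+1 from token t
    loss = nn.functional.cross_entropy(
        logits.reshape(-1, VOCAB), tokens[:, 1:].reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()
    losses.append(loss.item())
```

In continued pretraining, this same objective is simply resumed from an already-pretrained checkpoint on the new domain corpus; instruction- and task-finetuning then follow with supervised input/output pairs.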
