AuroraGPT: Foundation Models for Science

Researchers are building science-oriented foundation models to accelerate discoveries and innovation in biology, materials science, chemistry and other fields.

The AuroraGPT project leverages the Aurora exascale supercomputer at the Argonne Leadership Computing Facility (ALCF) to develop and enhance understanding of powerful foundation models (FMs), such as large language models, for scientific discovery. The ALCF is a U.S. Department of Energy Office of Science user facility.

By creating FMs for science—while developing underlying capabilities, tools and workflows, data resources, and other processes and artifacts—the AuroraGPT project aims to significantly improve how science is conducted by fostering a deeper integration of AI capabilities into research workflows.

Scientists are creating and evaluating a series of increasingly powerful FMs, each with more parameters and/or trained on more data than those that precede it. The models are designed to assist researchers in making more informed and efficient discoveries. The AuroraGPT research program focuses on producing this sequence of FMs while ensuring that each provides both a scientifically useful capability and knowledge concerning scientific and computational performance to guide the design of the next model in the sequence.

Intended outcomes from the project include the following:

Datasets and data pipelines for preparing science training data
Software infrastructure and workflows to train, evaluate, and deploy LLMs at scale for scientific research
Evaluation of state-of-the-art LLM models to determine where they fall short in deep scientific tasks and where deep data may have an impact
Assessment of the value of augmenting web training data with science-specific data (full-text scientific papers, structured scientific datasets)
Research grade artifacts (models) for the scientific community and adaptation for downstream uses
Promotion of responsible AI best practices
International collaborations around the long-term goal of AI for science