Skip to main content
Article | Mathematics and Computer Science

WildfireGPT: A New Tool for Wildfire Analysis

WildfireGPT uses retrieval-augmented generation to enhance predictions and response strategies for safer wildfire management.

Wildfires destroy vegetation, animal habitats, infrastructure, homes and human lives. As the severity of wildfires has increased in recent years, researchers have amassed enormous volumes of data about the causes, risks and impact. But how can all this data be analyzed effectively?

What role can large language models play in wildfire management?

Large language models (LLMs) can handle massive amounts of data. Their success in enhancing natural language comprehension has now spurred the development of LLMs for scientific applications. With a focus on wildfire analysis, researchers at the U.S. Department of Energy’s (DOE) Argonne National Laboratory and the University of Pennsylvania have developed WildfireGPT.

Why is Wildfire GPT needed?

LLMs typically are trained by searching information from various domains. The Argonne-UPenn team recognized that while general-purpose LLMs can be useful for a wide range of tasks, they may not be optimal for specific domains such as wildfire management. For decision-makers seeking information about wildfire risks, responses to queries must be domain-specific. Equally important, the responses must be scientifically accurate so that informed decisions can be made. WildfireGPT is a prototype LLM designed to meet these requirements.

How does WildfireGPT differ from other LLM models?

WildfireGPT is trained by using a comprehensive dataset of wildfire-related data, including historical fire records, weather patterns and vegetation characteristics. A distinctive element of the WildfireGPT design is its use of retrieval-augmented generation, or RAG,” said Tanwi Mallick, an assistant computer scientist in Argonne’s Mathematics and Computer Science (MCS) division. RAG allows scientific literature and projections about natural hazards to supplement simpler searches and be included in the LLM agent’s response.”

How does it work?

The LLM agent interacts with the user, asking questions to better focus the inquiry and formulate a plan that addresses the user’s requirements. The LLM agent evaluates the user’s input to decide whether additional information is needed. If not, the LLM generates a response for the user. If further information is needed, the LLM uses a tool from the toolbox module to retrieve external information. The retrieved information is combined with information in the memory module, which includes elements from previous conversations, and the LLM agent then generates a response for the user (see Fig. 1).

Fig. 1: Overview of the LLM agent in a retrieval-augmented generation framework. The agent evaluates user input, retrieves relevant information, and, if needed, uses tools to search for additional data. It then merges retrieved data with past interactions stored in memory to generate a response.
Who will use WildfireGPT?

By providing a powerful tool for analyzing vast amounts of data and scientific research, WildfireGPT has significant potential for revolutionizing wildfire analysis. WildfireGPT can be used to examine previous wildfire events, identify patterns and make predictions about future fire risk. This information could be valuable for firefighters, land managers and policymakers in exploring targeted responses to wildfires, including resource allocation, thereby mitigating the destructive effects of wildfires and enhancing public safety.

Experts found WildfireGPT’s interactive capabilities particularly helpful, enabling more human-like consulting experience than provided by a search engine,” said Robert Ross, a senior computer scientist in Argonne’s MCS division.  

How well has WildfireGPT been tested?

The researchers tested the model’s performance on four case studies: across the United States: ecosystem fire management in Naperville, IL; urban wildfire mitigation in Beaverton, OR; community hazard mitigation planning in Mora Country, NM; and post-wildfire public safety in Sangre De Cristo Mountains, NM. Results showed that WildfireGPT performs well, with overall scores ranging from approximately 98% to 93% for relevance, correctness, accessibility and entailment, respectively. Feedback from experts noted that WildfireGPT was useful in boosting productivity by quickly finding and interpreting relevant data and literature.

What are the benefits?

The development of the WildfireGPT model represents a step forward in applying large language models to the complex domain of wildfire management. By tailoring the model to the specific needs of this field, the researchers have demonstrated the potential for LLMs to provide valuable insights and support for critical decision-making. As a prototype, WildfireGPT lays the foundation for further development and integration of LLMs into wildfire management.

What are its current limitations and what are the next steps?

One limitation noted was in communication: some users found WildfireGPT’s lengthy responses made extracting information difficult. Also, WildfireGPT occasionally had difficulty transitioning from defining the user’s concerns to developing a plan and generating a response with data-driven evidence. In addition to addressing these concerns, the researchers plan further case studies with different domain experts to evaluate WildfireGPT’s capabilities in addressing real-world problems.

Where can people find further information?

For further information, see the paper by Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su, and Camillo J. Taylor, WildfireGPT: Tailored Large Language Model for Wildfire Analysis,” arXiv preprint arXiv:2402.07877 (v2), 2024.

Argonne National Laboratory seeks solutions to pressing national problems in science and technology by conducting leading-edge basic and applied research in virtually every scientific discipline. Argonne is managed by UChicago Argonne, LLC for the U.S. Department of Energy’s Office of Science.

The U.S. Department of Energy’s Office of Science is the single largest supporter of basic research in the physical sciences in the United States and is working to address some of the most pressing challenges of our time. For more information, visit https://​ener​gy​.gov/​s​c​ience.