Presentation
Configuring Large Language Models for Regional Ocean Model Development
Description
Recent work at NSF NCAR has developed Python packages and documentation for instantiating regional ocean models in the Community Earth System Model, but how can we guide a community of users through the subsequent tuning and development of purpose-built models? Here, we leverage recent advances in natural language processing and large language models (LLMs) to explore novel tools for guiding regional model development. We demonstrate that a curated, high-quality dataset based on a small number of interviews with experts can be used to fine-tune or context-prompt LLMs for use in regional modeling. This style of training data (regional modeling narratives) emphasizes the importance of high-quality, disciplinary data in LLM development, and it has the potential to provide access to previously siloed, institutional experience. In the future, we aim to grow this dataset and incorporate more technical documentation that the LLM can dynamically retrieve to inform more concrete guidance.

Event Type
Research and ACM SRC Posters
Time
Tuesday, 18 November 2025, 8:00am - 5:00pm CST
Location
Second Floor Atrium
