Finetuning Language Models to Emit Linguistic Expressions of Uncertainty

Large language models (LLMs) are increasingly employed in information-seeking and decision-making tasks. Despite their broad utility, LLMs tend to generate information that conflicts with real-world facts, and their persuasive style can make these inaccuracies appear confident and convincing. As a r...

Full description

Saved in:

Bibliographic Details
Main Authors	Chaudhry, Arslan, Thiagarajan, Sridhar, Gorur, Dilan
Format	Journal Article
Language	English
Published	18.09.2024
Subjects	Computer Science - Computation and Language Computer Science - Learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Large language models (LLMs) are increasingly employed in information-seeking and decision-making tasks. Despite their broad utility, LLMs tend to generate information that conflicts with real-world facts, and their persuasive style can make these inaccuracies appear confident and convincing. As a result, end-users struggle to consistently align the confidence expressed by LLMs with the accuracy of their predictions, often leading to either blind trust in all outputs or a complete disregard for their reliability. In this work, we explore supervised finetuning on uncertainty-augmented predictions as a method to develop models that produce linguistic expressions of uncertainty. Specifically, we measure the calibration of pre-trained models and then fine-tune language models to generate calibrated linguistic expressions of uncertainty. Through experiments on various question-answering datasets, we demonstrate that LLMs are well-calibrated in assessing their predictions, and supervised finetuning based on the model's own confidence leads to well-calibrated expressions of uncertainty, particularly for single-claim answers.
DOI:	10.48550/arxiv.2409.12180