TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Single document news summarization has seen substantial progress on faithfulness in recent years, driven by research on the evaluation of factual consistency, or hallucinations. We ask whether these advances carry over to other text summarization domains. We propose a new evaluation benchmark on top...
Main Authors | |
---|---|
Format | Journal Article |
Language | English |
Published | 20.02.2024 |