Document Intelligence Metrics for Visually Rich Document Evaluation
The processing of Visually-Rich Documents (VRDs) is highly important in information extraction tasks associated with Document Intelligence. We introduce DI-Metrics, a Python library devoted to VRD model evaluation comprising text-based, geometric-based and hierarchical metrics for information extrac...
Saved in:
Main Authors | , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
23.05.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The processing of Visually-Rich Documents (VRDs) is highly important in
information extraction tasks associated with Document Intelligence. We
introduce DI-Metrics, a Python library devoted to VRD model evaluation
comprising text-based, geometric-based and hierarchical metrics for information
extraction tasks. We apply DI-Metrics to evaluate information extraction
performance using publicly available CORD dataset, comparing performance of
three SOTA models and one industry model. The open-source library is available
on GitHub. |
---|---|
DOI: | 10.48550/arxiv.2205.11215 |