SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classificati...

Full description

Saved in:
Bibliographic Details
Main Authors Gandhi, Apurva, Serrao, Ryan, Fang, Biyi, Antonius, Gilbert, Hong, Jenna, Nguyen, Tra My, Yi, Sheng, Nosakhare, Ehi, Shaffer, Irene, Srinivasan, Soundararajan, Gupta, Vivek
Format Journal Article
LanguageEnglish
Published 08.11.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.
DOI:10.48550/arxiv.2211.04454