PERMUTATION INVARIANCE FOR REPRESENTING LINEARIZED TABULAR DATA

An embodiment for encoding permutation-invariant representations of linearized tabular data. The embodiment may receive input including tabular data and linearize a column or row within the received tabular data. The embodiment may automatically assign an increasing sequence of position identifiers...

Full description

Saved in:
Bibliographic Details
Main Authors MIHINDUKULASOORIYA, NANDANA, Bagchi, Sugato, Gliozzo, Alfio Massimiliano, Dash, Sarthak
Format Patent
LanguageEnglish
Published 21.12.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:An embodiment for encoding permutation-invariant representations of linearized tabular data. The embodiment may receive input including tabular data and linearize a column or row within the received tabular data. The embodiment may automatically assign an increasing sequence of position identifiers to each non-delimiting tokenized cell in the linearized column or row until a header delimiter is reached. The embodiment may, in response to reaching the header delimiter, automatically assign a monotonically increasing sequence of position identifiers for each non-delimiting tokenized cell positioned after the header delimiter, restarting from an integer corresponding to 1 greater than the position identifier assigned to the header delimiter for each non-delimiting tokenized cell positioned after cell delimiters. The embodiment may automatically assign a static position identifier for each of the cell delimiters in the linearized column or row and output an encoded permutation-invariant representation of the linearized column or row.
Bibliography:Application Number: US202217807461