Fast JSON parser using metaprogramming on GPU

Bibliographic Details
Published in: 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA), pp. 1-10
Main Authors: Kaczmarski, Krzysztof; Narebski, Jakub; Piotrowski, Stanislaw; Przymus, Piotr
Format: Conference Proceeding
Language: English
Published: IEEE, 13.10.2022
Summary: We demonstrate a new parallel GPU JSON parser that uses metaprogramming to optimize parsing and the initial data transformation. It outperforms other well-known solutions such as simdjson and Pandas, as well as cuDF, which also runs on the GPU. The resulting data is ready for further processing in common data frame formats and can be consumed by RAPIDS, Apache Arrow, or Pandas. Our parser can therefore be part of an industrial Extract-Transform-Load workflow.
DOI: 10.1109/DSAA54385.2022.10032381