Fast JSON parser using metaprogramming on GPU
Published in | 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA) pp. 1 - 10 |
---|---|
Main Authors | , , , |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 13.10.2022 |
Summary: | We present a parallel GPU JSON parser that uses metaprogramming to optimize both parsing and the initial data transformation. It outperforms well-known solutions such as simdjson, Pandas, and cuDF, which also runs on the GPU. The resulting data can be processed further in common data frame formats and consumed by RAPIDS, Apache Arrow, or Pandas. The parser can therefore serve as part of an industrial Extract-Transform-Load workflow. |
---|---|
DOI: | 10.1109/DSAA54385.2022.10032381 |