This library/tool will generate semi-indexes on JSON files as per the paper: Semi-Indexing Semi-Structured Data in Tiny Space.
For a given JSON file, hw-json-simd will generate two semi-index files, which together can be loaded into a single in-memory semi-index.
The semi-index files can be generated using two methods, called standard and simple, which correspond to sections 4 and 5 of the Semi-Indexing Semi-Structured Data in Tiny Space paper respectively.
Navigation of the JSON text using the standard index is supported by the hw-json project; see that project for more information. There is currently no support for navigation of the JSON text using the simple index.
cat test.json | pv -t -e -b -a | time hw-json-simd create-index \
  -i /dev/stdin \
  --output-ib-file test.json.ib.idx \
  --output-bp-file test.json.bp.idx \
  --method standard

cat test.json | pv -t -e -b -a | time hw-json-simd create-index \
  -i /dev/stdin \
  --output-ib-file test.json.ib.idx \
  --output-bp-file test.json.bp.idx \
  --method simple
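
The two commands above differ only in the --method argument, so they can be wrapped in a small shell helper. The following is a minimal sketch, not part of hw-json-simd: the function names ib_path, bp_path and create_index are hypothetical, the default of standard is a choice made by this helper rather than by the tool, and the output names simply follow the test.json.ib.idx / test.json.bp.idx pattern used above. It only uses the flags shown in the commands above and feeds the file through /dev/stdin, as they do.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; they are not shipped with hw-json-simd.

# Print the index file paths for a JSON file, following the
# <file>.ib.idx / <file>.bp.idx naming used in the commands above.
ib_path() { printf '%s.ib.idx\n' "$1"; }
bp_path() { printf '%s.bp.idx\n' "$1"; }

# Build both semi-index files for one JSON file.
#   $1: path to the JSON file
#   $2: method, either standard (this helper's default) or simple
create_index() {
  json=$1
  method=${2:-standard}

  # Reject anything other than the two methods described in this README.
  case $method in
    standard|simple) ;;
    *) echo "unknown method: $method" >&2; return 2 ;;
  esac

  # Same invocation as above, with the JSON file redirected to stdin.
  hw-json-simd create-index \
    -i /dev/stdin \
    --output-ib-file "$(ib_path "$json")" \
    --output-bp-file "$(bp_path "$json")" \
    --method "$method" < "$json"
}
```

With these functions sourced into a shell, create_index test.json would write test.json.ib.idx and test.json.bp.idx using the standard method, and create_index test.json simple would do the same with the simple method. Note that both methods write to the same file names, so running one after the other overwrites the earlier index files.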
cabal new-install hw-json-simd