GCX is grammar compressor based on the induced suffix ordering technique introduced in GCIS. Our method builds on the factorization of the DC3 algorithm to create the rules of a straight line grammar for the input string.
GCX supports very fast extraction operation on the encoded grammar without the need to decompress the complete string.
In this version, the size of the *grammar rules varies at each level, this value is calculated based on the average LCP of all suffixes (ordered) in the text.
TBD
To compress a text, execute:
./main -c <plain_file> <compressed_file>
To decompress a text, execute:
./main -d <compressed_file> <decompressed_file>
To extract substrings from the compressed text, execute:
./main -e <compressed_file> <output_file> <queries_file>