Skip to content

GCX is grammar compressor based on the induced suffix ordering technique introduced in GCIS.

Notifications You must be signed in to change notification settings

DanyelleAngelo/GCX

Repository files navigation

Grammar Compression modulo X (GCX)

Introduction

GCX is grammar compressor based on the induced suffix ordering technique introduced in GCIS. Our method builds on the factorization of the DC3 algorithm to create the rules of a straight line grammar for the input string.

GCX supports very fast extraction operation on the encoded grammar without the need to decompress the complete string.

In this version, the size of the *grammar rules varies at each level, this value is calculated based on the average LCP of all suffixes (ordered) in the text.

Compilation

TBD

Compression

To compress a text, execute:

./main -c <plain_file> <compressed_file>

Decompression

To decompress a text, execute:

./main -d <compressed_file> <decompressed_file> 

Extraction

To extract substrings from the compressed text, execute:

./main -e <compressed_file> <output_file> <queries_file>

About

GCX is grammar compressor based on the induced suffix ordering technique introduced in GCIS.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published