text compression with 75% compression ratio (maximum)
Huffman coding is used, and will compress 8-bit of every lower case alphabet and 6 additonal symbol become 6-bit. Total 32 characters are chosen from 128 Non-Extended ASCII code.
Both input and output file are .csv file. The text data is from cornell movie dialog corpus https://www.cs.cornell.edu/~cristian/Cornell_Movie-Dialogs_Corpus.html