Skip to content

adnanardhian/text-compression

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation

text-compression

text compression with 75% compression ratio (maximum)

Huffman coding is used, and will compress 8-bit of every lower case alphabet and 6 additonal symbol become 6-bit. Total 32 characters are chosen from 128 Non-Extended ASCII code.

Both input and output file are .csv file. The text data is from cornell movie dialog corpus https://www.cs.cornell.edu/~cristian/Cornell_Movie-Dialogs_Corpus.html

About

text compression with 75% compression ratio (maximum)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages