This application process a directory of files and converts them into ngrams of a user specified size.
The menu has 8 options
- The user can specify a directory for the files
- The user can specify the ngram size
- The user can specify a output file path and name
- Build the ngrams and output to csv
- Exit the app
whitespaceFiltering
they can toggle the regex that removes all none lettercalculatePercentFrequency
will add a thrid column to the csv with the percentage frequancy that ngram appeared in totaluseSlidingNGramCalculation
will slide along the source text moving 1 at a time or jumping the size of an ngram each time
My menu updates to show your current settings and also starts with some reasonable defaults.