Code to generate/annotate/handle lists of frequency from corpora.
Normalize Word Frequency v0.1.5
R code to normalize raw frequency counts into fpmw, fpbw, zipf, zipf per billion and other popular measures to indicate word frequency. To use Normalize Word Frequency :
Prepare your input file:
- make sure your txt or csv input files have a header: the column with raw frequency you want to normalize must be called "Frequency"
Set input specifications in the code:
- set the paths for input and output files (line 28-30)
- set the file extension (36)
- set the file separators (48)
- set size of corpus (70) - this version reports the size of Itwac.
Set output specifications in the code:
- choose what transformations to apply by commenting/ uncommenting (56-65)
This version has been deprecated.