Analyzes texts for compliance with the Zipf's second law and Heaps' law.
Having Python 3 installed, clone the project and install its dependencies:
git clone git@github.com:ZitRos/edu-texts-analyzer.git
cd edu-texts-analyzer
pip3 install -r requirements.txt
Texts for analysis are taken from texts
directory.
Every file in this directory and its subdirectories will be treated as a text file.
There are already some articles, but you may place your own.
Having Python 3 installed, install dependencies and run the program:
py index.py
It will generate Zipf.xlsx
file with word ranks/frequencies data.
Output will go to Heaps.xlsx
file.