Texts Statistical Analysis

Analyzes texts for compliance with Zipf's second law and Heaps' law.

Prerequisites

Having Python 3 installed, clone the project and install its dependencies:

git clone git@github.com:ZitRos/edu-texts-analyzer.git
cd edu-texts-analyzer
pip3 install -r requirements.txt

Texts for analysis are read from the texts directory. Every file in this directory and its subdirectories is treated as a text file. The repository already includes some articles, and you can add your own.
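A minimal sketch of how such a directory could be gathered, in case you want to check which files will be picked up. The helper names `list_text_files` and `read_texts` are hypothetical and are not taken from index.py, and UTF-8 encoding is an assumption:

```python
from pathlib import Path


def list_text_files(root="texts"):
    """Return every regular file under root, including subdirectories."""
    # rglob("*") walks the tree recursively; directories are filtered out.
    return sorted(p for p in Path(root).rglob("*") if p.is_file())


def read_texts(root="texts"):
    """Yield (path, content) pairs, treating every file as plain text."""
    for path in list_text_files(root):
        # errors="replace" keeps the run going if a file is not valid UTF-8
        # (the encoding is an assumption of this sketch).
        yield path, path.read_text(encoding="utf-8", errors="replace")
```

Because no extension filter is applied, a stray non-text file placed in the directory would be read as text too, which matches the behaviour described above.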

Zipf's Law

With the prerequisites above installed, run the program:

py index.py

It generates a Zipf.xlsx file containing word rank and frequency data.
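To illustrate what rank/frequency data looks like, here is a rough sketch that is independent of the project's own code. The function names and the letters-only tokenization are assumptions, not what index.py necessarily does. Since Zipf's second law is usually stated in terms of how many distinct words occur a given number of times, the sketch also computes that frequency spectrum:

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into runs of letters."""
    # [^\W\d_]+ matches Unicode letters only (no digits or underscores).
    return re.findall(r"[^\W\d_]+", text.lower())


def rank_frequencies(text):
    """Return (rank, word, count) rows, most frequent word first."""
    counts = Counter(tokenize(text))
    # Ties are broken alphabetically so the ranking is deterministic.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(rank, word, count)
            for rank, (word, count) in enumerate(ordered, start=1)]


def frequency_spectrum(text):
    """Map each frequency f to the number of distinct words occurring f times."""
    counts = Counter(tokenize(text))
    return dict(sorted(Counter(counts.values()).items()))
```

For example, `rank_frequencies("The cat and the dog and the bird")` puts "the" at rank 1 with count 3 and "and" at rank 2 with count 2, while `frequency_spectrum` on the same text reports three words that occur once.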

Heaps' Law

The output is written to the Heaps.xlsx file.
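Heaps' law is commonly written as V(n) = K * n^beta, where V(n) is the number of distinct words among the first n tokens. The sketch below shows one way the vocabulary growth curve and the two parameters could be estimated. The function names are hypothetical, the log-log least-squares fit is a choice made for this example, and none of it is taken from the project's code:

```python
import math


def vocabulary_growth(tokens, step=1):
    """Return (tokens_seen, distinct_words) pairs sampled every `step` tokens."""
    seen = set()
    points = []
    for n, token in enumerate(tokens, start=1):
        seen.add(token)
        if n % step == 0:
            points.append((n, len(seen)))
    return points


def fit_heaps(points):
    """Least-squares fit of log V = log K + beta * log n; returns (K, beta).

    Needs at least two points with different token counts.
    """
    xs = [math.log(n) for n, _ in points]
    ys = [math.log(v) for _, v in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    return math.exp(mean_y - beta * mean_x), beta
```

Passing the token list of a text to `vocabulary_growth` and the resulting points to `fit_heaps` gives estimates of K and beta that can be compared against the plotted curve.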
