A cleaned and pre-processed Romanian Wikipedia dump for evaluating language model capacity and perplexity.
Updated Jun 3, 2021
A demo of domain corpus bootstrapping using language model perplexity.
An implementation of one natural language processing method: language modelling.