This repository aggregates different Natural Language Processing datasets in Brazilian Portuguese.
https://github.com/MatheusFerraroni/datasets_from_minerxenforo
Script used to translate https://github.com/MatheusFerraroni/multiwoz-PTBR
Translated files https://github.com/MatheusFerraroni/multiwoz-PTBR/tree/master/data/MultiWOZ_2.2
Cleaner version of BrWac. Removed sites where the content may be political or offensive http://www.lrc.ic.unicamp.br/~ferraroni/datasets/BrWac-CLEAN/