Skip to content
This repository has been archived by the owner on Aug 4, 2024. It is now read-only.

ZeCariocaUnicamp/nlp_ptbr_datasets

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Datasets in Portuguese to NLP

This repository aggregates different Natural Language Processing datasets in Brazilian Portuguese.

Datasets generated with Miner-XenForo

https://github.com/MatheusFerraroni/datasets_from_minerxenforo

MultiWOZ-PTBR

Script used to translate https://github.com/MatheusFerraroni/multiwoz-PTBR

Translated files https://github.com/MatheusFerraroni/multiwoz-PTBR/tree/master/data/MultiWOZ_2.2

BrWac

Cleaner version of BrWac. Removed sites where the content may be political or offensive http://www.lrc.ic.unicamp.br/~ferraroni/datasets/BrWac-CLEAN/

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published