phrases

A preprocessed corpus of Lojban sentences for machine translation exercises.

Contains sentences from the jboTatoeba project. If a sentence has been translated by gleki, ilmen, uakci, or jelca, their translation is shown instead of the original translation at Tatoeba.
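The selection rule above could be sketched roughly as follows. This is only an illustration: the function name `pick_translation` and the `(author, text)` data layout are hypothetical and are not taken from the scripts that built this corpus.

```python
# Translators whose versions take priority (names from the description above).
PREFERRED_TRANSLATORS = {"gleki", "ilmen", "uakci", "jelca"}

def pick_translation(original, alternatives):
    """Return a preferred translator's text if one exists, else the original.

    original:     the translation as found at Tatoeba (str)
    alternatives: list of (author, text) pairs (hypothetical layout)
    """
    for author, text in alternatives:
        if author in PREFERRED_TRANSLATORS:
            return text
    return original
```

If several preferred translators have translated the same sentence, this sketch simply takes the first one it encounters; the actual corpus may resolve that case differently.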

Lojban sentences are additionally preprocessed: diacritic orthography is removed, cmavo clusters are split, dots are removed, and sentences using {zoi} are dropped.
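A minimal sketch of some of these steps is given below, under stated assumptions. It treats "diacritic orthography" as accented letters that can be reduced to their base letters, which may not match the exact rule used for this corpus. It does not attempt cmavo cluster splitting, which needs a Lojban morphology tool. The function names are hypothetical.

```python
import re
import unicodedata

def strip_diacritics(text):
    # Assumption: diacritics are combining marks on base letters (e.g. "á" -> "a").
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def uses_zoi(sentence):
    # True if the sentence contains the cmavo "zoi" as a separate word.
    return re.search(r"\bzoi\b", sentence) is not None

def preprocess(sentences):
    cleaned = []
    for sentence in sentences:
        if uses_zoi(sentence):
            continue  # sentences using {zoi} are dropped entirely
        sentence = strip_diacritics(sentence)
        sentence = sentence.replace(".", " ")  # remove dots
        cleaned.append(" ".join(sentence.split()))  # collapse extra whitespace
    return cleaned
```

For example, `preprocess([".i mi klama", "mi cusku zoi gy. hello gy."])` keeps only the first sentence and returns it without the leading dot.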

Sentences marked as B are removed.