Skip to content

ololobus/slavic_text_scht

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

St. Petersburg Corpus of Hagiographic Texts

Old Church Slavic corpus

http://project.phil.spbu.ru/scat/page.php?page=project

Parser

Run to get entire xml text.

./tei_parser.py xml/Aleksandr_svirskij.xml

TODO:

  • return text sentence by sentence
  • return text clause by clause
  • keep info about named entities (<name> tag)

Releases

No releases published

Packages

No packages published

Languages