Releases: gcelano/LemmatizedAncientGreekXML
Lemmatized Ancient Greek Texts (v1.2.5)
Changes:
- Lemmas for prepositions and particles have been corrected, along with a few clear mistakes concerning article lemmas. This has dramatically increased the number of lemmas available in the corpus: 21493806 lemmas against 25522507 tokens.
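
The two figures above can be turned into a coverage ratio. The following is a minimal sketch, assuming each counted lemma corresponds to exactly one lemmatized token (the release note does not state this explicitly); the names `LEMMAS`, `TOKENS`, and `coverage` are illustrative only.

```python
# Figures quoted in the v1.2.5 release note.
LEMMAS = 21_493_806
TOKENS = 25_522_507


def coverage(lemmas: int, tokens: int) -> float:
    """Return the share of tokens that carry a lemma, as a fraction.

    Assumption: one counted lemma equals one lemmatized token.
    """
    return lemmas / tokens


if __name__ == "__main__":
    print(f"{coverage(LEMMAS, TOKENS):.1%} of tokens have a lemma")
    print(f"{TOKENS - LEMMAS:,} tokens have no lemma")
```

Under that assumption, roughly 84% of the tokens are lemmatized and about 4 million remain without a lemma.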
LemmatizedAncientGreekXML (v1.2.4)
Changes:
- The codepoint "’" is used both as an apostrophe and as a quotation mark. Some known issues stemming from (wrong) Betacode conversion make tokenization for this codepoint hard to handle. This version corrects some tokenization errors by moving the quotation mark "’" into a separate token.
LemmatizedAncientGreekXML (v1.2.3)
Changes:
- The position of "’" for elision has been corrected, i.e., it is now placed in the same element as the elided word. (This error was due to the fact that the apostrophe had been encoded with different codepoints.)
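
Users who process their own Greek text alongside this corpus may run into the same problem of mixed apostrophe codepoints. The following is a minimal sketch of one way to normalize them to "’" (U+2019) before tokenizing. The set `APOSTROPHE_LIKE` and the function `normalize_apostrophes` are assumptions for illustration; they are not the procedure used to build the corpus.

```python
# Characters that sometimes stand in for the elision apostrophe in Greek text.
# This set is an illustrative assumption, not the list used by the corpus.
APOSTROPHE_LIKE = {
    "\u0027",  # APOSTROPHE
    "\u02BC",  # MODIFIER LETTER APOSTROPHE
    "\u1FBD",  # GREEK KORONIS
    "\u1FBF",  # GREEK PSILI
}

# RIGHT SINGLE QUOTATION MARK, the codepoint named in the release notes.
TARGET = "\u2019"

# Translation table mapping every apostrophe-like character to TARGET.
_TABLE = str.maketrans({ch: TARGET for ch in APOSTROPHE_LIKE})


def normalize_apostrophes(text: str) -> str:
    """Replace apostrophe-like characters with U+2019."""
    return text.translate(_TABLE)


if __name__ == "__main__":
    sample = "ἀλλ' ἐγώ"
    print(normalize_apostrophes(sample))
```

Normalization alone does not decide whether a given "’" is an apostrophe or a closing quotation mark; it only ensures that a single codepoint has to be handled afterwards.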
LemmatizedAncientGreekXML (v1.2.2)
Changes:
- In tlg0018.tlg010.opp-grc1.xml and tlg0018.tlg015.opp-grc1.xml the erroneous ’Kv at the beginning of the first sentence has been corrected to Ἐν
- In tlg0018.tlg019.opp-grc1.xml the erroneous ’Η at the beginning of the first sentence has been corrected to Ἡ
- The position of "’" has been corrected, i.e., it is now put at the end of a sentence
- Duplicate l1 and l2 have been deleted
LemmatizedAncientGreekXML (v1.2.1)
Improved documentation in README.md
Lemma redundancy deleted
LemmatizedAncientGreekXML (v1.2.0)
Update README.md
LemmatizedAncientGreekXML (v1.0.1)
An XML version of the Morpheus database, which has been used to retrieve lemmas, has been added.
LemmatizedAncientGreekXML (v1.0.0)
This release contains the data as they have been automatically produced, without attempting manual corrections.