From 06e6f4e3ee6ff3283ea35e691fd3168c74915f09 Mon Sep 17 00:00:00 2001
From: "Giuseppe G. A. Celano"
Date: Tue, 8 Aug 2017 14:56:27 +0200
Subject: [PATCH] Update README.md

---
 README.md | 7 +++----
 1 file changed, 3 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index 1ee4f8a..811e02d 100644
--- a/README.md
+++ b/README.md
@@ -2,13 +2,12 @@
 
 This repository contains Ancient Greek texts which have been tokenized, POS-tagged, sentence-splitted, and lemmatized automatically. The texts come from the following repositories, which currently contain most of the Ancient Greek texts freely accessible over the internet:
 
-1. https://github.com/PerseusDL/canonical-greekLit/releases/tag/0.0.4
-2. https://github.com/OpenGreekAndLatin/First1KGreek/releases/tag/v1.1
+1. https://github.com/PerseusDL/canonical-greekLit/releases/tag/0.0.236
+2. https://github.com/OpenGreekAndLatin/First1KGreek/releases/tag/1.1.1802
 
 As for the tokenization, POS tagging and sentence splitting, the data rely on those provided in:
 
-1. https://github.com/gcelano/CTSAncientGreekXML
-2. https://github.com/gcelano/POStaggedAncientGreekXML
+1. https://github.com/gcelano/POStaggedAncientGreekXML/releases/tag/v1.2.0
 
 Refer to these repositories for further documentation. In the present repository, the POS tag + the word form of a token have been automatically linked to those contained in Morpheus and MorpheusUnderPhilologic. Since the latter databases also contain lemmata, this allowed their automatic extraction.
 
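
Note on the linking step described in the README text: the lemmata are obtained by matching a token's word form together with its POS tag against a morphological database that also stores lemmata. The sketch below only illustrates that idea and is not the repository's actual code. The `MORPH_DB` dictionary, its two entries, the tag strings, and the `link_lemmata` helper are all hypothetical stand-ins, and the real Morpheus and MorpheusUnderPhilologic data may be structured quite differently.

```python
import unicodedata
from typing import Dict, List, Tuple

Key = Tuple[str, str]  # (word form, POS tag)


def make_key(form: str, pos: str) -> Key:
    """Normalize the form to NFC so visually identical Greek strings compare equal."""
    return (unicodedata.normalize("NFC", form), pos)


# Hypothetical stand-in for a Morpheus-style database: (form, POS tag) -> lemmata.
# Both the entries and the tag strings are invented for illustration.
MORPH_DB: Dict[Key, List[str]] = {
    make_key("λόγου", "n-s---mg-"): ["λόγος"],
    make_key("ἔλεγε", "v3siia---"): ["λέγω"],
}


def link_lemmata(form: str, pos: str, db: Dict[Key, List[str]] = MORPH_DB) -> List[str]:
    """Return the lemmata of the entry matching both form and POS tag, or an empty list."""
    return db.get(make_key(form, pos), [])


# Tokens as (form, POS tag) pairs, e.g. taken from an already POS-tagged text.
tokens = [("λόγου", "n-s---mg-"), ("ἔλεγε", "v3siia---"), ("λόγου", "v3siia---")]
lemmatized = [(form, pos, link_lemmata(form, pos)) for form, pos in tokens]
```

Keying the lookup on the form and the tag together, rather than on the form alone, means a form is only linked to a lemma when the database analysis agrees with the tag already assigned to the token. The third token above therefore gets no lemma.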