Google Code-in project: converter for tags from UD treebank format to Apertium format.

alxmamaev/UdTree2Apertium

UdTree2Apertium

This utility converts UD treebank (CoNLL-U) files to Apertium format.

How it works

First, you need to produce a raw Apertium file. Example for English:

cat en-ud-train.conllu | grep -e '^$' -e '^[0-9]' | cut -f2 | sed 's/$/¶/g' | apertium-destxt | lt-proc -w ~/source/apertium//languages/apertium-eng/eng.automorf.bin | apertium-retxt | sed 's/¶//g' > en-ud-train.apertium
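
The first three stages of the pipeline above (grep, cut, sed) only pull the word forms out of the CoNLL-U file and mark line ends before the text is handed to apertium-destxt and lt-proc. As a rough illustration, the same extraction could be sketched in Python as follows. The function name extract_forms is hypothetical and is not part of this repository; the sketch does not replace the Apertium tools.

```python
def extract_forms(conllu_lines):
    """Mimic: grep -e '^$' -e '^[0-9]' | cut -f2 | sed 's/$/¶/g'

    Hypothetical helper for illustration only; not part of converter.py.
    """
    out = []
    for line in conllu_lines:
        line = line.rstrip("\n")
        # grep: keep empty lines (sentence breaks) and lines starting with a digit
        if line == "" or line[0] in "0123456789":
            fields = line.split("\t")
            # cut -f2: take the second tab-separated field (the word form);
            # like cut, pass the line through unchanged if it has no tab
            form = fields[1] if len(fields) > 1 else line
            # sed: append the pilcrow marker to the end of every line
            out.append(form + "¶")
    return out


if __name__ == "__main__":
    sample = [
        "# sent_id = 1\n",
        "1\tAl\tAl\tPROPN\n",
        "2\t-\t-\tPUNCT\n",
        "\n",
    ]
    for form in extract_forms(sample):
        print(form)
```

Comment lines (starting with #) are dropped, and each blank line that separates sentences survives as a bare ¶ marker.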

Then you need to run this utility:

python converter.py tags/eng.csv en-ud-train.apertium en-ud-train.conllu eng.tagged

Example

alxmamaev@alxmamaev-pc MINGW32 ~/Projects/UdTree2Apertium (master)
$ python converter.py tags/eng.csv en-ud-train.apertium en-ud-train.conllu eng.tagged
^Al/Al<np><ant><m><sg>$
^-/-<guio>$
^Zaman/Zaman<np><cog><sg>$
^:/:<sent>$
^American/American<adj>$
^forces/force<n><pl>$
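
Each output line above is one lexical unit: ^surface form/lemma followed by tags in angle brackets, closed by $. To inspect such output, a minimal parsing sketch could look like the following. The name parse_unit and the regular expression are assumptions made for illustration, not code from converter.py. The sketch handles only single-analysis units like the ones shown here and does not handle escaped characters or units with several analyses.

```python
import re

# Matches one unit such as ^forces/force<n><pl>$ (single analysis only).
UNIT_RE = re.compile(
    r"^\^(?P<surface>[^/]*)/(?P<lemma>[^<$]*)(?P<tags>(?:<[^>]+>)*)\$$"
)


def parse_unit(unit):
    """Split one unit into (surface, lemma, [tags]). Hypothetical helper."""
    m = UNIT_RE.match(unit)
    if m is None:
        raise ValueError("not a single-analysis lexical unit: %r" % unit)
    tags = re.findall(r"<([^>]+)>", m.group("tags"))
    return m.group("surface"), m.group("lemma"), tags


if __name__ == "__main__":
    for unit in ["^forces/force<n><pl>$", "^-/-<guio>$"]:
        print(parse_unit(unit))
```

A unit with more than one analysis (several readings separated by /) raises ValueError here instead of being silently misparsed.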
