selnolig-check

This repository contains the source code and documentation of selnolig-check. selnolig-check tests the German ligature suppression patterns of the LuaLaTeX package selnolig for morphological correctness and relative completeness, based on an extensive corpus.

We conducted the majority of this project as our final project for the class Introduction to Computational Linguistics at the University of Massachusetts at Amherst in the fall of 2012.

Running the Programs

In order to run the programs, two external resources are required, which are not included in this repository:

The programs are meant to be run in the following order:

  1. in src/testing_dictionary/:
    1. corpus_to_words
    2. words_to_ligs
    3. ligs_to_ligdict
  2. in src/selnolig_check/:
    1. ligdict_to_smor (this is just a script to call SMOR with the correct input and output files)
    2. smor_to_morphemes
    3. morphemes_to_analyses
    4. analyses_to_errors
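
The run order above can be sketched as a small Python driver. This is only an illustration, not part of the repository: it assumes each program is an executable file named exactly like its step, that it takes no command-line arguments, and that it should be started from inside its own directory. The names `PIPELINE` and `run_pipeline` are hypothetical.

```python
from pathlib import Path
import subprocess

# Run order as listed above: (directory, program name).
PIPELINE = [
    ("src/testing_dictionary", "corpus_to_words"),
    ("src/testing_dictionary", "words_to_ligs"),
    ("src/testing_dictionary", "ligs_to_ligdict"),
    ("src/selnolig_check", "ligdict_to_smor"),
    ("src/selnolig_check", "smor_to_morphemes"),
    ("src/selnolig_check", "morphemes_to_analyses"),
    ("src/selnolig_check", "analyses_to_errors"),
]


def run_pipeline(repo_root=".", run=subprocess.run):
    """Run every step in order, each from inside its own directory."""
    for directory, program in PIPELINE:
        workdir = Path(repo_root) / directory
        # Assumption: the step is an executable named like the step itself
        # and needs no arguments. Adjust the command if that is not the case.
        # check=True stops the pipeline as soon as one step fails.
        run(["./" + program], cwd=workdir, check=True)
```

Calling `run_pipeline()` from the repository root would then execute the seven steps in sequence and stop at the first failure, which matters here because each step appears to consume the output of the previous one.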

Licenses

The code is licensed under a Simplified BSD License; see the file LICENSE.md.

The documentation is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.