Skip to content

Commit 86b2b1e

Browse files
committedDec 3, 2018
small changes
1 parent 4dacc50 commit 86b2b1e

File tree

2 files changed

+17
-0
lines changed

2 files changed

+17
-0
lines changed
 

‎.gitignore

+2
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,8 @@ alpha.twolc
88
*#*
99
.DS_Store
1010
docs/_*
11+
lexgues/*.lexc
12+
lexgues/*.text
1113
tmp/*
1214
history*
1315
parser.out

‎docs/lexguessing.rst

+15
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,22 @@ Human-readable dictionaries often use quite large numbers of distinct inflection
1616
Patterns for inflection classes
1717
===============================
1818

19+
Each inflection class can be described with one or more *patterns*. Patterns are written as a CSV file, i.e. a table with a few columns:
1920

21+
CONT
22+
Continuation class from which the affixes can be attached to the stems described by this pattern. The continuation is the same thing as in LEXC entries.
23+
24+
ICLASS
25+
Inflection class, e.g. the number used in a dictionary.
26+
27+
MPHON
28+
A pattern which describes the lexicon entries may belong to this class (ICLASS). The pattern resembles LEXC lexicon entries. The pattern may be either a regular expression ``<...>`` or a single exceptional lexeme. The regular expressions follow the XFST syntax used in LEXC except that curly braces ``{}`` are reserved for names of morphophonemes where they are not quoted with a percent sign ``%``, thus e.g. ``a:{ao}`` is just a symbol pair. The input side of regular expressions is the base form of lexemes and the output side is the morphophonemic representation. Epsilons **0** may be used when something in the base-form corresponds to nothing in the morphophonemic representation and vice versa. (Be careful not to mix zero symbols ``Ø`` and epsilons **0**. Remember that the zero symbol occurs olny as a part of morphophonemes, e.g. ``a:{aØ}`` in these patterns, never by itself)
29+
30+
COMMENT
31+
Comments do not affect the results but they are otherwise useful for the developer or maintainer of the patterns.
32+
33+
34+
Regular expressions may consist of concatenation, Kleene star and plus, brackets for grouping, parenthesis for optionality, disjunction and intersection. In particular, one may define expressions and use them in other definitions or patterns. In the first column of a definition is a keyword ``Define``, in the second column is the name of the defined expression, and in the third column is the expression.
2035

2136
Compiling a guessing FST
2237
========================

0 commit comments

Comments
 (0)