
JacyNumbers

FrancisBond edited this page Jul 3, 2007 · 7 revisions

Numbers, Numeral Classifiers and more

Kanji Numbers

Hindu-Arabic numbers

Ideally, we should parse numbers in preprocessing and pass them through using SMAF.

ChaSen settings

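We set ChaSen to join consecutive number tokens (名詞 数, noun/number) and consecutive alphabet tokens (記号 アルファベット, symbol/alphabet) into single tokens. The same setting is shown with the Japanese keyword and with its English equivalent: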

(連結品詞 ((名詞 数))
          ((記号 アルファベット)))
(COMPOSIT_POS ((名詞 数))
          ((記号 アルファベット)))

Work Around

Until we have a proper preprocessor, we just add all numbers from 0 to 999, using the script utils/make-num.perl. More can be added as needed.
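
To illustrate what enumerating this range involves, here is a minimal Python sketch. It is not the actual utils/make-num.perl, whose output format is not described on this page. The names `to_kanji` and `number_forms` are hypothetical, and pairing each Hindu-Arabic form with a kanji form is an assumption made only for illustration.

```python
# Hypothetical sketch: NOT the real utils/make-num.perl.
# It only enumerates surface forms; the lexicon entry format is not assumed.

KANJI_DIGITS = "〇一二三四五六七八九"


def to_kanji(n: int) -> str:
    """Return the kanji numeral for 0 <= n <= 999 (e.g. 321 -> 三百二十一)."""
    if not 0 <= n <= 999:
        raise ValueError("only 0 to 999 are covered")
    if n == 0:
        return KANJI_DIGITS[0]
    hundreds, rest = divmod(n, 100)
    tens, ones = divmod(rest, 10)
    parts = []
    for count, unit in ((hundreds, "百"), (tens, "十")):
        if count:
            # A leading 一 is conventionally dropped before 百 and 十.
            parts.append(("" if count == 1 else KANJI_DIGITS[count]) + unit)
    if ones:
        parts.append(KANJI_DIGITS[ones])
    return "".join(parts)


def number_forms(limit: int = 1000):
    """Yield (arabic, kanji) surface-form pairs for 0 .. limit - 1 (limit <= 1000)."""
    for n in range(limit):
        yield str(n), to_kanji(n)


if __name__ == "__main__":
    for arabic, kanji in number_forms():
        print(f"{arabic}\t{kanji}")
```

Extending the range beyond 999 would require handling 千 and 万, which this sketch deliberately leaves out.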

