Skip to content

Following Kevin Knight’s tradition, Japanese KATAKANA and English are used to demonstrate the linguistic diversity and to illustrate transliteration and translation. For this project NLP’s modern quantitative techniques and statistical methods, dynamic programming algorithms (Viterbi, CKY, Forward-Backward, Inside-Outside, and Beam Search), and …

Notifications You must be signed in to change notification settings

csdankim/NLP_KATAKANA_ENG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Natural Language Processing

Project: Cipher / Decipher Katakana and English

 

Goal

Following Kevin Knight's tradition, understand and implement followings:

  • finite-state machines (weighted FSAs and FSTs)
  • syntactic structures (weighted context-free grammars and parsing algorithms)
  • machine learning methods (maximum likelihood and expectation-maximization)
  • modern quantitative techniques in NLP that use large corpora and statistical learning
  • various dynamic programming algorithms (Viterbi, CKY, Forward-Backward, and Inside-Outside)
  • Japanese language as a running example to demonstrate the linguistic diversity, to illustrate transliteration and translation, and to understand the Viterbi and EM algorithms
  • For the linguistic background of Japanese, please see this video.
  • For finite-state toolkit, USC ISI's CARMEL is used.

 

Contents

About

Following Kevin Knight’s tradition, Japanese KATAKANA and English are used to demonstrate the linguistic diversity and to illustrate transliteration and translation. For this project NLP’s modern quantitative techniques and statistical methods, dynamic programming algorithms (Viterbi, CKY, Forward-Backward, Inside-Outside, and Beam Search), and …

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published