- Implemented a Markov-chain-like model for 2- and 3-word ngrams (5-word ngrams did not fit in the free-tier Shiny deployment)
Shiny Dashboard Implementation
- Also calculate probabilities for single words (1-grams), possibly for the top 1k words only
- Remove low frequency ngrams
- Implement statistical testing (chi-squared)
- Flask endpoint for faster prediction, implemented separately. The code was changed to load the model object only the first time.
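
The ngram items above (2- and 3-word contexts, a single-word fallback, and removal of low-frequency ngrams) can be illustrated with a minimal Python sketch. This is not the project's actual code: the function names `build_model` and `predict`, the `min_count` threshold, and the back-off order are all assumptions made for illustration.

```python
from collections import Counter


def build_model(tokens, n, min_count=1):
    """Map each (n-1)-word context to a Counter of the words that follow it.

    Hypothetical helper: ngrams seen fewer than min_count times are dropped.
    """
    counts = {}
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        next_word = tokens[i + n - 1]
        counts.setdefault(context, Counter())[next_word] += 1

    # Prune low-frequency ngrams to keep the model small.
    pruned = {}
    for context, followers in counts.items():
        kept = Counter({w: c for w, c in followers.items() if c >= min_count})
        if kept:
            pruned[context] = kept
    return pruned


def predict(models, unigrams, history, k=3):
    """Return up to k (word, probability) pairs for the next word.

    models maps n (2, 3, ...) to the output of build_model. The longest
    matching context is tried first; if none matches, fall back to the
    overall single-word (1-gram) frequencies.
    """
    for n in sorted(models, reverse=True):
        context = tuple(history[-(n - 1):])
        if len(context) == n - 1 and context in models[n]:
            followers = models[n][context]
            total = sum(followers.values())
            return [(w, c / total) for w, c in followers.most_common(k)]

    total = sum(unigrams.values())
    return [(w, c / total) for w, c in unigrams.most_common(k)]
```

In this sketch a call might look like `predict({2: build_model(tokens, 2), 3: build_model(tokens, 3)}, Counter(tokens), ["the", "cat"])`. Restricting the fallback to the top 1k words would amount to keeping only `Counter(tokens).most_common(1000)` as the 1-gram table.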