jakubcaputa/Language_predictor

Language_predictor

This project was created by a group of two as part of an OOP course. The main goal is to predict whether a given text is English, Polish, or German. The algorithm is based on monogram and bigram counts (how often particular letters and letter pairs appear in a language), computed from Wikipedia articles. We used the jsoup parser to extract raw text from HTML.
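The idea of comparing monogram and bigram counts can be sketched roughly as follows. This is only an illustration, not the project's actual code: the class name `NgramLanguageGuesser`, its methods, the tiny sample sentences, and the choice of cosine similarity as the scoring function are all assumptions made for the example. In the project the training text would come from Wikipedia articles rather than hard-coded strings.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: the class name, method names, and scoring are assumptions.
class NgramLanguageGuesser {
    // language name -> (monogram or bigram -> count)
    private final Map<String, Map<String, Integer>> profiles = new HashMap<>();

    /** Counts single letters and pairs of adjacent letters, ignoring non-letters. */
    static Map<String, Integer> countNgrams(String text) {
        Map<String, Integer> counts = new HashMap<>();
        String lower = text.toLowerCase(Locale.ROOT);
        for (int i = 0; i < lower.length(); i++) {
            if (!Character.isLetter(lower.charAt(i))) continue;
            counts.merge(lower.substring(i, i + 1), 1, Integer::sum);
            if (i + 1 < lower.length() && Character.isLetter(lower.charAt(i + 1))) {
                counts.merge(lower.substring(i, i + 2), 1, Integer::sum);
            }
        }
        return counts;
    }

    /** Stores the n-gram counts of a sample text as the profile of a language. */
    void train(String language, String sampleText) {
        profiles.put(language, countNgrams(sampleText));
    }

    /** Cosine similarity of two count vectors; 0 if either one is empty. */
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0, normA = 0, normB = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            normA += (double) e.getValue() * e.getValue();
            Integer other = b.get(e.getKey());
            if (other != null) dot += (double) e.getValue() * other;
        }
        for (int v : b.values()) normB += (double) v * v;
        return (normA == 0 || normB == 0) ? 0 : dot / Math.sqrt(normA * normB);
    }

    /** Returns the best-matching language, or null if nothing overlaps. */
    String guess(String text) {
        Map<String, Integer> target = countNgrams(text);
        String best = null;
        double bestScore = 0;
        for (Map.Entry<String, Map<String, Integer>> e : profiles.entrySet()) {
            double score = cosine(target, e.getValue());
            if (score > bestScore) {
                bestScore = score;
                best = e.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        NgramLanguageGuesser guesser = new NgramLanguageGuesser();
        // Tiny stand-in samples; real profiles need far more text.
        guesser.train("English", "this is our new house and a very good dog");
        guesser.train("Polish", "to jest nasz nowy dom i bardzo dobry pies");
        guesser.train("German", "das ist unser neues haus und ein sehr guter hund");
        System.out.println(guesser.guess("the house is very good"));
    }
}
```

Cosine similarity is scale-invariant, so raw counts can be compared directly even when the input is much shorter than the training text. With samples this small the guess is unreliable; the sketch is only meant to show the shape of the computation.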

Another feature is text prediction: the algorithm suggests possible next words based on the previous word.
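One common way to suggest a word from the previous one is a word-level bigram table that records which words followed which in the training text. The sketch below assumes that approach; the README does not say how the project implements it, and the class name `NextWordSuggester`, its methods, and the tie-breaking rule are hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: a word-bigram table, not the project's actual implementation.
class NextWordSuggester {
    // previous word -> (following word -> how many times it followed)
    private final Map<String, Map<String, Integer>> followers = new HashMap<>();

    /** Splits the corpus on non-letters and counts each adjacent word pair. */
    void train(String corpus) {
        String previous = null;
        for (String word : corpus.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (word.isEmpty()) continue; // a leading separator yields an empty token
            if (previous != null) {
                followers.computeIfAbsent(previous, k -> new HashMap<>())
                         .merge(word, 1, Integer::sum);
            }
            previous = word; // note: pairs are also counted across sentence boundaries
        }
    }

    /** Returns up to limit followers of previousWord, most frequent first. */
    List<String> suggest(String previousWord, int limit) {
        Map<String, Integer> counts = followers.getOrDefault(
                previousWord.toLowerCase(Locale.ROOT), Collections.emptyMap());
        List<String> candidates = new ArrayList<>(counts.keySet());
        // Sort by count descending; break ties alphabetically for stable output.
        candidates.sort((a, b) -> {
            int byCount = Integer.compare(counts.get(b), counts.get(a));
            return byCount != 0 ? byCount : a.compareTo(b);
        });
        int end = Math.max(0, Math.min(limit, candidates.size()));
        return new ArrayList<>(candidates.subList(0, end));
    }

    public static void main(String[] args) {
        NextWordSuggester suggester = new NextWordSuggester();
        suggester.train("The cat sat. The cat ran. The dog sat.");
        System.out.println(suggester.suggest("the", 2)); // prints [cat, dog]
    }
}
```

An unseen previous word simply yields an empty list, which a caller could handle by falling back to the most frequent words overall.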
