Language_predictor

This project was created in a group of two as a part od OOP course. The main goal was to predict whether the text that we are providing is English, Polish or German. The algorithm is based on monogram and bigram counts( the frequency of appering certain letters in language) in articles from Wikipedia. We used jsoup parsers to extract raw text from HTML.

Another functionality is text prediction. The algorithms suggest possible words based on the previous one.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.idea		.idea
out/production/PO_PROJECT		out/production/PO_PROJECT
src		src
.gitignore		.gitignore
ENGLISH_URLS.txt		ENGLISH_URLS.txt
GERMAN_URLS.txt		GERMAN_URLS.txt
POLISH_URLS.txt		POLISH_URLS.txt
PO_PROJECT.iml		PO_PROJECT.iml
README.md		README.md
jsoup-1.12.1.jar		jsoup-1.12.1.jar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Language_predictor

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

jakubcaputa/Language_predictor

Folders and files

Latest commit

History

Repository files navigation

Language_predictor

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages