Natural Language Processing

Dataset used: Twitter codemix data.

Calculated Code Mixing Index(CMI) for each tweet and seperated tweets into 10 sets based on the CMI values. For each set we found perplexity, and found the relation between CMI and Perplexity on the data we collected.
Each folder has README.md inside describing what we have done.

Contributors:

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
LanguageModelling		LanguageModelling
Perplexity_CMI		Perplexity_CMI
README.md		README.md