Intro2NLP

An introductory tutorial on how to do Natural Language Processing using NLTK (Natural Language Toolkit) in Python.

intro2NLP.ipynb: a Jupyter notebook which shows how to access, clean, and analyze a corpus using the nltk library.

After accessing Jane Austen's Sense and Sensibility on the nltk.corpus package, I preprocess the text by e.g. removing stopwords and punctuations, then plot the distribution of word frequency and apply sentiment analysis using the textblob library. To show how to do sentiment analysis using a classifier, I train a Naive Bayes classifier on the movie_review dataset available on nltk.corpus.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
austen-sense.txt		austen-sense.txt
intro2NLP.ipynb		intro2NLP.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intro2NLP

About

Releases

Packages

Languages

pirmoradian/Intro2NLP

Folders and files

Latest commit

History

Repository files navigation

Intro2NLP

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages