NLP3o is a set of tools for Natural Language Processing, and a web app for educational purpose.
The goal is to understand how the main NLP algorithms work. No efficiency nor speed.
The name NLP3o is a word game between NLP and the protocol robot C3PO.
- Tokenise a text
- Remove stop words (English, German, Italian dictionaries)
- Pre-process text (remove punctuation, lower case)
- Extract top words
Everything you need is detailed in the requirements text file (req.txt).
This app is built using Python 3
To clone and run this application, you'll need Git.
# Clone this repository
$ git clone https://github.com/Mashimo/NLP3o
# Install dependencies
# Please refer to Python and Flask documentation
To start the app you just run this script.
On OS X, Linux and Cygwin you have to indicate that this is an executable file before you can run it:
$ chmod a+x run.py
Then the script can simply be executed as follows:
./run.py
On Windows you have to run the script as an argument to the Python interpreter from the virtual environment, e.g.
$ flask\Scripts\python run.py
After the server initializes it will listen on port 5000 waiting for connections.
Now open up your web browser and enter one of the following URLs in the address field:
http://localhost:5000
http://localhost:5000/index
Home page
Settings
Results page
The app can be tested on a Heroku dyno too. (soon)
This project is licensed under the MIT License - see the LICENSE file for details
- Hat tip to dataBASIC.io for the inspiration
- the Stanford NLP course