Skip to content

Entry-level workshop about Natural Language Processing using spaCy library presented at Warsaw IT Days 2019.

Notifications You must be signed in to change notification settings

knum-mimuw/spacy-workshop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Natural Language Processing using spaCy library

Building aspect-based sentiment analysis system to analyze product reviews using spaCy and Keras.

This workshop was originally presented at Warsaw IT Days 2019 by Stanisław Giziński and Krzysztof Kowalczyk from our Machine Learning Club.

We are also currently implementing full support for Polish language in spaCy, you can track our progress via our GitHub organization and the see results on our project website

Environment setup

We will be using Anaconda distribution of Python to make installation of machine learning libraries easier. Any other distribution of Python>=3.7 should do fine, but if you want to have exactly the same setup:

  1. Clone this repository: git clone https://github.com/knum-mimuw/spacy-workshop
  2. Download and install Miniconda Python 3.7 installer, make sure to add binaries to PATH variable when prompted at the end of installation
  3. Open the terminal (on Windows, use newly installed Anaconda Prompt instead of CMD / Powershell)
  4. Create conda environment: conda create -n spacy-wdi python=3.7.1 spacy jupyterlab, this may take a while
  5. Activate the environment: source activate spacy-wdi (on Windows: activate spacy-wdi)
  6. Download machine learning models: python -m spacy download en
  7. Navigate to the cloned repository folder (cd spacy-workshop) and start jupyter lab (jupyter lab)
  8. Download "The Guardian Articles" dataset and extract it (there is only one CSV in there, place it in the cloned repository folder.
  9. Download "Semeval Aspect-Based Sentiment Analysis" dataset. Unfortunately you have to create an account there, because we are not allowed to redistribute this dataset directly due to licensing issues.

To check if the setup process was completed, go to localhost:8888, select "new console: Python 3" and type the following lines into the console:

import spacy
nlp = spacy.load("en_core_web_sm")

If the code doesn't crash, everything was installed correctly.

Workshop

During the workshops, we will be:

About

Entry-level workshop about Natural Language Processing using spaCy library presented at Warsaw IT Days 2019.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published