NLP Course

Natural Language Processing course at HUJI.

From basic to advanced models.

Classification, Word Embedding, Sentiment Analysis, Information Extraction, Semantic Role Labeling, Machine Translation, Pretraining and Transformers.

Text Generation - Markov Language Models

Unigram and bigram models, a back-off model, and linear interpolation smoothing.
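As an illustration, here is a minimal sketch (not the repository's code) of a bigram model with linear interpolation smoothing; the toy corpus and the lambda weights are assumptions for the example.

```python
# A minimal sketch of a bigram language model with linear interpolation
# smoothing; the corpus and interpolation weights are illustrative.
from collections import Counter

corpus = [["the", "dog", "barks"], ["the", "cat", "sleeps"]]

unigram_counts = Counter(w for sent in corpus for w in sent)
bigram_counts = Counter(
    (w1, w2) for sent in corpus for w1, w2 in zip(sent, sent[1:])
)
total_tokens = sum(unigram_counts.values())

LAMBDA_BIGRAM, LAMBDA_UNIGRAM = 0.7, 0.3  # assumed interpolation weights


def interpolated_prob(prev_word: str, word: str) -> float:
    """P(word | prev_word) as a weighted mix of bigram and unigram MLEs."""
    p_unigram = unigram_counts[word] / total_tokens
    prev_count = unigram_counts[prev_word]
    p_bigram = bigram_counts[(prev_word, word)] / prev_count if prev_count else 0.0
    return LAMBDA_BIGRAM * p_bigram + LAMBDA_UNIGRAM * p_unigram


print(interpolated_prob("the", "dog"))  # mixes P(dog|the) with P(dog)
```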

Part-of-Speech Tagging - Hidden Markov Models

A bigram HMM tagger, the Viterbi algorithm, and add-one smoothing.
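For illustration, a minimal sketch (not the repository's implementation) of Viterbi decoding for a bigram HMM tagger; the toy transition and emission tables are placeholders, and add-one smoothing is assumed to have been applied when estimating them (it is not shown here).

```python
# Viterbi decoding for a toy bigram HMM tagger.
import math

tags = ["DET", "NOUN", "VERB"]
transitions = {  # P(tag_i | tag_{i-1}), including the start state "<s>"
    ("<s>", "DET"): 0.8, ("<s>", "NOUN"): 0.1, ("<s>", "VERB"): 0.1,
    ("DET", "NOUN"): 0.9, ("DET", "DET"): 0.05, ("DET", "VERB"): 0.05,
    ("NOUN", "VERB"): 0.8, ("NOUN", "NOUN"): 0.1, ("NOUN", "DET"): 0.1,
    ("VERB", "DET"): 0.5, ("VERB", "NOUN"): 0.4, ("VERB", "VERB"): 0.1,
}
emissions = {  # P(word | tag); unseen pairs get a tiny floor probability below
    ("DET", "the"): 0.9, ("NOUN", "dog"): 0.5, ("VERB", "barks"): 0.5,
}


def viterbi(words):
    # best[i][tag] = (log-prob of the best path ending in tag, backpointer)
    best = [{t: (math.log(transitions[("<s>", t)])
                 + math.log(emissions.get((t, words[0]), 1e-6)), None)
             for t in tags}]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            emit = math.log(emissions.get((t, words[i]), 1e-6))
            prev_tag, score = max(
                ((p, best[i - 1][p][0] + math.log(transitions[(p, t)]) + emit)
                 for p in tags), key=lambda x: x[1])
            column[t] = (score, prev_tag)
        best.append(column)
    # Follow backpointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for i in range(len(words) - 1, 0, -1):
        tag = best[i][tag][1]
        path.append(tag)
    return list(reversed(path))


print(viterbi(["the", "dog", "barks"]))  # expected: ['DET', 'NOUN', 'VERB']
```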

Sentiment Analysis - Log-Linear Model, Word2Vec Representation, LSTM Model

Building three models - a simple log-linear model, a log-linear model with Word2Vec embeddings, and an LSTM - trained on the Stanford Sentiment Treebank dataset (movie reviews and their sentiment values).

Using PyTorch.

Simple log-linear model: this model uses a simple one-hot embedding of the words to perform the task.
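A minimal PyTorch sketch of such a log-linear model, assuming (as an illustration, not the repository's exact setup) that each review is represented as the average of its words' one-hot vectors; the vocabulary size and the random batch are placeholders.

```python
# A toy log-linear sentiment model over averaged one-hot word vectors.
import torch
import torch.nn as nn

VOCAB_SIZE = 10_000  # assumed vocabulary size


class LogLinear(nn.Module):
    def __init__(self, embedding_dim: int = VOCAB_SIZE):
        super().__init__()
        self.linear = nn.Linear(embedding_dim, 1)

    def forward(self, x):          # x: (batch, embedding_dim) averaged one-hots
        return torch.sigmoid(self.linear(x)).squeeze(-1)


model = LogLinear()
loss_fn = nn.BCELoss()             # binary sentiment (positive / negative)
batch = torch.rand(4, VOCAB_SIZE)  # placeholder for averaged one-hot vectors
labels = torch.tensor([1.0, 0.0, 1.0, 0.0])
loss = loss_fn(model(batch), labels)
```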

Word2Vec log-linear model: This model is almost identical to the simple log-linear model, except it uses pre-trained Word2Vec embeddings instead of a simple one-hot embedding.

LSTM model: In this model, each LSTM cell receives as input the Word2Vec embedding of a word in the input sentence. The model then takes the two hidden states of the LSTM layer (the last hidden state of each direction of the bi-LSTM layer) and concatenates them. This concatenation is passed through a linear layer, and the sigmoid of the result is the final output.
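A minimal PyTorch sketch of this bi-LSTM architecture; the embedding dimension (300, a common Word2Vec size) and the hidden size are assumptions, not the repository's actual hyperparameters.

```python
# A toy bi-LSTM sentiment model over Word2Vec word embeddings.
import torch
import torch.nn as nn

W2V_DIM, HIDDEN_DIM = 300, 100  # assumed dimensions


class SentimentLSTM(nn.Module):
    def __init__(self):
        super().__init__()
        self.lstm = nn.LSTM(W2V_DIM, HIDDEN_DIM, batch_first=True,
                            bidirectional=True)
        self.linear = nn.Linear(2 * HIDDEN_DIM, 1)

    def forward(self, embeddings):            # (batch, seq_len, W2V_DIM)
        _, (h_n, _) = self.lstm(embeddings)   # h_n: (2, batch, HIDDEN_DIM)
        # Concatenate the last hidden state of the forward and backward passes.
        concat = torch.cat([h_n[0], h_n[1]], dim=-1)
        return torch.sigmoid(self.linear(concat)).squeeze(-1)


model = SentimentLSTM()
fake_batch = torch.randn(4, 12, W2V_DIM)  # stand-in for Word2Vec embeddings
print(model(fake_batch).shape)            # torch.Size([4])
```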

Dependency Parsing - Maximum Spanning Tree Parser

Implementing an MST (Maximum Spanning Tree) parser for unlabeled dependency parsing, trained with the perceptron algorithm, with inference performed using the Chu-Liu/Edmonds algorithm.

For the feature function components based on word bigrams, there is a feature (with value 1) for every pair of words connected by an arc. For the components based on POS tag bigrams, we use the part-of-speech tags given in the test set (which saves us from running a POS tagger). The scoring function is defined as the dot product of the feature function with a weight vector w. For inference (computing the MST), we use our modified version (max instead of min) of the Chu-Liu/Edmonds algorithm.
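A minimal sketch (not the repository's code) of the arc feature function, the dot-product scoring, and a perceptron update, with sparse features stored as dicts; the example sentence, arcs, and learning rate are illustrative, and the Chu-Liu/Edmonds inference step is omitted.

```python
# Sparse arc features, dot-product scoring, and a perceptron update.
from collections import defaultdict

weights = defaultdict(float)  # the weight vector w, indexed by feature name


def arc_features(head, modifier, words, tags):
    """Binary (value-1) features for an arc: word bigram and POS-tag bigram."""
    return {
        f"word:{words[head]}->{words[modifier]}": 1.0,
        f"pos:{tags[head]}->{tags[modifier]}": 1.0,
    }


def arc_score(head, modifier, words, tags):
    """Score of an arc = dot product of its feature vector with w."""
    feats = arc_features(head, modifier, words, tags)
    return sum(weights[f] * v for f, v in feats.items())


def perceptron_update(gold_arcs, predicted_arcs, words, tags, lr=1.0):
    """Reward features of gold arcs, penalize features of predicted arcs."""
    for h, m in gold_arcs:
        for f, v in arc_features(h, m, words, tags).items():
            weights[f] += lr * v
    for h, m in predicted_arcs:
        for f, v in arc_features(h, m, words, tags).items():
            weights[f] -= lr * v


words = ["<ROOT>", "the", "dog", "barks"]
tags = ["ROOT", "DET", "NOUN", "VERB"]
perceptron_update(gold_arcs=[(0, 3), (3, 2), (2, 1)],
                  predicted_arcs=[(0, 1), (1, 2), (2, 3)],
                  words=words, tags=tags)
print(arc_score(3, 2, words, tags))  # arcs seen only in the gold tree score higher
```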

Pretraining and Transformers

Solving a classification task in three different ways, with three different portions of the dataset. First model: a Logistic Regression model using a normalized Bag-of-Words representation. Second model: fine-tuning a Transformer model with a SequenceClassification head and an appropriate tokenizer. Third model: running zero-shot classification for this task using a pretrained model.

Using scikit-learn and the transformers library from Hugging Face.
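A hedged sketch of the three approaches; the example texts, labels, and model checkpoints below are illustrative placeholders (not the repository's actual dataset or models), and the fine-tuning training loop is omitted for brevity.

```python
# Three ways to tackle the same classification task.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import normalize
from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline

texts = ["great movie", "terrible plot"]  # placeholder training texts
labels = [1, 0]                           # placeholder labels

# 1) Logistic Regression on a normalized Bag-of-Words representation.
bow = normalize(CountVectorizer().fit_transform(texts))
clf = LogisticRegression().fit(bow, labels)

# 2) A Transformer with a SequenceClassification head and a matching tokenizer
#    (the actual fine-tuning loop, e.g. with transformers.Trainer, is omitted).
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2)
encoded = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

# 3) Zero-shot classification with a pretrained NLI model.
zero_shot = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
print(zero_shot("great movie", candidate_labels=["positive", "negative"]))
```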
