Lab Task-4: Part-of-Speech (POS) Tagging, Chunking and Named Entity Recognition (NER) using RNN and Bi-LSTM.
The submission folder 19074045-Megha-Agarwal consists of three sub-folders:
- Chunking
  - Chunking_RNN
    - Chunking_RNN.ipynb
    - Chunking_RNN_model.pb
  - Chunking_Bi-LSTM
    - Chunking_Bi-LSTM.ipynb
    - Chunking_Bi-LSTM_model.pb
- NER
  - NER_RNN
    - NER_RNN.ipynb
    - NER_RNN_model.pb
  - NER_Bi-LSTM
    - NER_Bi-LSTM.ipynb
    - NER_Bi-LSTM_model.pb
- POS
  - POS_RNN
    - POS_RNN.ipynb
    - POS_RNN_model.pb
  - POS_Bi-LSTM
    - POS_Bi-LSTM.ipynb
    - POS_Bi-LSTM_model.pb
Each model sub-folder contains:
- Google Colab notebook (.ipynb) – the complete code that trains and tests the model.
- Trained model (.pb) – the saved trained model, i.e., RNN or Bi-LSTM.
A recurrent neural network (RNN) is a neural network suited to modeling sequential information. Although it can, in theory, capture long-distance dependencies, in practice it suffers from the vanishing/exploding gradient problem. An RNN remembers the past, and its decisions are informed by what it has learned from earlier time steps.
A recurrent neural network looks very similar to a feedforward neural network, except that it also has connections pointing backwards. At each time step t (also called a frame), the RNN receives the input x(t) as well as its own output from the previous time step, y(t−1). Since there is no previous output at the first time step, it is usually set to 0.
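The recurrence can be illustrated in a few lines of NumPy. This is a minimal sketch with illustrative dimensions and random weights, not the code from the notebooks:

```python
import numpy as np

# Unroll a single recurrent layer over time: the output at step t
# depends on the input x(t) and the previous output y(t-1).
rng = np.random.default_rng(0)
n_inputs, n_neurons, n_steps = 3, 5, 4        # illustrative sizes

Wx = rng.normal(size=(n_inputs, n_neurons))   # input-to-output weights
Wy = rng.normal(size=(n_neurons, n_neurons))  # recurrent weights
b = np.zeros(n_neurons)

X = rng.normal(size=(n_steps, n_inputs))      # one sequence of 4 frames
y = np.zeros(n_neurons)                       # y(0) is set to 0

for t in range(n_steps):
    # y(t) = tanh(x(t)·Wx + y(t-1)·Wy + b)
    y = np.tanh(X[t] @ Wx + y @ Wy + b)
    print(f"step {t}: {np.round(y, 3)}")
```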
- The main advantage of an RNN is that it can model a sequence of data (e.g., a time series) so that each sample can be assumed to depend on the previous ones.
- Recurrent neural networks are even used together with convolutional layers to extend the effective pixel neighbourhood.
A bidirectional recurrent neural network (Bi-RNN) is essentially two independent RNNs put together. This structure allows the network to have both backward and forward information about the sequence at every time step. A bidirectional layer runs the input in two directions, one from past to future and one from future to past. What distinguishes this approach from a unidirectional one is that the LSTM running backwards preserves information from the future, so by combining the two hidden states the network can, at any point in time, use information from both past and future (illustrated in the sketch below).
- It solves the problem of fixed sequence-to-sequence prediction.
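A conceptual sketch in Keras (sizes are illustrative assumptions): wrapping an LSTM in Bidirectional runs one copy forwards and one backwards and concatenates their hidden states at every time step, which doubles the feature dimension of the output:

```python
import tensorflow as tf
from tensorflow.keras import layers

x = tf.random.normal((1, 6, 8))  # (batch, time, features), illustrative
uni = layers.LSTM(16, return_sequences=True)(x)
bi = layers.Bidirectional(layers.LSTM(16, return_sequences=True))(x)
print(uni.shape)  # (1, 6, 16) - forward states only
print(bi.shape)   # (1, 6, 32) - forward and backward states concatenated
```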
Part-of-Speech (POS) tagging can be defined as the process of assigning one of the parts of speech to a given word. In simple words, POS tagging is the task of labelling each word in a sentence with its appropriate part of speech. Parts of speech include nouns, verbs, adverbs, adjectives, pronouns, conjunctions and their sub-categories. For example, in "The dog runs", "The" is a determiner, "dog" a noun, and "runs" a verb.
Here, we implement it using an RNN (Recurrent Neural Network) and a Bi-LSTM (Bidirectional LSTM).
First, we import all the necessary libraries and download the conll2000 dataset:
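A minimal sketch of this setup step, assuming NLTK's bundled copy of the CoNLL-2000 corpus and a Keras/TensorFlow stack (the exact imports in the notebook may differ):

```python
import nltk
from nltk.corpus import conll2000

nltk.download("conll2000")  # POS-tagged and chunked WSJ text
print(conll2000.tagged_sents()[0][:4])
# [('Confidence', 'NN'), ('in', 'IN'), ('the', 'DT'), ('pound', 'NN')]
```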
Step 1: Process the dataset, separate the data into words (x) and tags (y), and split it into training, testing, and validation sets.
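A sketch of this step, continuing from the imports above (variable names, the padding length of 100, and the 80/10/10 split are assumptions):

```python
from sklearn.model_selection import train_test_split
from tensorflow.keras.preprocessing.sequence import pad_sequences

tagged = conll2000.tagged_sents()
words = [[w for w, t in sent] for sent in tagged]  # x: word sequences
tags = [[t for w, t in sent] for sent in tagged]   # y: tag sequences

# Integer-encode words and tags (0 is reserved for padding), then pad.
word2id = {w: i + 1 for i, w in enumerate({w for s in words for w in s})}
tag2id = {t: i + 1 for i, t in enumerate({t for s in tags for t in s})}
X = pad_sequences([[word2id[w] for w in s] for s in words], maxlen=100)
Y = pad_sequences([[tag2id[t] for t in s] for s in tags], maxlen=100)

# 80% train, 10% validation, 10% test (assumed split).
X_train, X_rest, Y_train, Y_rest = train_test_split(X, Y, test_size=0.2, random_state=42)
X_val, X_test, Y_val, Y_test = train_test_split(X_rest, Y_rest, test_size=0.5, random_state=42)
```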
Step 2: Implement both the RNN and Bi-LSTM models, plot their results, and finally visualize and compare the two models (a sketch follows).
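A sketch of the two taggers, continuing from the Step 1 sketch (layer sizes and epoch count are assumptions, not the notebook's exact hyperparameters):

```python
from tensorflow.keras import layers, models

def build_tagger(bidirectional=False):
    recurrent = (layers.Bidirectional(layers.LSTM(64, return_sequences=True))
                 if bidirectional else
                 layers.SimpleRNN(64, return_sequences=True))
    model = models.Sequential([
        layers.Embedding(len(word2id) + 1, 128, mask_zero=True),
        recurrent,
        layers.TimeDistributed(layers.Dense(len(tag2id) + 1, activation="softmax")),
    ])
    model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

rnn_history = build_tagger().fit(
    X_train, Y_train, validation_data=(X_val, Y_val), epochs=5)
bilstm_history = build_tagger(bidirectional=True).fit(
    X_train, Y_train, validation_data=(X_val, Y_val), epochs=5)
# rnn_history.history["val_accuracy"] etc. can then be plotted for comparison.
```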
For POS tagging, we get 99.26% accuracy using the RNN and 99.37% accuracy using the Bi-LSTM.
Chunking refers to the process of taking individual pieces of information and grouping them into larger units; in NLP, this means grouping words into phrases such as noun phrases (NP) and verb phrases (VP).
As before, we import all the necessary libraries and download the conll2000 dataset, this time using its chunk annotations:
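For chunking, the targets are IOB chunk tags rather than POS tags. A sketch using NLTK's helper to flatten the CoNLL-2000 chunk trees into (word, POS, IOB) triples (the exact preprocessing in the notebook may differ):

```python
import nltk
from nltk.corpus import conll2000

nltk.download("conll2000")
train_trees = conll2000.chunked_sents("train.txt")
triples = nltk.chunk.tree2conlltags(train_trees[0])
print(triples[:3])
# [('Confidence', 'NN', 'B-NP'), ('in', 'IN', 'B-PP'), ('the', 'DT', 'B-NP')]
```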
Step 1: Process the dataset, separate the data into words (x) and chunk tags (y), and split it into training, testing, and validation sets, exactly as in the POS task.
Step 2: Implement both the RNN and Bi-LSTM models, plot their results, and finally visualize and compare the two models.
For chunking, the RNN gave 98.37% accuracy and the Bi-LSTM gave 98.62% accuracy.
NER (Named Entity Recognition) is an information-extraction task that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, locations, quantities, measurements, etc. For example, in "Apple was founded in California", "Apple" is an organization and "California" a location.
Here, we implement it using an RNN (Recurrent Neural Network) and a Bi-LSTM (Bidirectional LSTM).
First, we import all the necessary libraries and load the conll2003 dataset.
Step 1: Load the dataset and extract the word/tag mappings required by the neural network. Pad the sequences, then split the dataset into train and test sets.
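A loader-agnostic sketch of this step; the two toy sentences below stand in for the token and NER-tag sequences read from the CoNLL-2003 files, and the padding length and split ratio are assumptions:

```python
from sklearn.model_selection import train_test_split
from tensorflow.keras.preprocessing.sequence import pad_sequences

sentences = [["EU", "rejects", "German", "call"], ["Peter", "Blackburn"]]
ner_tags = [["B-ORG", "O", "B-MISC", "O"], ["B-PER", "I-PER"]]

# Mappings from words/tags to integer ids (0 reserved for padding).
word2id = {w: i + 1 for i, w in enumerate({w for s in sentences for w in s})}
tag2id = {t: i + 1 for i, t in enumerate({t for s in ner_tags for t in s})}

MAX_LEN = 64  # assumed padding length
X = pad_sequences([[word2id[w] for w in s] for s in sentences], maxlen=MAX_LEN)
y = pad_sequences([[tag2id[t] for t in s] for s in ner_tags], maxlen=MAX_LEN)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.1, random_state=42)
```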
Step 2: Build the model architecture, which has 4 layers: an embedding layer, a bidirectional LSTM layer, an LSTM layer, and a time-distributed dense layer. Fit the model, visualize the results, and get the model summary (a sketch follows).
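A sketch of this 4-layer architecture, continuing from the Step 1 sketch (layer sizes are assumptions; in a real run X_train/y_train come from the full corpus, not the toy sentences above):

```python
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Embedding(len(word2id) + 1, 128, mask_zero=True),       # embedding layer
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),  # bidirectional LSTM
    layers.LSTM(64, return_sequences=True),                        # LSTM layer
    layers.TimeDistributed(layers.Dense(len(tag2id) + 1,
                                        activation="softmax")),    # per-token tag scores
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
model.fit(X_train, y_train, epochs=5)
```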
For NER, the RNN got 99.54% accuracy and the Bi-LSTM got 97.11% accuracy.
In all three tasks above, i.e. POS tagging, chunking, and NER, we obtained very high accuracy using Bi-LSTM and RNN when compared with the CRF++ implementation. Using CRF++, we got the following accuracies:
- Chunking: 94-96%
- NER: 73-84%
- POS: 93-94%
Our present accuracies using Bi-LSTM and RNN are far better than those obtained using CRF++.