Predictive classification model using Natural Language Processing (NLP) for IMDB movie rating.

Using deep learning model to train over 49,000 IMDB rating dataset to categorize either the review is positive and negative.

Description

The project's objective is to categorize the IMDB movies rating.
The IMDB movie reviews contain enormous amount of data, which can be used to predict whether the movie review is a negative or positive review.
The dataset contains anomalies such as HTML tags (removed using RegEx), lowercase/uppercase, and duplicates data.
The method used for the deep learning model are word embedding, LSTM and Bidirectional.
Several method can be used to improve the model such as lemmatization, stemming, CNN, n-grams, etc.

Deep learning model images

Results

Training loss & Validation loss:

Training accuracy & Validation accuracy:

Model score:

Discussion

The model achieved 84% accuracy during training.
Both recall and f1 score report 85%.
However, the model starts to overfit after 2nd epochs. Early stopping can be used to prevent overfitting. The dropout data can be increased to control overfitting.

Credits:

Shout out to @Ankit152 for the IMDB Dataset. Check out the dataset by clicking the link below. 😄

Dataset link

IMDB-Sentiment-Analysis

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
static		static
.gitattributes		.gitattributes
README.md		README.md
Sentiment_analysis.py		Sentiment_analysis.py
Sentiment_deploy.py		Sentiment_deploy.py
model.h5		model.h5
ohe_pkl		ohe_pkl
tokenizer_sentiment.json		tokenizer_sentiment.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predictive classification model using Natural Language Processing (NLP) for IMDB movie rating.

Description

Deep learning model images

Results

Training loss & Validation loss:

Training accuracy & Validation accuracy:

Model score:

Discussion

Credits:

Dataset link

About

Releases

Packages

Languages

safwanshamsir99/Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Predictive classification model using Natural Language Processing (NLP) for IMDB movie rating.

Description

Deep learning model images

Results

Training loss & Validation loss:

Training accuracy & Validation accuracy:

Model score:

Discussion

Credits:

Dataset link

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages