Skip to content

AI 대학원 1-2 [AI를 위한 통계학] 개인 분석 보고서 : 영화 리뷰 감정 분석

Notifications You must be signed in to change notification settings

Yerin99/IMDB-Movie-Review-Sentiment-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

IMDB Movie Review Sentiment Analysis

This project analyzes the sentiment of IMDB movie reviews using machine learning models. The goal is to classify reviews as positive or negative and extract meaningful insights using Logistic Regression and other models.

Project Overview

  • Dataset: IMDB Movie Reviews (50,000 reviews)
  • Goal: Sentiment classification and feature analysis.
  • Models: Logistic Regression, Random Forest, Naive Bayes.

Key Results

Model Accuracy
Logistic Regression 89.7%
Random Forest 85.3%
Naive Bayes 85.8%

Why Logistic Regression?

  • Provides interpretable feature importance.
  • Performs well with sparse, high-dimensional text data.

Experiments

Feature Analysis

  • Top positive words: great, excellent, amazing, perfect
  • Top negative words: worst, awful, boring, terrible

Word Cloud

positive words wordCloud negative words wordCloud

Central Limit Theorem

I conducted CLT experiments on review length, negation count, and exclamation marks to validate statistical properties. CLT on Length of Reviews CLT on Negation Count in Reviews CLT on Exclamation Mark Count in Reviews

How to Run

Just download the ipynb file, and run it!

Contact

For questions or suggestions, please reach out to diluny2@naver.com!

About

AI 대학원 1-2 [AI를 위한 통계학] 개인 분석 보고서 : 영화 리뷰 감정 분석

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published