Skip to content

Implemented a web scraper in Python using the BeautifulSoup library to scrape movie reviews from IMDb. The script collects review titles, review text, user ratings, review dates, and user names and stores them in a JSON file

Notifications You must be signed in to change notification settings

adabarbulescu/IMDb-Reviews-Scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IMDb Reviews Scraping

A web scraper built with Python and Beautiful Soup that collects movie review data from IMDb.

Technologies Used

  • Python
  • Beautiful Soup
  • Requests
  • JSON

Requirements

  • Python 3
  • Beautiful Soup
  • Requests

Usage

  1. Clone the repository

Copy code

git clone https://github.com/adabarbulescu/IMDb-Reviews-Scraping.git

  1. Navigate to the directory

Copy code

cd IMDb-Reviews-Scraping

  1. Run the script

Copy code

python main.py

  1. The review data will be saved in a JSON file called review_data.json in the project directory.

Note

The script may take a while to run as it is scraping a large amount of data from multiple pages.

Future Work

Implement the feature to automatically scrape additional reviews by pressing the "Load more" button on review pages

About

Implemented a web scraper in Python using the BeautifulSoup library to scrape movie reviews from IMDb. The script collects review titles, review text, user ratings, review dates, and user names and stores them in a JSON file

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published