MiniProject-Book-Price-Tracker-using-Scrapy 📚

A B.Tech Minor Project (May–Dec 2025)
Koneru Lakshmaiah University


📌 Overview

Book Price Tracker is a mini web-scraping project that extracts and monitors book prices from Books to Scrape (books.toscrape.com) using the Scrapy framework. It demonstrates how structured data such as titles, prices, and ratings can be collected programmatically and stored for analysis or alerting.


✨ Features

  • Tracks book titles, prices, and star ratings from Books to Scrape
  • Built with Scrapy, a powerful Python framework for large-scale scraping
  • Stores data in CSV or JSON format
  • Fast, efficient, and modular architecture
  • Supports pagination for scraping multiple pages
  • Can be deployed as a scheduled scraper for daily/weekly monitoring
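
For the monitoring use case, a scheduled run only becomes useful once two runs are compared. The sketch below is one possible way to do that with the standard library. It assumes each run was exported to CSV with `title` and `price` columns and that prices were stored as plain numbers; the column names, the helper names `load_prices` and `price_changes`, and the sample rows are all illustrative and not taken from the project.

```python
import csv
import io
from decimal import Decimal


def load_prices(csv_text):
    """Map each title to its price from CSV text with 'title' and 'price' columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["title"]: Decimal(row["price"]) for row in reader}


def price_changes(old, new):
    """Return {title: (old_price, new_price)} for titles whose price differs."""
    return {
        title: (old[title], price)
        for title, price in new.items()
        if title in old and old[title] != price
    }


# Hypothetical snapshots from two consecutive runs.
yesterday = load_prices("title,price\nBook A,51.77\nBook B,20.00\n")
today = load_prices("title,price\nBook A,49.99\nBook B,20.00\n")

changes = price_changes(yesterday, today)
for title, (before, after) in changes.items():
    print(f"{title}: {before} -> {after}")
```

Titles that appear in only one snapshot are ignored here; a real tracker would probably want to report new and removed books as well.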

🧠 Problem Statement

Online bookstores frequently update prices, and manually checking them can be time-consuming. This project automates the process of collecting book prices and ratings from a target site. The solution is lightweight, fast, and serves as a base for real-world price tracker applications.


📊 Performance

  • Pagination Support: Yes
  • Speed: Very fast
  • Accuracy: High (XPath and CSS selectors used)
  • Target Site: Books to Scrape

⚙️ Tools

  1. Scrapy

    • Framework used to crawl and extract data efficiently
    • Modular spider-based architecture
    • Fast and scalable

    [Screenshot: Scrapy]

  2. XPath / CSS Selectors

    • Precise extraction of price, title, and star rating

  3. CSV/JSON Output

    • Scraped data saved in structured formats for analysis
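
On Books to Scrape, the extracted price is a text string such as `£51.77`, and the star rating is encoded as a word in a class attribute such as `star-rating Three`. Before analysis, both usually need to be converted to numbers. The helpers below are a minimal sketch of that step; the names `parse_price` and `parse_rating` are hypothetical and not part of Scrapy or of this project.

```python
from decimal import Decimal

# Rating words as they appear in the class attribute on the target site.
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def parse_price(text):
    """Convert a price string such as '£51.77' to Decimal('51.77')."""
    cleaned = "".join(ch for ch in text if ch.isdigit() or ch == ".")
    return Decimal(cleaned)


def parse_rating(class_attr):
    """Convert a class attribute such as 'star-rating Three' to 3 (None if unknown)."""
    for word in class_attr.split():
        if word in RATING_WORDS:
            return RATING_WORDS[word]
    return None


print(parse_price("£51.77"))
print(parse_rating("star-rating Three"))
```

Using `Decimal` rather than `float` avoids rounding surprises when prices are later compared or summed.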

📸 Demonstrations

Sample Output (Google Colab / Terminal):

  • Extracted titles, prices, and ratings from multiple pages
[Screenshot: sample output, 2025-07-19]

🛠️ Tech Stack

  • Language: Python
  • Framework: Scrapy
  • Selectors: XPath, CSS
  • Storage: CSV, JSON
  • Platform: Google Colab / Jupyter Notebook / CLI

💻 How to Run


Step 1: Install Scrapy

pip install scrapy

Step 2: Start a new Scrapy project

scrapy startproject book_tracker

Step 3: Create a spider

cd book_tracker
scrapy genspider books books.toscrape.com

Step 4: Run the spider and save output

scrapy crawl books -o output.csv
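
Once the crawl has written its output, the file can be inspected with the standard library alone. The sketch below counts books per star rating. It assumes the CSV has a `rating` column whose value ends in the rating word (for example `star-rating Three`); the column layout and the sample rows are illustrative and depend on what the spider actually yields.

```python
import csv
import io
from collections import Counter


def rating_counts(rows):
    """Count rows per rating, using the last word of the 'rating' field."""
    return Counter(row["rating"].split()[-1] for row in rows if row["rating"])


# Stand-in for: open("output.csv", newline="", encoding="utf-8")
sample = io.StringIO(
    "title,price,rating\n"
    "Book A,£51.77,star-rating Three\n"
    "Book B,£20.00,star-rating One\n"
    "Book C,£35.10,star-rating Three\n"
)
rows = list(csv.DictReader(sample))

counts = rating_counts(rows)
for rating, count in counts.most_common():
    print(rating, count)
```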

Contact

For any inquiries or feedback, please contact:

  • Name: Ahmadullah Laskar
  • Education: B.Tech AI&DS, KLU, Guntur
  • Email: ahmadullahlaskar2004@gmail.com
  • GitHub: Ahmadullah_Laskar

Mentor

Dr. Sahinur Rahman Laskar
Assistant Professor
School of Computer Science, UPES, Dehradun, India
Email: sahinurlaskar.nits@gmail.com / sahinur.laskar@ddn.upes.ac.in
