Skip to content
This repository has been archived by the owner on May 7, 2024. It is now read-only.
/ pyratebay Public archive

Python script to download torrents from piratebay.

License

Notifications You must be signed in to change notification settings

notjedi/pyratebay

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pyDownloader

Downloading torrents from pirated sites is bit of a hassle. This Python based CLI helps you to download torrents with ease.

Dependencies

  • BeautifulSoup - pip install bs4
  • Selenium - pip install selenium

Requirements

Any of the following browsers:

  • Chrome (Supports headless mode)
  • Firefox
  • Internet Explorer

Note: Support for Firefox and IE browsers has not been implemented yet

Usage

usage: pyratebay.py [-h] [-c [{all,audio,video,apps,games,other}]] query

positional arguments:
  query                 Name of the media to download

optional arguments:
  -h, --help            show this help message and exit
  -c, --category [{all,audio,video,apps,games,other}]
                        Searches for the given 'name' in the specified
                        category (default = all)

From a developer standpoint

The newly updated pirate bay sites work differently. It fetches the data only after loading the webpage. The table is dynamically populated by JavaScript, this makes it impossible to scrape the data of the webpage using the normal requests or urllib module as it only returns a basic HTML content of the webpage without any table and other stuff. To overcome this problem, one should let the webpage load first and then scrape the contents of the webpage using the selenium or bs4 modules. This way of scraping the web is known as dynamic web scraping, which is indeed implemented in this program.


demo.gif


Credits

  • webdriver_manager - simplifies the management of binary drivers for different browsers

License

MIT LICENSE