A simple tool that extracts text from standard or password-protected PDFs. It prompts the user to enter the password securely, or to press Enter if the PDF has no password. It then writes the extracted text to a file or directory of the user's choice.
- Python 3.10+
- Required Python packages listed in requirements.txt
- Create a Virtual Environment:
  python -m venv venv
- Activate the Virtual Environment:
  - On Windows:
    .\venv\Scripts\activate
  - On macOS/Linux:
    source venv/bin/activate
- Install Required Packages:
  pip install -r requirements.txt
- With directory
  python extractor.py -i '.\path\to\file.pdf' -o '.\path\to\output.txt'
- Without directory
  python extractor.py -i '.\file.pdf' -o 'output.txt'

- -i, --input: Path to the input PDF file.
- -o, --output: Path to the output text file.
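
For readers curious how a script with this interface might be put together, here is a minimal sketch. It is not the actual contents of extractor.py. It assumes the `pypdf` package for reading PDFs (the real requirements.txt may list a different library), treats both options as required, and creates the output directory if it is missing; the function names `build_parser`, `extract_text`, and `main` are hypothetical.

```python
import argparse
import getpass
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Build a parser exposing the -i/--input and -o/--output options."""
    parser = argparse.ArgumentParser(description="Extract text from a PDF.")
    # Assumption: both options are mandatory in this sketch.
    parser.add_argument("-i", "--input", required=True, type=Path,
                        help="Path to the input PDF file.")
    parser.add_argument("-o", "--output", required=True, type=Path,
                        help="Path to the output text file.")
    return parser


def extract_text(pdf_path: Path, password: str = "") -> str:
    """Return the text of every page, decrypting first if the PDF is encrypted."""
    # Assumption: pypdf is installed; it is imported here so the argument
    # parser above can be used without it.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # decrypt() returns a falsy value when the password does not match.
    if reader.is_encrypted and not reader.decrypt(password):
        raise ValueError("Incorrect password for encrypted PDF.")
    # extract_text() can yield nothing for image-only pages, hence the fallback.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def main(argv=None) -> None:
    """Parse arguments, prompt for the password, and write the text file."""
    args = build_parser().parse_args(argv)
    # getpass hides the typed characters; pressing Enter yields an empty string.
    password = getpass.getpass("PDF password (press Enter if none): ")
    text = extract_text(args.input, password)
    # Assumption: create the output directory when it does not exist yet.
    args.output.parent.mkdir(parents=True, exist_ok=True)
    args.output.write_text(text, encoding="utf-8")
```

Calling `main()` at the bottom of a script would wire these pieces to the command line. Using `getpass` rather than `input` keeps the password off the screen and out of the shell history, which is presumably why the tool prompts for it instead of accepting it as an option.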