This system uses the llmparse library to extract tables from PDF files and convert them into Excel format, streamlining the manual process of handling tabular transaction data.
Use Case If you have numerous transaction records embedded in PDF tables, this tool automates the extraction and conversion into Excel, saving time and reducing errors.
Setup Instructions
- Apply for a LlamaParse API Key – Register and obtain your API key from the cloud service. (https://cloud.llamaindex.ai/)
- Store the API Key – Add the key to your .env file.
- Set Up the Environment – Configure your Python virtual environment : python -m venv env
- Activate env (Windows): .\env\Scripts\Activate (Linux) : source env/bin/activate
- Install Dependencies – Run: pip install -r requirements.txt python ParsePDFTable.py