Skip to content

Llama Parse PDF: Extract tables from PDF documents and convert them into Excel format, simplifying the process of managing transaction data.

Notifications You must be signed in to change notification settings

JennyTan5522/Llama-Parse-PDF-Table

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

LLMParse PDF Table to Excel

This system uses the llmparse library to extract tables from PDF files and convert them into Excel format, streamlining the manual process of handling tabular transaction data.

Use Case If you have numerous transaction records embedded in PDF tables, this tool automates the extraction and conversion into Excel, saving time and reducing errors.

Setup Instructions

  1. Apply for a LlamaParse API Key – Register and obtain your API key from the cloud service. (https://cloud.llamaindex.ai/)
  2. Store the API Key – Add the key to your .env file.
  3. Set Up the Environment – Configure your Python virtual environment : python -m venv env
  4. Activate env (Windows): .\env\Scripts\Activate (Linux) : source env/bin/activate
  5. Install Dependencies – Run: pip install -r requirements.txt python ParsePDFTable.py

About

Llama Parse PDF: Extract tables from PDF documents and convert them into Excel format, simplifying the process of managing transaction data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages