Skip to content
#

document-processing-pipeline

Here are 13 public repositories matching this topic...

pRISM is a repository that combines Retrieval-Augmented Generation (RAG) with a multi-LLM voting approach to create accurate and reliable AI-generated outputs. It integrates multiple language models, including Mistral, Claude 3.5, and OpenAI, to enhance performance through advanced consensus techniques

  • Updated Jun 20, 2025
  • Python

AI-powered system for summarizing PDF content with Armenian, Russian, and English language support. Automatically extracts and summarizes text, applies OCR to images, and identifies visual elements in documents. Built for efficient multilingual PDF processing.

  • Updated Jun 22, 2025
  • HTML

A sophisticated Python pipeline for automatically extracting dimensions and cabinet codes from architectural floorplan PDFs. This tool converts various dimension formats into standardized measurements and provides structured output with visualization capabilities.

  • Updated Sep 28, 2025
  • Python

MerchantPulse-Integrated Offline AI for modern finance. Convert statements, trades, and loan files into clean, validated data in seconds. Runs fully on your hardware—no cloud, no leaks. Built for credit analytics, risk, and compliance with speed and security at its core.

  • Updated Sep 29, 2025
  • Python

Improve this page

Add a description, image, and links to the document-processing-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the document-processing-pipeline topic, visit your repo's landing page and select "manage topics."

Learn more