[Feature]: RAG from PDF File #1087

areytechai · 2025-02-06T12:29:03Z

Background & Description

Is there a way to use the RAG methodology with a PDF or DOCX file read from the filesystem?

API & Usage

No response

How to implement

No response

zsogitbe · 2025-02-08T05:08:31Z

Yes, you need a library that can extract text from PDF and DOCX files (there are free libraries available), and then you can use the standard procedure for Retrieval-Augmented Generation (RAG) in LLamaSharp.

areytechai · 2025-02-08T08:38:15Z

@zsogitbe Could you give an example of the procedure, please?

zsogitbe · 2025-02-08T09:41:34Z

Extract text from a PDF using a free C# tool.
Chunk the extracted text for RAG.
Save the text chunks into Kernel memory with LLamaSharp.
Perform searches within the memory using LLamaSharp.
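
As a rough illustration of the chunking step only, here is a minimal sketch in plain C#. It assumes the PDF text has already been extracted into a string by whatever library you choose. The `TextChunker` class, its `Chunk` method, and the default sizes are hypothetical names and values chosen for this example; they are not part of LLamaSharp. Overlapping word windows are just one simple strategy, and better splitting (by sentence, heading, or paragraph) usually improves retrieval.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper for illustration; not part of LLamaSharp.
public static class TextChunker
{
    // Splits text into windows of at most chunkSize words. Consecutive
    // windows share `overlap` words, so a sentence cut at a boundary
    // still appears whole in at least one chunk.
    public static List<string> Chunk(string text, int chunkSize = 200, int overlap = 40)
    {
        if (chunkSize <= 0)
            throw new ArgumentOutOfRangeException(nameof(chunkSize));
        if (overlap < 0 || overlap >= chunkSize)
            throw new ArgumentOutOfRangeException(nameof(overlap));

        // Split on whitespace; extracted PDF text often contains stray line breaks.
        string[] words = text.Split(
            new[] { ' ', '\t', '\r', '\n' },
            StringSplitOptions.RemoveEmptyEntries);

        var chunks = new List<string>();
        int step = chunkSize - overlap; // how far each window advances

        for (int start = 0; start < words.Length; start += step)
        {
            int length = Math.Min(chunkSize, words.Length - start);
            chunks.Add(string.Join(" ", words, start, length));

            // Stop once a window has reached the end of the text.
            if (start + length >= words.Length)
                break;
        }

        return chunks;
    }
}
```

Each returned chunk would then be saved into memory and searched as described in the last two steps; the Kernel Memory examples in the LLamaSharp repository show the actual calls for that part.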

The quality of the results will depend on how efficiently and cleverly the steps above are carried out (it is not as easy as many novice AI 'experts' think).

I am willing to provide an example in exchange for payment (willing to give a quote for this based on your requirements). Otherwise, please look at the basic examples in the LLamaSharp repository.
