From f33fab4abdc315620df25ba9c92e3a3febe4379f Mon Sep 17 00:00:00 2001 From: Michele Dolfi Date: Wed, 30 Oct 2024 09:11:00 +0100 Subject: [PATCH] add bitmap images as format Signed-off-by: Michele Dolfi --- README.md | 2 +- docs/index.md | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 256fc0c8f..ae923ccc9 100644 --- a/README.md +++ b/README.md @@ -22,7 +22,7 @@ Docling parses documents and exports them to the desired format with ease and sp ## Features -* 🗂️ Multi-format support for input (PDF, DOCX, PPTX, HTML, AsciiDoc, MarkDown) and output (Markdown, JSON, YAML) +* 🗂️ Multi-format support for input (PDF, DOCX, PPTX, Bitmap images, HTML, AsciiDoc, MarkDown) and output (Markdown, JSON, YAML) * 📑 Advanced PDF document understanding incl. page layout, reading order & table structures * 🧩 Strongly typed Pydantic v2 data structure named [DoclingDocument](https://ds4sd.github.io/docling/concepts/docling_document/) which supports hierarchies and provides native iterators and chunkers. * 📝 Metadata extraction, including title, authors, references & language diff --git a/docs/index.md b/docs/index.md index 2a2b59d8e..d561f4908 100644 --- a/docs/index.md +++ b/docs/index.md @@ -19,7 +19,7 @@ Docling parses documents and exports them to the desired format with ease and sp ## Features -* 🗂️ Multi-format support for input (PDF, DOCX, PPTX, HTML, AsciiDoc, MarkDown) and output (Markdown, JSON, YAML) +* 🗂️ Multi-format support for input (PDF, DOCX, PPTX, Bitmap images, HTML, AsciiDoc, MarkDown) and output (Markdown, JSON, YAML) * 📑 Advanced PDF document understanding incl. page layout, reading order & table structures * 🧩 Strongly typed Pydantic v2 data structure named [DoclingDocument](./concepts/docling_document.md) which supports hierarchies and provides native iterators and chunkers. * 📝 Metadata extraction, including title, authors, references & language