diff --git a/pages/docs/features/ocr.mdx b/pages/docs/features/ocr.mdx index b5efc46e3..f319dee3f 100644 --- a/pages/docs/features/ocr.mdx +++ b/pages/docs/features/ocr.mdx @@ -164,11 +164,23 @@ ocr: 3. Download the service account JSON key file 4. Set the `GOOGLE_SERVICE_KEY_FILE` environment variable using one of the supported methods -### 4. Custom OCR (Planned) +### 4. Document Parser + +The built-in document parser OCR strategy supports native parsing of some +document file formats (like PDF, DOCX, and XLSX) as text. + +**Configuration:** +```yaml +# `librechat.yaml` +ocr: + strategy: "document_parser" +``` + +### 5. Custom OCR (Planned) Support for custom OCR providers and user-defined strategies is planned for future releases. -### 5. Upload Files to Provider (Direct) +### 6. Upload Files to Provider (Direct) For supported LLM Providers (**OpenAI, AzureOpenAI, Anthropic, and Google**) and their respective models, files can now be sent directly to the provider APIs as message attachments, allowing the provider to use their own native OCR implementations to parse files using the `Upload to Provider` option in the file attachment dropdown menu.