Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 14 additions & 2 deletions pages/docs/features/ocr.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -164,11 +164,23 @@ ocr:
3. Download the service account JSON key file
4. Set the `GOOGLE_SERVICE_KEY_FILE` environment variable using one of the supported methods

### 4. Custom OCR (Planned)
### 4. Document Parser

The built-in document parser OCR strategy supports native parsing of some
document file formats (like PDF, DOCX, and XLSX) as text.

**Configuration:**
```yaml
# `librechat.yaml`
ocr:
strategy: "document_parser"
```

### 5. Custom OCR (Planned)

Support for custom OCR providers and user-defined strategies is planned for future releases.

### 5. Upload Files to Provider (Direct)
### 6. Upload Files to Provider (Direct)

For supported LLM Providers (**OpenAI, AzureOpenAI, Anthropic, and Google**) and their respective models, files can now be sent directly to the provider APIs as message attachments,
allowing the provider to use their own native OCR implementations to parse files using the `Upload to Provider` option in the file attachment dropdown menu.
Expand Down