-
Notifications
You must be signed in to change notification settings - Fork 477
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
docs: Add README for advanced image processing (#969)
- Loading branch information
Showing
4 changed files
with
41 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,41 @@ | ||
# Advanced Image Processing | ||
|
||
The Chat With Your Data solution accelerator supports advanced image processing using Azure AI Vision and GPT-4 Vision to obtain more meaningful data from images. | ||
|
||
With advanced image processing enabled, GPT-4 vision is used to generate rich captions of images, and Azure Computer Vision is used to generate embeddings of images, in addition to the text embeddings generated from the image captions. | ||
|
||
When querying data in Azure AI Search, vector search is performed using embeddings created for the search query using both Azure OpenAI and Azure Computer Vision. | ||
|
||
Finally, if any image documents are retrieved from Azure AI Search, the images themselves are passed to the LLM as well as the text caption generated for the image. | ||
|
||
All of this allows a user to ask questions about the meaning of images in the knowledge store, rather than just the text in an image. | ||
|
||
Compare the results when uploading the following image: | ||
|
||
![image](./images/architecture.png) | ||
|
||
Without advanced image processing: | ||
|
||
![image](./images/without_advanced_image_processing.png) | ||
|
||
With advanced image processing: | ||
|
||
![image](./images/with_advanced_image_processing.png) | ||
|
||
## Enabling Advanced Image Processing | ||
|
||
To enable advanced image processing: | ||
|
||
- If deploying from the UI, set "Use Advanced Image Processing" to "true". | ||
|
||
- If deploying using azd, run the following before deploying: | ||
|
||
```bash | ||
azd env set USE_ADVANCED_IMAGE_PROCESSING true | ||
``` | ||
|
||
Once enabled, advanced image processing will be enabled for all supported image types, however it can be enabled/disabled for individual image types in the Admin app. | ||
|
||
![image](./images/enable_advanced_image_processing.png) | ||
|
||
Advanced image processing is only used in the `custom` conversation flow and not the `byod` flow, as Azure OpenAI On Your Data only supports Ada embeddings. It is currently not possible to use advanced image processing when integrated vectorization is enabled. |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.