
How Does a Search Index Handle Updated PDF Files with the Same Name but Different Content? #2301

Answered by pamelafox
svenfeld asked this question in Q&A

For this repo, manual ingestion assumes that a file with the same name is the same file. Our manual ingestion script does not attempt to remove any chunks, so you would need to remove them manually if desired.
See:
https://github.com/Azure-Samples/azure-search-openai-demo/blob/main/docs/data_ingestion.md#removing-documents
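
If you would rather clear the stale chunks yourself before re-ingesting an updated PDF, here is a rough sketch of the idea, not the repo's actual code. It assumes the index stores the original filename in a filterable `sourcefile` field and uses `id` as the key, so check your own index schema first. The function name `remove_chunks_for_file` and its parameters are made up for illustration; `search_client` is expected to behave like `SearchClient` from the `azure-search-documents` package, which provides `search` and `delete_documents`.

```python
import time
from typing import Any


def remove_chunks_for_file(
    search_client: Any,
    filename: str,
    batch_size: int = 1000,
    settle_seconds: float = 2.0,
) -> int:
    """Delete every indexed chunk whose `sourcefile` equals `filename`.

    Assumption: the index has a filterable `sourcefile` field and an `id` key.
    Returns the number of chunks submitted for deletion.
    """
    # OData string literals escape a single quote by doubling it.
    escaped = filename.replace("'", "''")
    removed = 0
    while True:
        results = search_client.search(
            search_text="*",
            filter=f"sourcefile eq '{escaped}'",
            select=["id"],
            top=batch_size,
        )
        keys = [{"id": doc["id"]} for doc in results]
        if not keys:
            return removed
        search_client.delete_documents(documents=keys)
        removed += len(keys)
        # Deletions can take a moment to show up in query results,
        # so pause before checking for remaining chunks.
        time.sleep(settle_seconds)
```

To run it against a real service, you would pass a `SearchClient` built with your own endpoint, index name, and credential, then call something like `remove_chunks_for_file(client, "report.pdf")` before ingesting the new version of the file.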

However, perhaps you're asking about how Azure AI Search indexers work? There are more details about that in the documentation here:
https://learn.microsoft.com/en-us/azure/search/search-howto-index-changed-deleted-blobs?tabs=portal

Answer selected by svenfeld