Skip to content

Conversation

@aarongiera
Copy link

Span extensions need to be serializable for spacy's multiprocessing to work. Currently a serialization error occurs when adding the quickumls pipeline with nlp.pipe and n_processes > 1.

@jianlins
Copy link
Collaborator

jianlins commented Mar 4, 2024

Why would you prefer List over Set when add new matches to span._.umls_matches? Would that possibly cause duplicated matches? Is the "Set" cause the serialization issue? Can you add tests functions?

@jianlins
Copy link
Collaborator

jianlins commented Mar 4, 2024

Did you implement the to_disk and from_disk function?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants