To relabel a specific pair. #47

Open
PMK1991 opened this issue Sep 13, 2021 · 1 comment

Comments

PMK1991 commented Sep 13, 2021

How can we retrain the model by updating the label of a specific pair once the model is trained?

Collaborator

ieriii commented Mar 27, 2022

Labels for all pairs are stored in the *_training.json file.
pandas-dedupe does not have a method to change the label of a specific pair directly.

A workaround is to pass update_model=True, which lets you keep adding labelled pairs to your *_training.json, including the pairs you'd like to change. For example:

pandas_dedupe.dedupe_dataframe(df, ['first_name', 'last_name'], update_model=True)
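
Since the labels live in the *_training.json file, another option might be to edit that file by hand before retraining. The sketch below is not part of pandas-dedupe: relabel_pair is a hypothetical helper, and it assumes the file is a JSON object with "match" and "distinct" keys, each holding a list of two-record pairs. Check your own file before relying on that layout. The records passed in must equal the stored ones exactly, including any cleaning applied before they were saved.

```python
import json
from pathlib import Path


def relabel_pair(training_path, record_a, record_b, new_label):
    """Move one labelled pair to new_label ("match" or "distinct").

    Hypothetical helper. Assumes the training file is a JSON object with
    "match" and "distinct" keys, each a list of [record, record] pairs.
    Returns True if the pair was found under the other label and moved.
    """
    if new_label not in ("match", "distinct"):
        raise ValueError("new_label must be 'match' or 'distinct'")

    path = Path(training_path)
    labels = json.loads(path.read_text())

    # The pair is expected to sit under the opposite label right now.
    old_label = "distinct" if new_label == "match" else "match"
    target = [record_a, record_b]

    for pair in labels.get(old_label, []):
        # Accept the two records in either order.
        if list(pair) == target or list(pair) == target[::-1]:
            labels[old_label].remove(pair)
            labels.setdefault(new_label, []).append(pair)
            path.write_text(json.dumps(labels))
            return True
    return False
```

A call could look like relabel_pair("my_training.json", rec1, rec2, "match"), where the path and the two record dicts are placeholders. The model would then presumably still need retraining, for instance by rerunning the call above with update_model=True; that step has not been verified here.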
