Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pandas_dedupe Not working on Windows #37

Open
cfatls opened this issue Jan 13, 2021 · 2 comments
Open

Pandas_dedupe Not working on Windows #37

cfatls opened this issue Jan 13, 2021 · 2 comments

Comments

@cfatls
Copy link

cfatls commented Jan 13, 2021

Installed microsoft build tools 2015 as per the video tutorial mentioned in the docs
https://www.youtube.com/watch?v=lCFEzRaqoJA&ab_channel=KeithLyons

However, whenever trying to run , it gets stuck on the:
"Finished labeling
Clustering... "

Works on linux with no hassle. On windows, however, can't get past clustering, regardless of how large or small the file I'm deduping is.
Please help. Any advice is appreciated.

I'm running windows 10.
Python 3.8.3 (default, Jul 2 2020, 17:30:36) [MSC v.1916 64 bit (AMD64)]

@belkacem-ayachi
Copy link

Same issue here

@ieriii
Copy link
Collaborator

ieriii commented Apr 2, 2021

Unfortunately, I couldn't reproduce the issue on my windows machine.

Two potential suggestions:
. try to set the argument n_cores to a value lower than the number of cores on your laptop.
. install python and pandas-dedupe in a new virtual environment and see whether it works.

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants