7 fits failed with the following error:
Traceback (most recent call last):
File "/usr/local/lib/python3.7/dist-packages/sklearn/model_selection/_validation.py", line 680, in _fit_and_score
estimator.fit(X_train, y_train, **fit_params)
File "/usr/local/lib/python3.7/dist-packages/sklearn/linear_model/_logistic.py", line 1558, in fit
% classes_[0]
ValueError: This solver needs samples of at least 2 classes in the data, but the data contains only one class: 0
warnings.warn(some_fits_failed_message, FitFailedWarning)
/usr/local/lib/python3.7/dist-packages/sklearn/model_selection/_search.py:972: UserWarning: One or more of the test scores are non-finite: [nan nan nan nan nan nan nan]
category=UserWarning,
Clustering...
# duplicate sets 143
Hi, I am trying to run this code:
import pandas_dedupe

# Initiate deduplication
df_final = pandas_dedupe.dedupe_dataframe(df1, ['name', 'address1'])
# Send output to CSV
df_final.to_csv('deduplication_output.csv')
My data frame has 143 rows, and I found 3 duplicated rows. But when I tried to run this, I got this error:
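The ValueError in the traceback above is raised when scikit-learn's logistic regression is fit, during cross-validation, on a training split whose labels are all 0. One way this can happen is that very few pairs were labeled as duplicates, so every positive label lands in the held-out fold. That is an assumption here, since the labeled pairs are not shown. The sketch below uses only the standard library to show the effect; the function name, the contiguous unshuffled folds, and the example labels are all hypothetical and not taken from pandas_dedupe or scikit-learn.

```python
from collections import Counter


def single_class_training_folds(labels, n_splits):
    """Return indices of folds whose training portion contains only one class.

    Folds are contiguous, unshuffled slices. This is an assumption made for
    illustration; the splitter used inside the library may differ.
    """
    n = len(labels)
    # Spread the remainder over the first folds so sizes differ by at most 1.
    sizes = [n // n_splits + (1 if i < n % n_splits else 0) for i in range(n_splits)]
    bad, start = [], 0
    for i, size in enumerate(sizes):
        # Training data is everything outside the held-out slice.
        train = labels[:start] + labels[start + size:]
        if len(Counter(train)) < 2:
            bad.append(i)
        start += size
    return bad


# Hypothetical labels: 1 = "duplicate", 0 = "distinct".
# Only two pairs were marked as duplicates, and they sit next to each other.
labels = [0, 0, 0, 0, 1, 1, 0, 0, 0]
print(single_class_training_folds(labels, n_splits=3))
```

If a check like this reports any folds, labeling more pairs of both kinds (duplicate and distinct) before training is a reasonable thing to try.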