diff --git a/README.md b/README.md index 8377665d..beb70eae 100644 --- a/README.md +++ b/README.md @@ -58,26 +58,26 @@ and run fuzzy matching against all company names in the `abcd`: ``` r match_result <- match_name(loanbook_demo, abcd_demo) match_result -#> # A tibble: 329 × 28 -#> id_loan id_direct_loantaker name_direct_loantaker id_intermediate_pare…¹ -#> -#> 1 L1 C294 Vitale Group -#> 2 L3 C292 Rowe-Rowe IP5 -#> 3 L5 C305 Ring AG & Co. KGaA -#> 4 L6 C304 Kassulke-Kassulke -#> 5 L6 C304 Kassulke-Kassulke -#> 6 L7 C227 Morissette Group -#> 7 L7 C227 Morissette Group -#> 8 L8 C303 Barone s.r.l. -#> 9 L9 C301 Werner Werner AG & Co. KG… IP10 -#> 10 L9 C301 Werner Werner AG & Co. KG… IP10 -#> # ℹ 319 more rows -#> # ℹ abbreviated name: ¹​id_intermediate_parent_1 -#> # ℹ 24 more variables: name_intermediate_parent_1 , -#> # id_ultimate_parent , name_ultimate_parent , -#> # loan_size_outstanding , loan_size_outstanding_currency , -#> # loan_size_credit_limit , loan_size_credit_limit_currency , -#> # sector_classification_system , … +#> # A tibble: 326 × 22 +#> id_loan id_direct_loantaker name_direct_loantaker id_ultimate_parent +#> +#> 1 L1 C294 Vitale Group UP15 +#> 2 L3 C292 Rowe-Rowe UP288 +#> 3 L5 C305 Ring AG & Co. KGaA UP104 +#> 4 L6 C304 Kassulke-Kassulke UP83 +#> 5 L6 C304 Kassulke-Kassulke UP83 +#> 6 L7 C227 Morissette Group UP134 +#> 7 L7 C227 Morissette Group UP134 +#> 8 L8 C303 Barone s.r.l. UP163 +#> 9 L9 C301 Werner Werner AG & Co. KGaA UP138 +#> 10 L9 C301 Werner Werner AG & Co. KGaA UP138 +#> # ℹ 316 more rows +#> # ℹ 18 more variables: name_ultimate_parent , loan_size_outstanding , +#> # loan_size_outstanding_currency , loan_size_credit_limit , +#> # loan_size_credit_limit_currency , sector_classification_system , +#> # sector_classification_direct_loantaker , lei_direct_loantaker , +#> # isin_direct_loantaker , id_2dii , level , sector , +#> # sector_abcd , name , name_abcd , score , … ``` ### 2. Prioritize validated matches @@ -92,26 +92,26 @@ matches, prioritizing (by default) `direct_loantaker` matches over ``` r prioritize(match_result) -#> # A tibble: 177 × 28 -#> id_loan id_direct_loantaker name_direct_loantaker id_intermediate_pare…¹ -#> -#> 1 L6 C304 Kassulke-Kassulke -#> 2 L13 C297 Ladeck -#> 3 L20 C287 Weinhold -#> 4 L21 C286 Gallo Group -#> 5 L22 C285 Austermuhle GmbH -#> 6 L24 C282 Ferraro-Ferraro Group -#> 7 L25 C281 Lockman, Lockman and Lock… -#> 8 L26 C280 Ankunding, Ankunding and … -#> 9 L27 C278 Donati-Donati Group -#> 10 L28 C276 Ferraro, Ferraro e Ferrar… +#> # A tibble: 177 × 22 +#> id_loan id_direct_loantaker name_direct_loantaker id_ultimate_parent +#> +#> 1 L6 C304 Kassulke-Kassulke UP83 +#> 2 L13 C297 Ladeck UP69 +#> 3 L20 C287 Weinhold UP35 +#> 4 L21 C286 Gallo Group UP63 +#> 5 L22 C285 Austermuhle GmbH UP187 +#> 6 L24 C282 Ferraro-Ferraro Group UP209 +#> 7 L25 C281 Lockman, Lockman and Lockman UP296 +#> 8 L26 C280 Ankunding, Ankunding and Anku… UP67 +#> 9 L27 C278 Donati-Donati Group UP45 +#> 10 L28 C276 Ferraro, Ferraro e Ferraro SPA UP195 #> # ℹ 167 more rows -#> # ℹ abbreviated name: ¹​id_intermediate_parent_1 -#> # ℹ 24 more variables: name_intermediate_parent_1 , -#> # id_ultimate_parent , name_ultimate_parent , -#> # loan_size_outstanding , loan_size_outstanding_currency , -#> # loan_size_credit_limit , loan_size_credit_limit_currency , -#> # sector_classification_system , … +#> # ℹ 18 more variables: name_ultimate_parent , loan_size_outstanding , +#> # loan_size_outstanding_currency , loan_size_credit_limit , +#> # loan_size_credit_limit_currency , sector_classification_system , +#> # sector_classification_direct_loantaker , lei_direct_loantaker , +#> # isin_direct_loantaker , id_2dii , level , sector , +#> # sector_abcd , name , name_abcd , score , … ``` The result is a dataset with identical columns to the input loanbook,