A repository for R code that uses a fuzzy join operation to find scientific plant name matches in abstracts of scientific articles with the option of using parallel processing.
r dplyr text-classification fuzzy-matching iterators parallel-processing tokenization rprogramming tidytext text-preprocessing fuzzyjoin doparallel worldflora
-
Updated
Oct 30, 2023