Fuzzy matching and more functionality for spaCy.
-
Updated
Jul 6, 2024 - Python
Fuzzy matching and more functionality for spaCy.
DuckDB Community Extension adding RapidFuzz algorithms for search, deduplication, and record linkage.
Fast Batch String Matching in Python (Levenshtein, Jaro-Winkler, Hamming) with Zero Cache Misses - made for Python, written in C++
Fast Scalable Dedupe - Fuzzy Matching With Opensearch + nmslib + Rapidfuzz
A simple and efficient spelling correction system that uses Python's rapidfuzz library to find and correct misspelled sentences by matching them with the closest correct ones from a given dataset.
✅ completed | Voices assistant for windows managing system applications
NovelNudge is a book recommendation engine that embeds titles, descriptions, authors, and genres using SentenceTransformers. It combines these vectors and ranks similar books with cosine similarity and fuzzy title matching.
Match address data files together (CSV, XLSX, Parquet) using fuzzy matching (rapidfuzz) and an (optional) graphical user interface. Built for UK addresses. Best used in combination with the uk_address_matcher repo: https://github.com/moj-analytical-services/uk_address_matcher
современный голосовой ассистент для Windows
Phoenix II Discord Bot
The repository is a duplicate of the local folder which contains codes created by Yuanzhan Gao (yg8ch@virginia.edu) to conduct scaled fuzzy matching procedure on EIDL and PPP dataset. Please see the README file for more information.
Cross-platform desktop application for merging and deduplicating browser bookmarks with GUI and CLI support. Supports Netscape HTML and Chrome JSON formats with intelligent URL canonicalization and fuzzy title matching.
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given list of strings.
DEMO: extract media tags with Spotify API to relational Docker backend
Automated business record matching using fuzzy algorithms (RapidFuzz) and browser automation (Playwright)
Add a description, image, and links to the rapidfuzz topic page so that developers can more easily learn about it.
To associate your repository with the rapidfuzz topic, visit your repo's landing page and select "manage topics."