cearch (sea-urch) is a codebase search tool that parses the code files in your git repos, isolates the logical units using treesitter, embeds them, and then allows you to do nearest neighbor lookups of your code to detect duplicated and similar logic in your codebase.
- prepare your repo by running
cearch initfrom any path in your git repo, this will:- add
.cearch/to your .gitignore - download an embedding model and cache it in
.cearch/
- add
- index your repo by running
cearch index - search indexed symbols by using
cearch query 'pub fn nothing() {' -n $NUM_HITS - delete your saved index and cached embedding models with
cearch clean
- Clone the repo using git
cargo build- Make commits
- Submit a PR
- Please consider searching the ./plans/* directory for open issues
