Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ArcticEmbedLEncoder #2694

Draft
wants to merge 6 commits into
base: master
Choose a base branch
from
Draft

Conversation

vincent-4
Copy link
Contributor

@vincent-4 vincent-4 commented Jan 26, 2025

Starting PR for @b8zhong
Mostly looking for a sanity check of initial impl of src/main/java/io/anserini/encoder/dense/ArcticEmbedLEncoder.java

  • Bindings generally follow Splade pattern
  • Currently using empty URLs and model names (to-fix)

Not implemented yet:

  • Component test for output verification
  • (L2 Norm)
  • End-to-end test with HNSW retrieval
  • (L1 Check) and check with cached vs. ONNX model
  • Support for multi-threaded search (32 threads mentioned) from Discussion: sharded indexes #2690

Q: am I good to convert Arctic-embed-l to ONNX, then run with src/main/java/io/anserini/encoder/dense/ArcticEmbedLEncoder.java now?

@vincent-4 vincent-4 changed the title Upstream master ArcticEmbedLEncoder Jan 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants