Baselines #75

Open
rbroc opened this issue Oct 14, 2024 · 3 comments
@rbroc
Owner

rbroc commented Oct 14, 2024

On top of the feature-based classifiers, we want to have the following baselines:

  • Tf-Idf (already implemented)
  • Sentence embedding-based
  • LLM-based classifier (e.g., GPT)
MinaAlmasi self-assigned this Oct 14, 2024
@MinaAlmasi
Collaborator

Sentence embedding-based (sentence transformers)

@MinaAlmasi
Collaborator

Sentence Embedding-based -> Sentence Transformers

The texts are currently embedded with a fairly large Nvidia model (see the description of #80), but we can scale down, e.g., to an e5 model.

Notes for that:

  1. Need to double-check, but I'm pretty sure the e5 models require a "passage: " prefix in front of every text row, which the current Nvidia model does not.
  2. We might also not need to scale down to FP16 (with model.half()) if we use a smaller model.
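
The two notes above could look roughly like this in code. This is only a minimal sketch, not what is in the repo: the checkpoint name `intfloat/e5-base-v2`, the helper names `add_passage_prefix` and `embed_texts`, and the `use_fp16` flag are assumptions for illustration. The "passage: " prefix is what the e5 model cards describe, but it should be re-checked for whichever checkpoint we end up using.

```python
from typing import Iterable, List

# Prefix described on the e5 model cards (assumption: verify for the chosen checkpoint).
E5_PREFIX = "passage: "


def add_passage_prefix(texts: Iterable[str], prefix: str = E5_PREFIX) -> List[str]:
    """Prepend the prefix to every text, leaving already-prefixed texts unchanged."""
    return [t if t.startswith(prefix) else prefix + t for t in texts]


def embed_texts(texts, model_name="intfloat/e5-base-v2", use_fp16=False):
    """Embed texts with sentence-transformers.

    The import is lazy so the prefix helper can be used without the library.
    Calling this downloads the (hypothetical choice of) model on first use.
    """
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)
    if use_fp16:
        # Optional: cast weights to FP16. Likely unnecessary for a small model,
        # and mainly useful when running on a GPU.
        model.half()
    return model.encode(add_passage_prefix(texts), normalize_embeddings=True)
```

The prefix step is kept separate from the embedding call so that it can be switched off when using a model that does not expect it, such as the current Nvidia one.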

@MinaAlmasi
Collaborator

MinaAlmasi commented Oct 30, 2024

For LLM detector (bookkeeping)

  • Want to use a "small" (3b-ish max) model that we will not be using for generation
  • Ruling out phi-3.5-mini since it has been trained on a lot of synthetic data
