-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Models #1
Comments
see #51: at the end of the whole process, we might want to:
This should be done at the end of the project though, not before - too many new models all the time! |
Models are sort of "out of date" by now, so we should probably consider new ones. ATM: Updating LLMs (Mina's scribbles):Llama3
Maybe stabilityai/stablelm-2-12b-chat ? (Since we are using stabilityai/beluga7b currently.).
We'll consult Kenneth when we are closer to having a polished pipeline |
Some input for this:
|
I'm not sure that Llama 2 is relevant anymore, I'd probably go for at least 3.1 but preferably 3.2. Likewise, I think that Zephyr is a bit of a dead end. I think my vote (right now, anyway) would be Llama 3.2, Mistral, Qwen 2, and Gemma 2. |
Awesome, let's run with that unless anything mindblowing is released in the meantime. |
Looking both at foundation and instruction tuning models. For this project, the latter is probably going to be the only target, as it would probably work better.
Available
Maybe for later
Not open-source
Open-source
The text was updated successfully, but these errors were encountered: