Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test-text-llm-encodings-without-attention-layers-with-cerebros #131

Open
david-thrower opened this issue Dec 5, 2023 · 1 comment
Open
Labels
audience/technical Issue primarily for technical review and service. kind/enhancement New feature or request kind/performance status/ready-pending-tests Ready to make pull request once tests pass. triage/intermediate-priotrity Intermediate priority...

Comments

@david-thrower
Copy link
Owner

Kind of issue: enhancement

Additional context: Another attempt to make a lighter weight equally robust model.

Suggested Labels (If you don't know, that's ok): kind/enhancement kind/performance

@david-thrower david-thrower added kind/enhancement New feature or request audience/technical Issue primarily for technical review and service. triage/intermediate-priotrity Intermediate priority... kind/performance labels Dec 5, 2023
@david-thrower david-thrower added the status/ready-pending-tests Ready to make pull request once tests pass. label Dec 10, 2023
@david-thrower
Copy link
Owner Author

This is a strong candidate for merging in. The last thing left is to extend the seq length a little on another branch just to see if we can exceed 95% test set accuracy without too much computational work and still being under 50M params.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
audience/technical Issue primarily for technical review and service. kind/enhancement New feature or request kind/performance status/ready-pending-tests Ready to make pull request once tests pass. triage/intermediate-priotrity Intermediate priority...
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant