Skip to content

Comments

feat(quest-llama-optimized): implemented#2

Closed
rijuyuezhu wants to merge 2 commits intodebugfrom
quest-llama-optimized
Closed

feat(quest-llama-optimized): implemented#2
rijuyuezhu wants to merge 2 commits intodebugfrom
quest-llama-optimized

Conversation

@rijuyuezhu
Copy link
Collaborator

  • Transplant from original quest repo
  • Mocked GQA: slow. A possible solution is to turn off GQA for the full (aligned)
  • float16 only. A possible solution is to use float16 for all methods

+ Transplant from original quest repo
+ Mocked GQA: slow. A possible solution is to turn off GQA for the full (aligned)
+ float16 only. A possible solution is to use float16 for all methods
@rijuyuezhu
Copy link
Collaborator Author

Covered by #4

@rijuyuezhu rijuyuezhu closed this Jan 27, 2025
@DerekHJH DerekHJH deleted the quest-llama-optimized branch February 10, 2025 02:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant