feat(full_llama_optimized):implemented#3

Closed

Kenaz123 wants to merge 1 commit intoquest-llama-optimizedfrom

llama-optimized

Collaborator

Kenaz123 commented Jan 26, 2025

code logic in full-llama-optimized driven code in e2e
disable GQA in full_llama by copy and store the same kv_states in forwarding process


          feat(full_llama_optimized):implemented

dc9d0d5

Collaborator

rijuyuezhu commented Jan 27, 2025

Covered by #4

rijuyuezhu closed this

DerekHJH deleted the llama-optimized branch

February 10, 2025 02:45

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet