Skip to content

Commit

Permalink
refactor: lock only one slot for embedding at pooling
Browse files Browse the repository at this point in the history
Signed-off-by: thxCode <thxcode0824@gmail.com>
  • Loading branch information
thxCode committed Aug 6, 2024
1 parent 5e41179 commit 67215b9
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions llama-box/main.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -2342,6 +2342,8 @@ struct server_context {
batch_type = slot_type;
} else if (batch_type != slot_type) {
continue;
} else if (batch_type == 1 && llama_pooling_type(ctx) > 0) {
continue;
}

// keep only the common part
Expand Down

0 comments on commit 67215b9

Please sign in to comment.