Token usage always shows 0.00 #1128
-
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 1 reply
-
I manually update the logger. Here is some additional information. Does this look alright? [gpu=0] Decode batch. #running-req: 1, #token: 665, total tokens: 4144976, kv pool tokens: 4016566, tree cache: 127745, token usage: 0.00, gen throughput (token/s): 178.62, #queue-req: 0
|
Beta Was this translation helpful? Give feedback.
-
cc @hnyls2002 |
Beta Was this translation helpful? Give feedback.
-
You submit the requests too slow. Use parallel threads to send the requests. |
Beta Was this translation helpful? Give feedback.
You submit the requests too slow. Use parallel threads to send the requests.