-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Good First Issue]: causal_lm/cpp must read EOS token value from rt_info of openvino_tokenizer.xml #277
Comments
.take |
Hello @anzr299, the ticket will be manually assigned to you shortly - the take feature hasn't been introduced to GenAI repo yet, but it will be today. :) Thanks for taking a look at the issue! |
Hello @anzr299, are you still working on this? Is there anything we could help you with? |
Hi, I am still working on it, was busy the past few days. |
I've created a PR #315 |
I have a small question, in |
cc @pavel-esir |
good question! No need to update beam_searcher, just initialize it with eos token you got from IR in the sample. Please look my comments in you PR. |
*Details:* Made*changes to accommodate the dynamic EOS Token *Tickets:* #277 132861
@ilya-lavrenov The issue asked to add a functionality to read EOS tokens only for beam_search_causal_lm and greedy_causal_lm but not for speculative_decoding_lm. I followed the requirement in my work but I thought it could be better if I added the functionality to speculative_decoding_lm too. Is that alright? |
Yes, let's handle this sample as well. |
Sure, I will create a new PR |
@ilya-lavrenov I've created a PR at #353 |
Context
End Of Sequence tokens are an essential part of LLM training and inference. You can find more details in this comment.
Thanks to a PR adding End Of Sequence tokens to Runtime Info openvino_tokenizers now put EOS token value into rt_info section in OpenVINO Intermediate Representation format (
.xml
file to be specific) when converting a tokenizer to OpenVINO.Since EOS has been enabled in OpenVINO, now it needs to be enabled in GenAI text_generation module.
What needs to be done?
beam_search_causal_lm.cpp
andgreedy_causal_lm.cpp
from https://github.com/openvinotoolkit/openvino.genai/tree/master/text_generation/causal_lm/cpp should read the EOS token instead of having a hardcoded value with comment// There's no way to extract special token values from the detokenizer for now
.It’s required to extract the value using
ov::Model::get_rt_info()
and use it. Remove the comments about absence of way to extract that value.Example Pull Requests
Resources
Contact points
@pavel-esir
Ticket
132861
The text was updated successfully, but these errors were encountered: