[Good First Issue]: causal_lm/cpp must read EOS token value from rt_info of openvino_tokenizer.xml #277

p-wysocki · 2024-03-01T12:45:35Z

Context

End Of Sequence tokens are an essential part of LLM training and inference. You can find more details in this comment.

Thanks to a PR adding End Of Sequence tokens to Runtime Info openvino_tokenizers now put EOS token value into rt_info section in OpenVINO Intermediate Representation format (.xml file to be specific) when converting a tokenizer to OpenVINO.

Since EOS has been enabled in OpenVINO, now it needs to be enabled in GenAI text_generation module.

What needs to be done?

beam_search_causal_lm.cpp and greedy_causal_lm.cpp from https://github.com/openvinotoolkit/openvino.genai/tree/master/text_generation/causal_lm/cpp should read the EOS token instead of having a hardcoded value with comment // There's no way to extract special token values from the detokenizer for now.

It’s required to extract the value using ov::Model::get_rt_info() and use it. Remove the comments about absence of way to extract that value.

Example Pull Requests

Resources

Contribution guide - start here!
Intel DevHub Discord channel - engage in discussions, ask questions and talk to OpenVINO developers

Contact points

@pavel-esir

Ticket

132861

The text was updated successfully, but these errors were encountered:

anzr299 · 2024-03-02T19:22:18Z

.take

p-wysocki · 2024-03-04T07:04:23Z

Hello @anzr299, the ticket will be manually assigned to you shortly - the take feature hasn't been introduced to GenAI repo yet, but it will be today. :)

Thanks for taking a look at the issue!

p-wysocki · 2024-03-12T09:57:21Z

Hello @anzr299, are you still working on this? Is there anything we could help you with?

anzr299 · 2024-03-16T17:14:29Z

Hi, I am still working on it, was busy the past few days.

anzr299 · 2024-03-20T17:41:31Z

I've created a PR #315

anzr299 · 2024-03-20T17:43:38Z

I have a small question, in beam_search_casual_lm.cpp, the following lines check for an empty token and not the hardcoded EOS token. Should I change this as well to check for the EOS token?
if (next_tokens.empty()) { break; }
(lines 60-62)

p-wysocki · 2024-03-21T07:52:47Z

cc @pavel-esir

pavel-esir · 2024-03-22T12:00:53Z

I have a small question, in beam_search_casual_lm.cpp, the following lines check for an empty token and not the hardcoded EOS token. Should I change this as well to check for the EOS token? if (next_tokens.empty()) { break; } (lines 60-62)

good question! No need to update beam_searcher, just initialize it with eos token you got from IR in the sample. Please look my comments in you PR.

*Details:* Made*changes to accommodate the dynamic EOS Token *Tickets:* #277 132861

anzr299 · 2024-04-09T17:48:31Z

@ilya-lavrenov The issue asked to add a functionality to read EOS tokens only for beam_search_causal_lm and greedy_causal_lm but not for speculative_decoding_lm. I followed the requirement in my work but I thought it could be better if I added the functionality to speculative_decoding_lm too. Is that alright?

ilya-lavrenov · 2024-04-09T17:52:08Z

I followed the requirement in my work but I thought it could be better if I added the functionality to speculative_decoding_lm too. Is that alright?

Yes, let's handle this sample as well.
The initial requirements were added before the speculative_decoding_lm sample is implemented.

anzr299 · 2024-04-09T17:53:09Z

I followed the requirement in my work but I thought it could be better if I added the functionality to speculative_decoding_lm too. Is that alright?

Yes, let's handle this sample as well. The initial requirements were added before the speculative_decoding_lm sample is implemented.

Sure, I will create a new PR

anzr299 · 2024-04-09T18:44:24Z

@ilya-lavrenov I've created a PR at #353

…g_lm (#353) Extension to issue #277, Added the functionality to read EOS token from model runtime information in the speculative_decoding_lm.

p-wysocki added the good first issue Good for newcomers label Mar 1, 2024

github-project-automation bot added this to Good first issues Mar 1, 2024

github-project-automation bot moved this to Contributors Needed in Good first issues Mar 1, 2024

p-wysocki moved this from Contributors Needed to Assigned in Good first issues Mar 4, 2024

Wovchena assigned anzr299 Mar 4, 2024

anzr299 mentioned this issue Mar 20, 2024

Update greedy_causal_lm.cpp to read EOS Token #315

Merged

p-wysocki linked a pull request Mar 21, 2024 that will close this issue

Update greedy_causal_lm.cpp to read EOS Token #315

Merged

p-wysocki moved this from Assigned to In Review in Good first issues Mar 21, 2024

ilya-lavrenov closed this as completed in #315 Apr 9, 2024

ilya-lavrenov pushed a commit that referenced this issue Apr 9, 2024

Update greedy_causal_lm.cpp to read EOS Token (#315)

72caf05

*Details:* Made*changes to accommodate the dynamic EOS Token *Tickets:* #277 132861

github-project-automation bot moved this from In Review to Closed in Good first issues Apr 9, 2024

anzr299 mentioned this issue Apr 9, 2024

Read EOS token from model runtime information for speculative_decoding_lm #353

Merged

anzr299 mentioned this issue Apr 17, 2024

Make NNCF common accuracy aware training code pass mypy checks openvinotoolkit/nncf#2637

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Good First Issue]: causal_lm/cpp must read EOS token value from rt_info of openvino_tokenizer.xml #277

[Good First Issue]: causal_lm/cpp must read EOS token value from rt_info of openvino_tokenizer.xml #277

p-wysocki commented Mar 1, 2024 •

edited by Wovchena

Loading

anzr299 commented Mar 2, 2024

p-wysocki commented Mar 4, 2024

p-wysocki commented Mar 12, 2024

anzr299 commented Mar 16, 2024

anzr299 commented Mar 20, 2024

anzr299 commented Mar 20, 2024 •

edited

Loading

p-wysocki commented Mar 21, 2024

pavel-esir commented Mar 22, 2024

anzr299 commented Apr 9, 2024

ilya-lavrenov commented Apr 9, 2024

anzr299 commented Apr 9, 2024

anzr299 commented Apr 9, 2024

[Good First Issue]: causal_lm/cpp must read EOS token value from rt_info of openvino_tokenizer.xml #277

[Good First Issue]: causal_lm/cpp must read EOS token value from rt_info of openvino_tokenizer.xml #277

Comments

p-wysocki commented Mar 1, 2024 • edited by Wovchena Loading

Context

What needs to be done?

Example Pull Requests

Resources

Contact points

Ticket

anzr299 commented Mar 2, 2024

p-wysocki commented Mar 4, 2024

p-wysocki commented Mar 12, 2024

anzr299 commented Mar 16, 2024

anzr299 commented Mar 20, 2024

anzr299 commented Mar 20, 2024 • edited Loading

p-wysocki commented Mar 21, 2024

pavel-esir commented Mar 22, 2024

anzr299 commented Apr 9, 2024

ilya-lavrenov commented Apr 9, 2024

anzr299 commented Apr 9, 2024

anzr299 commented Apr 9, 2024

p-wysocki commented Mar 1, 2024 •

edited by Wovchena

Loading

anzr299 commented Mar 20, 2024 •

edited

Loading