
Remove the fixed eot_token mechanism for SFT #927

Merged
loadams merged 4 commits into microsoft:master on Oct 30, 2024

Conversation

Xingfu-Yi
Contributor

Background

Not all pretrained LLMs use `<|endoftext|>` as the `eot_token`, so it is inappropriate to hard-code it.
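For illustration, one quick way to see that base models ship different end-of-text tokens is to inspect the tokenizer with `transformers`; the model name below is only an example:

```python
from transformers import AutoTokenizer

# Illustrative only: Llama-2's tokenizer uses "</s>" as its EOS token,
# not "<|endoftext|>", so a hardcoded value would not match this model.
tok = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
print(tok.eos_token)  # "</s>"
```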

Changes

  • Removed the hardcoded eot_token: `args.end_of_conversation_token = "<|endoftext|>"`.
  • Added a new parser argument, `eot_token`, which defaults to `<|endoftext|>`. Users can set it manually to match the pretrained model they use (see the sketch after this list).
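A minimal sketch of what the new argument might look like, assuming a standard argparse setup. Only the `eot_token` flag name and its `<|endoftext|>` default come from the PR description; the surrounding parser code, script structure, and help text are illustrative:

```python
import argparse

def parse_args():
    # Sketch of an SFT argument parser; only the --eot_token flag and its
    # default follow this PR, everything else here is illustrative.
    parser = argparse.ArgumentParser(description="SFT training (sketch)")
    parser.add_argument(
        "--eot_token",
        type=str,
        default="<|endoftext|>",
        help="End-of-conversation token; set it to match your pretrained model's tokenizer.",
    )
    return parser.parse_args()

if __name__ == "__main__":
    args = parse_args()
    # Replaces the previously hardcoded assignment:
    #   args.end_of_conversation_token = "<|endoftext|>"
    args.end_of_conversation_token = args.eot_token
```

With this in place, a model whose tokenizer ends turns with `</s>` could be trained by passing `--eot_token "</s>"` on the command line instead of editing the source.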

@Xingfu-Yi
Contributor Author

Hi @arashb, @duli2012, @awan-10, @eltonzheng,

I hope you're doing well. When you have a moment, could you kindly take a look at this PR? It has already received one approval, but it seems to be stuck and needs further reviews to move forward.

Thank you so much in advance for your time and help.

Best regards,
Yi

@loadams
Contributor

loadams commented Oct 29, 2024


Hi @Xingfu-Yi - we will work on getting this PR merged, sorry for the delay.

@loadams loadams merged commit eefb0ef into microsoft:master Oct 30, 2024
2 checks passed