The readme does not work #27

pevnak · 2025-02-14T20:32:03Z

The readme does not work, since tokenizer_from_file was renamed to HuggingFaceTokenizers.from_file I guess.

The text was updated successfully, but these errors were encountered:

pevnak · 2025-02-14T20:56:28Z

There seems to be more bugs.
I have tried to update it to this

using JSON3, HuggingFaceTokenizers, Jjama3
const HFT =  HuggingFaceTokenizers


function Jjama3.nexttoken!(tokens, model, sampler, logits, tokenizer_for_printing)
    tokens[model.pos+1] = sampler(logits[:, end, 1])
    !isnothing(tokenizer_for_printing) && print(HFT.decode(tokenizer_for_printing, [tokens[model.pos+1]], skip_special_tokens = false))
end

config = JSON3.read(read("SmolLM2-360M-Instruct/config.json", String))
model = load_llama3_from_safetensors("SmolLM2-360M-Instruct/model.safetensors", config)
tkn = HFT.from_file(Tokenizer, "SmolLM2-360M-Instruct/tokenizer.json")

prompt = HFT.encode(tkn, "Tell me the two worst things about Python.")
generate(model, prompt.ids,
        max_new_tokens=500,
        tokenizer_for_printing=tkn,
        end_token = HFT.encode(tkn, "<|im_end|>")[end]);

But it does generates garble. Are my fixes correct?

murrellb · 2025-02-14T22:00:26Z

Thanks! Should be fixed now on main (and the readme example gives sensible output - on my machine at least).

The reason your changes didn't work is because Jjama3 needs to do a little translation from the HuggingFaceTokenizer (mostly for 1-indexing).

murrellb · 2025-02-14T22:05:03Z

Spoke slightly too soon - you also need to import HuggingFaceTokenizers to avoid a name clash.

murrellb · 2025-02-14T22:06:20Z

...which should now be fixed in the readme on main.

murrellb · 2025-02-14T22:09:03Z

pevnak · 2025-02-15T07:07:21Z

Thanks for help. It seems to work now.

pevnak closed this as completed Feb 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The readme does not work #27

The readme does not work #27

pevnak commented Feb 14, 2025

pevnak commented Feb 14, 2025

murrellb commented Feb 14, 2025

murrellb commented Feb 14, 2025

murrellb commented Feb 14, 2025

murrellb commented Feb 14, 2025

pevnak commented Feb 15, 2025

The readme does not work #27

The readme does not work #27

Comments

pevnak commented Feb 14, 2025

pevnak commented Feb 14, 2025

murrellb commented Feb 14, 2025

murrellb commented Feb 14, 2025

murrellb commented Feb 14, 2025

murrellb commented Feb 14, 2025

pevnak commented Feb 15, 2025