How can I adjust the token limit for input? #257

Open

al-yakubovich opened this issue Aug 29, 2024 · 3 comments

Comments

@al-yakubovich

What is the maximum input the model can handle for transcription? GPT-4 has a 128k token limit, but it is not clear how much of this is used for generating the response.

@mang0sw33t
Collaborator

mang0sw33t commented Aug 30, 2024

Transcription and response generation are separate pieces of functionality, independent of each other.

What is the maximum input the model can handle for transcription?

Transcription is done on audio files. While a prompt can be provided for transcription as well, this isn't something Transcribe does.

As you correctly noted, the input token limit for response generation depends on the model.

@al-yakubovich
Author

al-yakubovich commented Aug 31, 2024

@mang0sw33t I am interested in the token limit for response generation. How does Transcribe know the token limit of the particular model I chose? If I go with Gemini 1.5 Pro, which has a 1M token limit, will response generation take into account that the model has a 1M token limit?

Does MAX_TRANSCRIPTION_PHRASES_FOR_LLM control it? It is set to 12, so it takes the last 12 dialog parts? That seems very little; I would estimate maybe 1k tokens in total.

UPD: I set MAX_TRANSCRIPTION_PHRASES_FOR_LLM to 100 and now the model takes much more context.

@mang0sw33t
Collaborator

For LLM responses, Transcribe does not apply any automatic limit on input tokens based on the model.

MAX_TRANSCRIPTION_PHRASES_FOR_LLM is the number of previous conversation phrases, as visible in the UI, that are sent to the LLM. You are correct: setting it to a higher number will result in more tokens being sent to the LLM.
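
In other words, the limit is a phrase count, not a token count. A minimal sketch of the idea (illustrative only, not the actual Transcribe code; the conversation_phrases list and build_llm_context function are assumptions):

```python
# Illustrative sketch only, not the actual Transcribe implementation.
# Assumption: conversation phrases are stored in chronological order and the
# most recent MAX_TRANSCRIPTION_PHRASES_FOR_LLM of them are sent to the LLM.

MAX_TRANSCRIPTION_PHRASES_FOR_LLM = 12  # default mentioned in this thread

def build_llm_context(conversation_phrases: list[str],
                      max_phrases: int = MAX_TRANSCRIPTION_PHRASES_FOR_LLM) -> str:
    """Join the most recent phrases into the context sent to the LLM.

    No token counting happens here: raising max_phrases simply sends more
    conversation history, and therefore more tokens, to the LLM.
    """
    recent = conversation_phrases[-max_phrases:]
    return "\n".join(recent)
```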

Please consider a PR that makes this parameter configurable, so other users can change it easily in the parameters.yaml or override.yaml file.
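
For example (hypothetical only; the key name max_transcription_phrases_for_llm and the file layout are assumptions, not existing Transcribe options), an entry such as max_transcription_phrases_for_llm: 100 in override.yaml could be read along these lines:

```python
# Hypothetical sketch of reading the setting from override.yaml.
# The key name "max_transcription_phrases_for_llm" is an assumption,
# not an existing Transcribe configuration option.
import yaml  # PyYAML

def load_max_phrases(path: str = "override.yaml", default: int = 12) -> int:
    """Return the configured phrase limit, falling back to the current default."""
    try:
        with open(path, encoding="utf-8") as f:
            config = yaml.safe_load(f) or {}
    except FileNotFoundError:
        return default
    return int(config.get("max_transcription_phrases_for_llm", default))
```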
