Skip to content
Discussion options

You must be logged in to vote

Hi @JesusMF23 ,

We have LLM stats for each call, see here how it is used:
https://github.com/NVIDIA/NeMo-Guardrails/blob/cb07be6e04f345905d6cef03daf0c7ea8e11a372/nemoguardrails/rails/llm/llmrails.py#L368

At this moment, we reset the stats at the beginning of each generate_async call:
https://github.com/NVIDIA/NeMo-Guardrails/blob/cb07be6e04f345905d6cef03daf0c7ea8e11a372/nemoguardrails/rails/llm/llmrails.py#L336

But you can easily get the stats after each call and monitor the total usage for an app. Is this what you are looking for?

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@JesusMF23
Comment options

@trebedea
Comment options

@JesusMF23
Comment options

@drazvan
Comment options

drazvan Sep 19, 2023
Collaborator

Answer selected by JesusMF23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants