Replies: 3 comments
-
@GuanLuo @tanmayv25 Do you know if this is somehow doable? Could we just have the model frameworks load the binaries directly at runtime?
-
Yes. It is already supported. See the documentation on this feature here: https://github.com/triton-inference-server/server/blob/main/docs/protocol/extension_model_repository.md#load
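
For reference, a minimal sketch of calling that load endpoint from Python. The endpoint, port, and model name here are assumptions for illustration, and the server must be running with `--model-control-mode=explicit` for the repository load API to be available:

```python
import requests

# Assumption: Triton's HTTP endpoint is on localhost:8000 and the server
# was started with --model-control-mode=explicit.
TRITON_URL = "http://localhost:8000"
MODEL_NAME = "my_model"  # hypothetical model name

# POST to the model repository load endpoint; an empty body loads the
# model from the files already present in the model repository.
resp = requests.post(f"{TRITON_URL}/v2/repository/models/{MODEL_NAME}/load")
resp.raise_for_status()  # a 200 response means the model was (re)loaded
print(f"{MODEL_NAME} loaded")
```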
-
If you are using the Python clients, look at the `config` and `files` options: https://github.com/triton-inference-server/client/blob/main/src/python/library/tritonclient/grpc/__init__.py#L656
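
As a rough sketch of how those options can be used to load a model from in-memory bytes (the endpoint, model name, backend, and tensor definitions below are assumptions for an example ONNX model):

```python
import tritonclient.grpc as grpcclient

client = grpcclient.InferenceServerClient(url="localhost:8001")

# The bytes can come from anywhere (network, decryption, etc.); a file
# read is used here only to produce some model bytes for the example.
with open("model.onnx", "rb") as f:
    model_bytes = f.read()

# Minimal model configuration for the override load; field values are
# assumptions for this example.
config = """
{
  "name": "my_model",
  "backend": "onnxruntime",
  "max_batch_size": 8,
  "input": [{"name": "INPUT0", "data_type": "TYPE_FP32", "dims": [16]}],
  "output": [{"name": "OUTPUT0", "data_type": "TYPE_FP32", "dims": [16]}]
}
"""

# The "file:1/model.onnx" key places the bytes at version 1 of the
# override model directory that Triton assembles for this load.
client.load_model(
    "my_model",
    config=config,
    files={"file:1/model.onnx": model_bytes},
)
```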
-
It would be beneficial if there were a way to inject/pass through the model binary via the API, rather than the current single path of a model file on disk. For example, in an on-premise use case that requires model encryption/protection, users could implement their own mechanism to protect the model, inject it into Triton, and unload it again without using the filesystem as a medium.
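
For what it's worth, the `config`/`files` override described above appears to get close to this already: the plaintext model bytes never need to touch the filesystem. A hedged sketch of that flow, assuming Fernet encryption and the same hypothetical model names as in the earlier example (in production the key would come from a secrets manager and the artifact would ship pre-encrypted; both steps are inlined here so the sketch is self-contained):

```python
from cryptography.fernet import Fernet
import tritonclient.grpc as grpcclient

# Stand-in for the shipped, protected artifact: encrypt then decrypt a
# local model so the example runs end to end.
key = Fernet.generate_key()
fernet = Fernet(key)
with open("model.onnx", "rb") as f:
    encrypted = fernet.encrypt(f.read())

model_bytes = fernet.decrypt(encrypted)  # plaintext exists only in memory

client = grpcclient.InferenceServerClient(url="localhost:8001")
client.load_model(
    "my_model",  # hypothetical; a fuller config JSON is shown in the sketch above
    config='{"name": "my_model", "backend": "onnxruntime", "max_batch_size": 8}',
    files={"file:1/model.onnx": model_bytes},
)
client.unload_model("my_model")  # and it can be unloaded without touching disk
```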