How to deploy Triton on Vertex AI? #7584
btseitlinatwork asked this question in Q&A (unanswered, 0 replies)
Hello,
I am excited to use Triton with Google Cloud Vertex AI, but I can't find a way to get a deployment working.
I followed these tutorials:
None of them produced a working deployment. I am able to build the container (or rather, rename the NGC container and upload it to Google Artifact Registry), but deploying the model to a Vertex endpoint fails with a non-descriptive error:
It seems the container is receiving an unexpected command, but I can't tell what it was because there are no logs.
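For context, the upload-and-deploy step I am attempting can be sketched roughly as follows with the Vertex AI Python SDK (`google-cloud-aiplatform`). This is only a sketch under assumptions, not a known-good configuration: the project, region, image URI, bucket path, model name, and machine type are hypothetical placeholders, and passing `--vertex-ai-default-model` through the container args is my assumption about how Triton's Vertex AI mode should be configured. `build_upload_kwargs` and `upload_and_deploy` are illustrative helper names, not part of any library.

```python
from typing import Any, Dict, List, Optional


def build_upload_kwargs(
    display_name: str,
    image_uri: str,
    model_repository_uri: str,
    container_args: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Collect keyword arguments for aiplatform.Model.upload().

    Kept separate from the network call so the configuration can be
    printed and inspected before anything is sent to Vertex AI.
    """
    # Vertex AI expects model artifacts in Cloud Storage.
    if not model_repository_uri.startswith("gs://"):
        raise ValueError("model repository must be a gs:// URI")

    kwargs: Dict[str, Any] = {
        "display_name": display_name,
        # Image previously pushed to Artifact Registry (hypothetical URI).
        "serving_container_image_uri": image_uri,
        # Triton model repository in Cloud Storage (hypothetical path).
        "artifact_uri": model_repository_uri,
    }
    # Only set args when given. serving_container_command is deliberately
    # left unset so the image's own entrypoint is not overridden.
    if container_args:
        kwargs["serving_container_args"] = list(container_args)
    return kwargs


def upload_and_deploy(
    project: str,
    location: str,
    upload_kwargs: Dict[str, Any],
    machine_type: str = "n1-standard-4",  # assumption: CPU-only test deployment
):
    """Upload the model and deploy it to a new endpoint (makes network calls)."""
    # Imported here so the helper above works without the SDK installed.
    from google.cloud import aiplatform

    aiplatform.init(project=project, location=location)
    model = aiplatform.Model.upload(**upload_kwargs)
    return model.deploy(machine_type=machine_type)
```

Usage would be along the lines of `upload_and_deploy("my-project", "us-central1", build_upload_kwargs("triton-demo", "us-central1-docker.pkg.dev/my-project/my-repo/tritonserver:my-tag", "gs://my-bucket/model_repository", ["--vertex-ai-default-model=my_model"]))`, where every value is a placeholder.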
Does anyone have an up-to-date guide on how to deploy to Vertex?
Also, I see that all questions and discussions about Triton on Vertex are from 2022 or earlier. Should I look into deploying on GKE instead, or at another solution altogether?