Replies: 3 comments
-
The 23.01 and 23.02 containers will be larger than normally unfortunately. There was a need to support both CUDA 12 and CUDA 11.x, so there are parts of the library CUDA 11.8 SDK in the container. Eventually, we will only need CUDA 12 and the container size will reduce. The overall size of the container is already a huge concern for our team and many other Triton customers. The min container is currently the smallest we can make the container as it contains the bare minimum requirements and libraries needed for Triton to function. The bulk of the container size come from dependent libraries, such as CUDA SDK and cuDNN. To your other request, it is not possible to release container more frequently than we do already. |
Beta Was this translation helpful? Give feedback.
-
This new pull i update here has no vulnaberities and patched all |
Beta Was this translation helpful? Give feedback.
-
Is your feature request related to a problem? Please describe.
Container vulnerability scans block
tritonserver
deployment in our corporate servers because the containers have several high vulnerabilities:Describe the solution you'd like
Slim down the containers, as they are extremely bloated (16 GB), orders of magnitude bigger than our in-house model hosting apps. Some ideas:
nano
is safer/cleaner thanvim
)Describe alternatives you've considered
I've tried to upgrade the packages installed in the container and did manage to remove the high vulnerability in
openssl
, but not inslurm*
(the compilation from source (see docs) did not help remove / overwrite the original package and the vulnerability). So I may end up maintainingtritonserver
containers on our own build servers, with daily and hourly builds.Additional context
I could not make the smaller (but still very bloated) "min" and "sdk" versions to work. The main "py3" containers are as large as our data science and ML modeling development containers used by data scientists, with hundreds of installed python packages (including some CUDA-enabled ones):
vs.
Beta Was this translation helpful? Give feedback.
All reactions