I am running BentoML with gunicorn via the bentoml serve-gunicorn command, and I want to use gunicorn's max_requests and max_requests_jitter settings. Because BentoML does not support these settings, I patched gunicorn_config.py directly in the bentoml site-packages path. The settings appear to be applied. However, once the number of requests exceeds the limit, a 503 Service Unavailable response is returned. It looks like the worker process does not restart properly. Is there any way to use these settings?
Replies: 1 comment 2 replies
Hi, @jisngprk . 😃
I guess you're using BentoML-0.13.1 for your service now. If so, you could apply your own patch to the following parts.
You can also refer to this commit (1db97c3) for your purpose. I hope this helps in some way.
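For anyone who wants to see how these two settings reach gunicorn without editing files in site-packages, here is a minimal sketch based on gunicorn's documented custom application pattern. It is not BentoML's actual code and it is not a confirmed fix for the 503 responses. The names build_options, make_application, and StandaloneApplication are hypothetical, and the bind address and numbers are placeholders.

```python
def build_options(bind, workers, max_requests=0, max_requests_jitter=0):
    """Collect gunicorn settings in a plain dict.

    The keys are real gunicorn setting names. max_requests=0 (gunicorn's
    default) disables automatic worker restarts.
    """
    if max_requests < 0 or max_requests_jitter < 0:
        raise ValueError("max_requests and max_requests_jitter must be >= 0")
    return {
        "bind": bind,
        "workers": workers,
        # Restart a worker after it has handled this many requests.
        "max_requests": max_requests,
        # Add a random 0..jitter to the limit per worker so that
        # workers do not all restart at the same moment.
        "max_requests_jitter": max_requests_jitter,
    }


def make_application(wsgi_app, options):
    """Wrap a WSGI app in a gunicorn application (hypothetical helper)."""
    # Imported here so build_options can be used without gunicorn installed.
    from gunicorn.app.base import BaseApplication

    class StandaloneApplication(BaseApplication):
        def __init__(self, app, opts=None):
            self.options = opts or {}
            self.application = app
            super().__init__()

        def load_config(self):
            # Copy only the keys gunicorn recognises into its config.
            for key, value in self.options.items():
                if key in self.cfg.settings and value is not None:
                    self.cfg.set(key.lower(), value)

        def load(self):
            return self.application

    return StandaloneApplication(wsgi_app, options)


# Placeholder values for illustration only.
options = build_options("0.0.0.0:5000", workers=2,
                        max_requests=1000, max_requests_jitter=50)
# make_application(my_wsgi_app, options).run()  # my_wsgi_app is hypothetical
```

With only one or two workers, a non-zero max_requests_jitter matters more, because workers that hit the limit at the same time leave nothing to serve requests while they restart. Whether that explains the 503 here is an assumption, not something verified against BentoML-0.13.1.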