Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Legion2 · 2024-01-20T01:18:07Z

Triton provides an extension to the standard gRPC inference api for streaming (inference.GRPCInferenceService/ModelStreamInfer), this extension is required to use vLLM backend with triton.
However currently the triton runtime adapter does not advertise the existence of this gRPC method and trying to call it results in an error (inference.GRPCInferenceService/ModelStreamInfer: UNIMPLEMENTED: Method not found or not permitted: inference.GRPCInferenceService/ModelStreamInfer)

To resolve this issue, I think the ModelStreamInfer method must be added here:

modelmesh-runtime-adapter/model-mesh-triton-adapter/server/server.go

Lines 267 to 269 in f9781d2

    
           mis := make(map[string]*mmesh.RuntimeStatusResponse_MethodInfo) 
        
           mis[tritonServiceName+"/ModelInfer"] = &mmesh.RuntimeStatusResponse_MethodInfo{IdInjectionPath: path1} 
        
           mis[tritonServiceName+"/ModelMetadata"] = &mmesh.RuntimeStatusResponse_MethodInfo{IdInjectionPath: path1}

The text was updated successfully, but these errors were encountered:

Legion2 · 2024-01-21T15:25:15Z

I have created a PR #81 and tested in our environment that the ModelStreamInfer requests work with the patch.

Signed-off-by: Leon Kiefer <leon.k97@gmx.de>

Legion2 added a commit to Legion2/modelmesh-runtime-adapter that referenced this issue Jan 21, 2024

fix: Add ModelStreamInfer to triton MethodInfos (kserve#80)

81f6d8b

Legion2 linked a pull request Jan 21, 2024 that will close this issue

feat: Add ModelStreamInfer to Triton MethodInfos #81

Open

Legion2 added a commit to Legion2/modelmesh-runtime-adapter that referenced this issue Jan 21, 2024

fix: Add ModelStreamInfer to triton MethodInfos (kserve#80)

9ce4393

Signed-off-by: Leon Kiefer <leon.k97@gmx.de>

rafvasq added the enhancement New feature or request label Jan 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Legion2 commented Jan 20, 2024

Legion2 commented Jan 21, 2024

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Comments

Legion2 commented Jan 20, 2024

Legion2 commented Jan 21, 2024