WebGPU support for Vision Tasks #5826

pgn-dev · 2025-01-15T09:59:35Z

MediaPipe Solution (you are using)

FaceLandmarker, HandLandmarker, PoseLandmarker

Programming language

typescript

Are you willing to contribute it

None

Describe the feature and the current behaviour/state

Use WebGPU for running vision tasks. Currently the TaskRunners appears to be running WebGL with OffScreenCanvas

Will this change the current API? How?

No response

Who will benefit with this feature?

Everyone

Please specify the use cases for this feature

Should make prediction on browser-based solutions faster.

Any Other info

No response

schmidt-sebastian · 2025-01-16T15:49:55Z

Our Vision tasks do use WebGL at the moment, as our WebGPU support is currently focused on our LLM Inference API. Please do watch our announcement over the coming weeks and months in this space.

pgn-dev · 2025-01-16T16:05:06Z

Thanks for confirming @schmidt-sebastian

Do you expect a significant improvement in latency over WebGL with WebGPU?

TFLite already seems to support WebGPU. Would running the TFLite models from the task files work out of the box?

pgn-dev added the type:feature Enhancement in the New Functionality or Request for a New Solution label Jan 15, 2025

google-ml-butler bot assigned kalyan2789g Jan 15, 2025

kuaashish assigned kuaashish and unassigned kalyan2789g Jan 15, 2025

kuaashish assigned schmidt-sebastian and unassigned kuaashish Jan 16, 2025

kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label Jan 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WebGPU support for Vision Tasks #5826

WebGPU support for Vision Tasks #5826

pgn-dev commented Jan 15, 2025

schmidt-sebastian commented Jan 16, 2025

pgn-dev commented Jan 16, 2025

WebGPU support for Vision Tasks #5826

WebGPU support for Vision Tasks #5826

Comments

pgn-dev commented Jan 15, 2025

MediaPipe Solution (you are using)

Programming language

Are you willing to contribute it

Describe the feature and the current behaviour/state

Will this change the current API? How?

Who will benefit with this feature?

Please specify the use cases for this feature

Any Other info

schmidt-sebastian commented Jan 16, 2025

pgn-dev commented Jan 16, 2025