Can we add a performing inference example? #418

bhack · 2024-12-28T13:21:21Z

Currently we have a training example with "proxy-optimal" parameters in the repo.

Can we have an inference job example with "proxy-optimal" prams?

I think that an inference job example needs to have the prams for fast reading the input and fast saving the output (same or dedicated mounted buckets).

Generally inference is one time pass so many caching params are a bit useless to reach peak performance and it will be a good case to guide the user on the best practices.

hime · 2025-01-13T21:22:00Z

Hi @bhack, can you give us more details regarding "proxy-optimal" parameter and training example? I am not aware this example is present in the repository.

bhack · 2025-01-13T21:38:24Z

I meant this is "optimized" with params for training:
https://github.com/GoogleCloudPlatform/gcs-fuse-csi-driver/blob/main/examples/pytorch/train-job-pytorch.yaml

Can we have something similar for inference?

hime · 2025-01-13T21:50:35Z

We recently updated our documentation and have a optimized example for inferencing below. Hope this helps!

https://cloud.google.com/kubernetes-engine/docs/how-to/cloud-storage-fuse-csi-driver-perf#inference-serving-example

bhack · 2025-01-13T22:00:11Z

Ok

bhack closed this as not planned Won't fix, can't repro, duplicate, stale Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can we add a performing inference example? #418

Can we add a performing inference example? #418

bhack commented Dec 28, 2024

hime commented Jan 13, 2025 •

edited

Loading

bhack commented Jan 13, 2025

hime commented Jan 13, 2025

bhack commented Jan 13, 2025

Can we add a performing inference example? #418

Can we add a performing inference example? #418

Comments

bhack commented Dec 28, 2024

hime commented Jan 13, 2025 • edited Loading

bhack commented Jan 13, 2025

hime commented Jan 13, 2025

bhack commented Jan 13, 2025

hime commented Jan 13, 2025 •

edited

Loading