Can we add a performing inference example? #418

Closed
bhack opened this issue Dec 28, 2024 · 4 comments

Comments

bhack commented Dec 28, 2024

Currently we have a training example with "proxy-optimal" parameters in the repo.

Can we have an inference job example with "proxy-optimal" params?

I think an inference job example needs params for reading the input quickly and saving the output quickly (to the same bucket or to dedicated mounted buckets).

Inference is generally a one-time pass, so many caching params do little to help reach peak performance. It would be a good case for guiding users on best practices.

Collaborator

hime commented Jan 13, 2025

Hi @bhack, can you give us more details about the "proxy-optimal" parameters and the training example? I am not aware of this example being present in the repository.

Author

bhack commented Jan 13, 2025

I meant this example, which is "optimized" with params for training:
https://github.com/GoogleCloudPlatform/gcs-fuse-csi-driver/blob/main/examples/pytorch/train-job-pytorch.yaml

Can we have something similar for inference?

Collaborator

hime commented Jan 13, 2025

We recently updated our documentation and now have an optimized example for inference, linked below. Hope this helps!

https://cloud.google.com/kubernetes-engine/docs/how-to/cloud-storage-fuse-csi-driver-perf#inference-serving-example

Author

bhack commented Jan 13, 2025

Ok

bhack closed this as not planned on Jan 13, 2025