Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kubernetes backend: Can't run hello world #999

Open
knexator opened this issue Jan 20, 2025 · 0 comments
Open

Kubernetes backend: Can't run hello world #999

knexator opened this issue Jan 20, 2025 · 0 comments

Comments

@knexator
Copy link

❓ Questions and Help

Please note that this issue tracker is not a help form and this issue will be closed.

Before submitting, please ensure you have gone through our
documentation.

Question

Both of these work correctly:
uv run torchx run --scheduler local_cwd -cfg queue=default --workspace="" utils.echo
uv run torchx run --scheduler local_docker -cfg queue=default --workspace="" utils.echo
However, with the kubernetes backend I get:

torchx 2025-01-20 12:35:27 INFO     Launched app: kubernetes://torchx/default:echo-rlkrc4nxcxzxs
torchx 2025-01-20 12:35:27 INFO     AppStatus:
    State: UNKNOWN
    Num Restarts: -1
    Roles:
    Msg: <NONE>
    Structured Error Msg: <NONE>
    UI URL: None

I get the same UNKNOWN by doing torchx status kubernetes://torchx/default:echo-rlkrc4nxcxzxs

I have run pip install torchx[kubernetes] and done kubectl apply -f https://raw.githubusercontent.com/volcano-sh/volcano/v1.6.0/installer/volcano-development.yaml

I am using a local kind cluster, which is otherwise working correctly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant