Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stingram/network issue fix #592

Draft
wants to merge 9 commits into
base: master
Choose a base branch
from
Draft

Conversation

stingram
Copy link
Collaborator

@stingram stingram commented Feb 3, 2025

Description

Please include a summary of relevant context/issue and your changes.

Tests

Please describe the tests that you ran on Cloud VM to verify changes.

Instruction and/or command lines to reproduce your tests: ...

List links for your tests (use go/shortn-gen for any internal link): ...

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run one-shot tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed.

Steven Ingram and others added 9 commits February 3, 2025 19:00
Specifically, two slices of v5p-8.

This is to ensure that both r2.6 and ptxla nightly continue to work in
multi-slice (DCN networking involved) environments.

I noticed that we are not testing llama3 in nightly so I added that too,
copying from our release tests.
…ependencies (#583)

* Run the vLLM TPU test in Docker

* Use Half-width comma

* Use "sudo docker exec $CONTAINER_NAME /bin/bash -c"

* Add HF_TOKEN

* Use a static VLLM_TPU_CONTAINER name

* Remove unused argument

* Try jq

* use escape

* Try \\"

* escape the single quote

* Try 5: escape of single quote + double escape of double quote

* Try 6: double escape of double quote

* Try 7: double quote for command inside container and escape of double quotes

* Try 8

* Try 9: double escape of double quotes

* Escape \n

* Try \\\n

* Try \\$

* Try \\\"

* Get the GCS destination path *before* constructing the command.

* Get the GCS destination path *before* constructing the command.  OUTSIDE the list.

* Add \" and debug information

* passing GCS as an environment variable

* pkill vllm

* Address comments

* Reformat using command "pre-commit run --files dags/solutions_team/configs/vllm/vllm_benchmark_config.py"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants