We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
On latest main 94c9494, CI flow for distributed shows success https://github.com/Lightning-AI/lightning-thunder/runs/23172744261
But looking at the log, there are a few tests that have failed.
Sample
=================================== FAILURES =================================== _ CompileDDPTest.test_fsdp_grad_parity_with_without_bucketing_executor_nvfuser_bucketing_block_zero2 _ /usr/local/lib/python3.10/dist-packages/torch/testing/_internal/common_distributed.py:533: in wrapper self._join_processes(fn) /usr/local/lib/python3.10/dist-packages/torch/testing/_internal/common_distributed.py:752: in _join_processes self._check_return_codes(elapsed_time) _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Link to log: https://dev.azure.com/Lightning-AI/lightning/_build/results?buildId=196660&view=logs&j=47e66f3c-897a-5428-da11-bf5c7745762e&t=97be8351-284a-5dba-49eb-f9fe7c3ed1a2&l=811
cc @Borda
The text was updated successfully, but these errors were encountered:
Raw Log (in case the CI logs are cleaned): log.txt
Sorry, something went wrong.
so CI is fixed in #99 but probably we would also need to fix the issue... 🤔
Borda
Successfully merging a pull request may close this issue.
On latest main 94c9494, CI flow for distributed shows success https://github.com/Lightning-AI/lightning-thunder/runs/23172744261
But looking at the log, there are a few tests that have failed.
Sample
Link to log: https://dev.azure.com/Lightning-AI/lightning/_build/results?buildId=196660&view=logs&j=47e66f3c-897a-5428-da11-bf5c7745762e&t=97be8351-284a-5dba-49eb-f9fe7c3ed1a2&l=811
cc @Borda
The text was updated successfully, but these errors were encountered: