🐛 Bug: Investigate "No space left on device" in our pipelines #1067

DhanshreeA · 2024-03-13T06:39:18Z

Describe the bug.

A number of our model pipelines, typically in the "upload model to dockerhub" stage fail because of "No space left on device".
Exhibits:

Describe the steps to reproduce the behavior

Go to any one of the jobs above and re run. I tried to run these jobs when no other jobs were running on our runners and yet they failed.

Expected behavior.

These jobs should pass.

Screenshots.

No response

Operating environment

Runner OS

Additional context

Potentially useful resources:

GemmaTuron · 2024-03-13T12:29:44Z

Hi @DhanshreeA
I am highlighting this model: https://github.com/ersilia-os/eos9taz/actions as I need it for chemsampler but it is not passing either :)

DhanshreeA · 2024-03-14T08:13:49Z

@GemmaTuron, I'm aware of this (ersilia-os/eos9taz#11) For now, I've pushed it manually to unblock you. However there's a related issue that I'm on top of: #1068 which I opened with specifically this model in mind.

DhanshreeA · 2024-03-19T05:32:28Z

Root Cause: With GitHub hosted runners, one of the guarantees is getting software updates. This could mean one or all of the following frequently: runner updates, runner provisioner updates, patches to the OS on the runner, the software bundled in the OS, etc. Every such update is quite likely to eat into the disk space. For example, here's the list of all the tools installed on the runner OS, that we do not need for our builds. So far there is no straightforward solution other than an aggressive disk clean up. This has been implemented at the level of the eos-template repository. While this works for now, it is not guaranteed that this issue will not come up again.

DhanshreeA added the bug Something isn't working label Mar 13, 2024

DhanshreeA self-assigned this Mar 13, 2024

DhanshreeA closed this as completed Mar 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🐛 Bug: Investigate "No space left on device" in our pipelines #1067

🐛 Bug: Investigate "No space left on device" in our pipelines #1067

DhanshreeA commented Mar 13, 2024 •

edited

Loading

GemmaTuron commented Mar 13, 2024

DhanshreeA commented Mar 14, 2024 •

edited

Loading

DhanshreeA commented Mar 19, 2024

🐛 Bug: Investigate "No space left on device" in our pipelines #1067

🐛 Bug: Investigate "No space left on device" in our pipelines #1067

Comments

DhanshreeA commented Mar 13, 2024 • edited Loading

Describe the bug.

Describe the steps to reproduce the behavior

Expected behavior.

Screenshots.

Operating environment

Additional context

GemmaTuron commented Mar 13, 2024

DhanshreeA commented Mar 14, 2024 • edited Loading

DhanshreeA commented Mar 19, 2024

DhanshreeA commented Mar 13, 2024 •

edited

Loading

DhanshreeA commented Mar 14, 2024 •

edited

Loading